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PREFACE 


The  beginning  of  the  theory  of  filtering  is  generally  rather  closely  associated  with  the  development  of  the  Wiener 
filter,  and,  as  so  often  happens,  this  development  was  motivated  by  applied  issues,  namely,  the  development  of  improved 
fire  control  techniques  in  World  War  11.  In  this  application  the  (radar)  sensors  have  stochastic  (noise)  inputs.  Additionally, 
the  objects  (targets)  being  tracked  were  buffeted  by  stochastic  (wind)  disturbances.  The  criterion  for  optimum  (Wiener) 
filter  design  was  the  minimization  of  the  time  average  of  the  error  squared,  where  the  error  was  defined  as  the  difference 
between  the  true  target  position  as  a  function  of  time  and  the  best  or  optimum  estimate  of  the  target  position  generated 
as  a  function  of  time  by  the  Wiener  filter.  The  development  of  the  Wiener  filter  was  based  on  the  solution  of  the  Wiener 
Hopf  integral  equation  for  the  case  of  stationary  stochastic  processes  whose  descriptive  statistical  parameters  were 
invariant  with  time.  The  Wiener  Hopf  integral  equation  was  formulated  as  an  integral  equation  for  the  weighting 
function  of  the  linear  time  variant  system  which  would  minimize  the  ensemble  average  of  the  error  squared  in  the  case 
of  non  stationary  stochastic  processes,  that  is,  stochastic  processes  whose  descriptive  statistical  parameters  were  functions 
of  time.  By  limiting  the  stochastic  processes  under  consideration  to  stationary  stochastic  processes  it  was  possible  to 
take  the  Fourier  transform  of  the  Wiener  Hopf  integral  equation,  which  was  formulated  in  the  time  domain.  Therefore, 
it  was  possible  to  go  from  the  time  domain  to  the  frequency  domain,  in  the  case  of  stationary  stochastic  processes,  and 
more  readily  solve  the  Fourier  transformed  Wiener  Hopf  integral  equation  for  the  transfer  function  of  the  (time 
invariant)  Wiener  filter. 

Now  while  the  development  of  the  optimum  filter  for  stationary  stochastic  processes  is  useful  in  many  applied 
instances  there  arc  many  applied  instances  where  nonstationary  stochastic  processes  are  involved.  Thus  the  development 
of  the  optimum  filter  in  this  case  was  also  essential. 

In  the  period  after  World  War  H  until  the  late  1950's  a  great  deal  of  effort  was  expended  on  the  international 
scene  to  developing  the  optimum  filter  in  this  case.  While  there  was  much  in  the  way  of  interesting  research  results 
published  in  the  technical  literature,  it  remained  for  the  development  of  the  Kalman  filter,  presented  in  published 
papers  and  reports  in  and  around  ll>60,  for  an  effective  solution  to  this  problem  in  the  continuing  time  case  and  a 
computationally  efficient  solution  in  the  much  more  pervasive  discrete  time  case.  The  discrete  time  case  is  much  more 
pervasive  today,  of  course,  because  of  the  implementation  of  virtually  all  Kalman  filter  applications  by  digital  computers. 
The  continuous  time  Kalman  filter  may  be  viewed  as  being  generated  by  the  dlfferentation  of  the  Wiener  Hopf  integral 
equation.  What  results  in  the  process,  very  simply,  is  the  structure  of  the  Kalman  filter,  and  that  is  what  is  really  desired 
in  any  event,  namely,  the  implementation  of  the  Kalman  filter  in  the  continuous  time  case.  In  the  case  of  the  Kalman 
filter  in  the  discrete  time  case  this  is  very  simply  generated  by  noting  that  the  minimum  variance  filter,  that  is  the  fitter 
which  minimizes  the  ensemble  average  of  the  error  squared  at  discrete  times,  is  the  conditional  expectation  of  the  system 
state  at  any  particular  discrete  time  conditioned  on  the  specific  observations  or  measurements  which  have  resulted  at  all 
discrete  times  up  to  and  including  the  discrete  time  at  which  the  minimum  variance  estimate  is  being  determined  by  the 
Kalman  filter.  By  utilizing  certain  rather  straightforward  and  basic  expressions  this  Kalman  filter  estimate  can  be 
generated  rather  readily  in  a  convenient  expression  form  as  a  linear  function  of  the  Kalman  filter  estimate  of  the  system 
state  at  the  just  preceding  discrete  time,  for  instance,  plus  the  noisy  observation  or  measurement  of  the  system  state 
(vector)  at  the  present  time. 

The  next  area  for  which  the  theoretical  foundation  had  to  be  developed  was  that  of  filters  for  nonlinear  dynamic 
systems.  Regardless  of  whether  a  dynamic  systems  of  interest  is  linear  or  nonlinear  the  minimum  variance  estimator  is 
the  conditional  expectation  of  the  dynamic  system  state  (vector),  conditioned  on  the  previous  observations  or 
measurements  of  the  noisy  system  state  measurement  vector.  In  the  caw  of  the  Kalman  filter  this  is  rather  easily 
generated  as  a  convenient  expression  at  noted  above.  In  the  aw  of  nonlinear  dynamic  systems  this  is  an  extremely 
difficult  task,  in  general,  and,  with  rare  exceptions,  has  to  be  dealt  with  through  some  approximate  and  effective 
linearization  technique.  Specifically,  in  order  to  generate  the  conditional  expectation,  an  expression  for  the  conditional 
probability  density  function  has  to  be  generated.  This  it  done  through  the  use  of  the  I  to  calculus  which,  basically, 
correctly  recognizes  terms  which  appear  to  be  second  order  tetms  as  first  order  terms  in  going  to  limiting  processes  in 
the  process  of  developing  the  equation  which  describes  the  evolution  of  the  conditional  probability  density  function, 
and  thus  generating  this  equation  correctly.  This  equation,  which  is  the  equation  for  the  evolution  of  the  conditional 
probability  density  function,  is  known  as  the  Fokker  Planck  equation.  Sln«  it  is  a  nonlinear  partial  differential 
equation  of  high  older  if  the  system  state  vector  is  of  high  order  its  solution  is  a  matter  of  very  considerable  computa* 
ttonal  complexity,  and  is  very  difficult  to  implement  in  any  given  system  application,  as  a  rule.  This  then  forces  the 
generation  of  effective  approximate  linearization  techniques  which  are  quite  capable  of  providing  adequate  accurate 
filters.  Such  approximate  fitters  have  been  generated  and  ate  known  as  extended  Kalman  filters,  higher  order  extended 
Kalman  filters,  or  are  described  by  other  terms,  depending  on  the  approximation  technique. 
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Once  this  theoretical  foundation  for  Kalman  filtering  techniques,  nonlinear  filtering  techniques  and  effective 
approximation  techniques  therefore  was  put  in  place,  the  problem  became  that  of  developing  system  application 
techniques,  dealing  effectively  with  computational  issues  and  techniques,  and  it  is  the  intent  of  this  NATO  AGARDo- 
graph  to  treat  effectively  all  of  the  above  issues  and  techniques  with  particular  emphasis  on  application  to  complex 
problems  and  systems  of  paramount  interest  to  the  NATO  community.  As  such  this  NATO  AGARDograph  represents 
a  unique  document  on  the  international  scene. 

A  great  debt  of  gratitude  is  owed  by  the  editor  to  many  individuals.  First  of  all  the  co-authors  of  this  rather 
prodigious  undertaking  all  deserve  an  immeasureable  vote  of  thanks  and  gratitude  for  their  formidable  contributions 
to  this  volume.  Next  the  editor  would  like  to  express  his  gratitude  for  the  support  and  encouragement  of  his  fellow 
NATO  AGARD  Panel  members  in  addition  to  the  Panel  Chairman,  Geoffrey  Howell,  and  Deputy  Panel  Chairman, 

Ronald  Vaughn.  Additionally,  the  continual  first  class  support  of  Bernard  Heliot,  the  Panel  Executive,  on  this  rather 
elaborately  complex  undertaking  is  greatiy  appreciated.  Finally,  the  Editor’s  secretary,  Bernice  Roos,  was  of  indispensable 
help  and  support  on  this  undertaking. 


CORNELIUS  T.LEONDES 
Editor 
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SUMMARY 

After  a  historical  review  of  the  development  of  estimation  theory,  an  introduction  to  non¬ 
linear  filtering  theory  is  presented  by  means  of  a  deductive  approach. 

First  the  so-called  general  estimation  problem  is  defined  which  is  concerned  with  the 
extraction  of  useful  information  from  noisy  measurements.  Having  then  introduced  nonlinear 
stochastic  dynamical  models  for  the  signal  process,  the  exact  mathematical  solution  of 
the  general  estimation  problem  is  outlined  which  yields  the  Kushner-Stratohovich  equation 
respectively  the  Bayesian  recursive  estimator.  From  these  equations  the  practical  nonlinear 
filter  approximations  are  derived  in  a  deductive  way.  Most  important  are  the  local  approxi¬ 
mate  filters  of  first  order  (types  extended  Kalman  Filter)  and  second  order  filters.  In 
addition,  there  are  global  approximate  filters  (Bayes-lav  calculators)  available.  Several 
approximation  methods  for  these  estimators,  such  as  orthogonal  series  expansion,  Gaussian 
sums,  point  masses  and  splines  are  briefly  discussed. 


1 .  INTRODUCTION 

Nonlinear  filtering  theory  has  been  developed  since  the  beginning  of  the  sixties,  nearly 
during  the  same  time  as  the  theory  of  optimal  linear  filters  such  as  Wiener  filters  and 
Kalman-Bucy  filters. 

Nonlinear  filtering  is  the  theoretical  as  well  as  practicsl  solution  of  the  so-called 
general  estimation  problem  which  is  concerned  with  the  extraction  of  useful  information 
from  noisy  measurements  of  certain  signals. 

This  general  estimation  problem  which  will  be  discussed  in  more  detail  in  the  next  para¬ 
graph,  covers  many  problems  in  the  field  of  communication-  and  control  engineering  a vvl id¬ 
eations  such  as 

-  suppression  of  noise  by  filtering 

-  estimation  of  states  of  linear  or  nonlinear  stochastic  dynamical  systems 

-  parameter  estimation  for  statical  or  dynamical  systems. 

Interest  in  particular  modes  of  the  general  estimation  problem,  especially  in  celestial 
K.-vumiv**  dates  beck  e  long  timei  probably  even  to  the  work  of  Claudius  Ptolemaeue  /I/ 
n  the  s»  ;ond  century.  He  assumed  a  geocentric  model  structure  of  the  unlverso  which  re¬ 
quired  the  assumption  of  epicycloids  for  the  planets'  orbital  motion.  Having  used  measured 
data  from  observations,  Ftolemaeus  evaluated  the  orbital  parameters  of  the  planet  Mere, 
which  allowed  the  prediction  of  the  planet's  position  with  reasonable  accuracy. 

In  the  astronomy  of  modern  times  it  has  been  Legendre  I'll  and  independently  Causa  / J/, 
who  developed  in  179$  the  method  of  least  squares  for  the  minimisation  of  the  observation 
errors  by  determining  the  orbit  of  celestial  bodies.  The  least  squares  or  regression  ana¬ 
lysis  technique  msv  be  regarded  as  the  fundamentals  of  todays  optimal  estimation  and  fil¬ 
tering  algorithms.  Gauss  already  recognised  the  possibility  of  e  dual  approach  to  re¬ 
gression  analysis! 

It  may  be  considered  ee 

I)  e  deterministic  optimisation  problem  (minimising  the  sum  of  ths  squared  errors) 

II)  e  stochastic  estimation  problem  (evaluation  of  the  most  probable  parameter  estimates 
which  implies  calculation  of  a  probability  density  function  end  determination  of  its 
maximum) . 

AS  reported  by  Oeutech  /«/  end  Sorenson  /5 /,  Gauss  anticipated  fay  that  to  ease  extent  themexi- 
•tim-likelihood  estimation  technique  suggested  by  R. A. Fisher  fit  in  1912. 

In  the  forties  of  our  century  the  least  squares  principle  was  applied  by  Kolmogorov  fl/ 
end  Wiener  /%/  to  the  problem  of  separating  wido-band  noise  and  signal  processes,  i.  e. 
to  the  filtering  problem,  In  the  sequel  the  Wiener-Kolmogorov  filter  or  Wiener  filter 
which  may  be  realised  as  an  electrical  network  baa  found  many  applications  in  communica¬ 
tion  an?  in  waring . 

The  extension  of  the  Wiener-Kolmogorov  theory  by  Kalman  19/  19(0,  and  Kalman  and  Bucy  /10/ 
19(1  to  linear  multivariable  lnetationary  processes  by  introducing  ths  state  apace  con¬ 
cept,  wee  an  Important  improvement  end  has  opened  e  wide  field  of  applications  in  ths 
Subsequent  years  /II/.  this  is  because  the  Kalman  filter  is  e  recursive  algorithm  which  is 
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easily  to  be  implemented  on  a  digital  computer.  A  survey  of  the  history  of  early  aero¬ 
space  Kalman  filter  applications  is  given  in  /12/. 

As  mentioned  before,  the  theory  of  nonlinear  filtering  has  been  developed  independently 
but  nearly  in  parallel  to  the  linear  theory.  This  is  probably  due  to  the  fact  that  the 
approach  to  either  of  the  theories  is  quite  different. 

For  the  understanding  of  the  (linear)  Kalman  filter  on  the  one  hand,  a  least  squares 
approach  may  be  used  where  no  deeper  insight  into  probability  theory  is  required  (see 
e.  g.  /1 3/ ,  /1 4 /) .  The  nonlinear  filtering  theory  on  the  other  hand  is  usually  based  on 
a  probabilistic  approach;  the  solution  of  the  nonlinear  filtering  problem  requires  then 
the  calculation  of  the  probability  density  of  the  state  conditioned  on  all  available  ob¬ 
servations  and  the  initial  probability  density  function.  The  mathematical  tools  for  tack¬ 
ling  this  problem  are  partial  differential  equations  and  stochastic  integrals. 

Advances  in  the  area  of  nonlinear  filtering  are  due  to  Stratonovich  /15/,  Kushner  /16/, 
Bucy  /1 7/ ,  Wonham  /1 8/  and  many  others.  However,  it  should  be  mentioned  that  the  roots 
of  this  theory  may  be  traced  back  to  the  beginning  of  the  twentieth  century  and  the 
study  and  mathematical  description  of  diffusion  processes  (Einstein  /1 9/) .  This  is  because 
the  differential  equation  of  a  diffusion  process  and  the  mathematical  model  of  a  signal 
process  affected  by  white  noise  (which  is  the  basic  model  for  all  optimal  nonlinear  esti¬ 
mation  algorithms)  are  equivalent. 

After  this  historical  review  of  the  development  of  estimation  theory  we  will  give  an 
outlook  of  the  organisation  of  this  contribution. 

In  the  following  parsgrsph  we  will  first  introduce  the  general  estimation  problem  and 
its  mathematical  solution  which  will  lead  us  to  the  Fokker-Planck  respectively  Kushner- 
Stratonovich  equation  in  the  time-continuous  case  and  the  Bayesian  recursive  estimation 
equations  in  the  time-discrete  case. 

Thsn  we  will  discuss  locsl  approximations  for  nonlinear  estimators  i.  e,  easy  implemen- 
tabie  practical  filters  such  «s  first  order  filters  (type  extended  Kalman  filter  or  ite¬ 
rated  extended  Kalman  filters)  and  higher  order  filters. 

Finally  we  will  shortly  outline  some  possibilities  of  global  approximations  for  discrete- 
time  nonlinear  filters  (Bayes-law  calculators)  such  as  orthogonal  series  expansion,  Gaus¬ 
sian  sumn, point  masses, and  splines. 


2.  TUB  SOLUTION  OF  THE  NONLINEAR  FILTERING  PROBLEM 
2.1  The  gwutra.V  cultivation  problem 

Given  s  signal  *(t)  which  is  disturbed  by  additive  noise  n(t).  The  eum  of  both  signals 

y(t)  •  s(t)  ♦  n(t)  (1) 

is  observed  (measured),  in  the  meet  simple  case,  the  general  estimation  problem  conaiets 
of  the  evaluation  of  the  signal  e(tj)  by  processing  of  the  Information  contained  in  all 
available  measurements 

Yt  **  *...»y(t)}  ««  [y(t),  (2) 

through  an  estimator.  Thus  the  desired  ideal  output  of  the  estimator  (Flg.l) 

fatt)«  s(t,j  ,  t,|  t  (3) 

will  in  reality  be  an  estimate  of  this  value,  denoted  by 

fa(t)-i(t,|t) «  *  i(ti|Yt) .  (4) 


This  estimate  has  to  be  optimal  in  some  sense  which  leads  to  the  interpretation  of  the 
general  estimation  problem  as  an  optimisation  problem.  A  coat  functional  J(t)  which  de¬ 
pends  on  the  estimation  error 

£<*!)»•  s(t,»  -  e(t,lt)  (5) 

has  to  be  defined  end  minimised  with  respect  to  this  error. 

As  shown  in  Flg.l,  we  have  three  different  types  of  estimation  (prediction,  filtering 
and  smoothing)  depending  on  tbs  instant  tj  relative  to  the  present  time  t. 

Moreover,  we  distinguish  three  modes  of  estimation  which  are  related  to  uw  kind  of  sig¬ 
nal  and  observation  processes t 

-  tike  discrete  estimation  problem  where  signal  and  observation  are  (discrete- time) 
random  sequences 


-  the  continuous-discrete  estimation  problem  where  the  signal  process  is  a  (continuous) 
stochastic  process  and  the  observation  is  a  (discrete)  random  sequence. 

-  the  continuous  estimation  problem  where  signal  and  observation  are  continuous  stochastic 
processes . 

Finally,  we  speak  of  nonlinear  or  linear  estimation  when  we  process  the  observations  in 
a  nonlinear  respectively  linear  manner  in  the  estimator. 

He  will  now  follow  the  probabilistic  approach  to  the  general  estimation  problem. 

Since  (s(t)),  ln(t)},  and  thus  (y(t)}  are  considered  stochastic  processes  we  surely  want 
to  know  the  probability  of  certain  signal  values  a (t-) )  under  the  condition  of  a  given 
realisation 


Yt  M  [v<c>  *  to  s  T  it] 

of  the  observation  process  fy(t)}  •  Generalized  this  means  knowledge  of  the  conditional 
probability  density  of  s(tj)  given  Y^,  denoted  by 

p  [sit,).  tt  |  Yt]. 

This  function  embodies  all  statistical  information  about  s(t^)  which  is  contained  in  the 
available  observations. 

The  solution  of  the  general  estimation  problem  is  consequently  given  by  equations  for  the 
evolution  of  this  conditional  probability  density  function  pCs(t^),  t^lY^],  starting  with 
the  initial  information  pts(to)#  tp 3  . 

If  we  know  the  conditional  density  we  can  obtain  in  a  comparatively  simple  way  special  estima¬ 
tes  of  the  signal  (Fig. 2),  e.  g. 

-  s^ip)  the  most  probable  estimate  (Maximum  A  Fosterorl  eatim it  uj  which  indicates  the  aaxi- 

mum  of  the  a  posteriori  density  pCe<t7>*"*^ 

-  s__,  t  the  Minimum  Variance  estimate  which  indicates  the  center  of  gravity  of  the  ares 

"  under  the  density  p  t»(tj)  I  tj. 

The  minimum  varlauce  estimate  is  of  special  importance  since  the  underlying  quadratic  lose 

J(f)i*  B  («*>  -  B  {  [*(t,)-i(t,  It) 3  2  )  (6) 

welghta  larger  errors  in  a  stronger  way  than  smaller  ones,  is  independent  of  the  sign  of 
the  error,  and  la  last  but  not  least  mathematically  well  trectable. 

It  la  easy  to  show  (e.  g.  /2J/)  that  the  minimum  variance  estimate  Is  given  by  the  conditio¬ 
nal  mean  (which  we  will  indicate  in  the  sequel  by  the  symbol  a  ),«nd  we  have 

•^(t,)-  B  W(t,)  I  *t )  *•  fi(t,Jt) .  (T) 

Moreover, we  note  that  the  conditional  mean  ia  an  unbiased  estimate,  that  is 

B  ls(t,)  -  8(t,(t)}  «  B(*(t,»  -  BtS  (s(t,)|Y^U*  0 

B(eJ(tt)V  («) 


2.2  Dynamical  models  for  signal  and  noise 

For  e  practice!  development  of  the  equations  of  evolution  for  the  conditional  probability 
density  function  we  have  to  specify  mathematical  models  for  the  signal  and  nolee  processes. 
A  modal  which  Is  sufficiently  close  to  the  'real  world*  on  the  one  hand,  and  analytically 
tractable  on  the  other  hand,  is  given  by  a  Markov  process  in  state  specs  notation  which 
My  be  described  in  continuous  time  bv  a  nonlinear  vector  stochastic  differential  equation 
of  diffusion  type 


Alt)  «  f[*(t),t]  ♦  o[x(t)  ,tjwU)  ,  t  »  t0  t 


(9a) 


x  and  f  are  n-vectora,  0  is  of  dimension  n  k  q  end  (wit),  t  a  t«i  is  a  q -vector  white 
Gaussian  noise  process  with  t  ( w(t))  -  0  and  B  {y(t)-wMr)i  -  g  6  (t-t) . 

Equation  (9a)  can  neither  be  integrated  In  the  Rierann  sense  nor  in  the  Riemann- 
Stleltjes  sensei  It  requires  for  proper  handling  the  Introduction  of  a  stochastic  inte¬ 
gral  (which  la  due  to  the  delta  correlation  of  the  white  noise) .thus  we  better  write  Cq. (9a) 
as  the  (ltd)  stochastic  differential  equation 


dxtt)  •  f[x(t),t]dt  ♦  G[x(t)  ,t]d3(t)  ,  t  >  t0 


(9c) 


where  is  u  Wiener  process  with 

E{dg(t»*  o  ,  KjdQ  (t)dST(t)}  -  g(t)  dt. 

Eq. (9b)  is  formally  equivalent  to  Eq. (9a)  with  w(t) :«d§/dt. 

The  initial  state  xtto)  is  often  assumed  a  Gaussian  random  variable  with  known  mean  x(tQ) 
and  covariance  matrix  P (to)  but  it  can  be  characterized  as  well  by  any  other  distribu¬ 
tion  pCx(to).  to]. 

Moreover,  we  have  a  nonlinear  m  vector  observation  process  including  additive  measure¬ 
ment  noise, 


^(t)  -  h  jc(t)  ,t]  *  v(t) 


(10a) 


where  {v(t),  t>to)  is  Gaussian  white  noise  with  E  {v(t)>  -  0  and  Ejv(t)vT(r)}  »R  S  (t-t) 
which  we  write  in  the  same  way  as  before  as  an  It8  differential  equation'  ,  ~ 


ds(t)  ■  h*x(t!,t]dt  ♦  dMt)  j 


(10b) 


this  is  formally  equivalent  to  Eq. (10a)  with  the  definitions 

,  v(t)t- 

*•  dt  -  dt 

and  the  Wiener  process  { n (t)  }  with 

■me 

B(d-i(t)]-0  ,  E  ida  (t)d'BT(t)}  ■  S  (t)' dt  •  (10c) 


Xt  is  assumed  that  ({3(t)j  ,  l  3  (t)},  end  x(t«j)  are  uncorrelated  respectively  Independent. 

The  mathematical  model  Eq.  (9) ,  (IQ)  is  the  basis  for  the  solution  of  tha  nonlinear  filtering 
problem  in  continuous  tims.  This  model  is  imbedded  into  the  general  estimation  problem  which 
appears  obviously  by  comparing  Pig. 3  with  Pig.1. 


Olscrste  model 

In  the  time-discrete  oeee  we  have  the  stochastic  vector  difference  equation 
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as  mathematical  modal  for  tha  signal  sequence  respectively  the  nonlinear  plant  under  con¬ 
sideration.  The  initial  state  x(to)  is  normally  distributed  with  known  mean  Y(to)  and  co- 
variance  matrix  P(ta>  or  has  any  other  distribution. 

The  nonlinear  observation  sequence  la  given  by 

£(V  *  #tk'  *  £‘V  •  k  »  0,1,1,..,  <1 1t>) 

The  dimensions  of  the  vectors  correspond  i  those  of  the  continuous  cats.  The  noise 
( v (t J )  and  ( era  whits  Gaussian  ssquencss  with  sere  mean  and  covariance  matrices 
filter  reap.  flTtfc) ;  w(tk).  v(tk)  and  x(t<,)  are  assumed  uncorrelated  reap.  Independent. 


Continuous-discrete  model 


How  we  have  a  combination  of  continuous  process  dynamics  and  discrete  measurements.  Thus 
our  system  model  la  given  by  Eq.(9b)  and  Eq. (lib). 


2.1  Tha  Pnkker-Planck  equation 

This  equation,  which  la  squally  known  as  Kolmogorov's  forward  equation  la  a  first  but 
important  step  towards  tha  solution  of  tha  nonlinear  filtering  problem  In  continuous  time, 
it  describes  tha  evolution  of  the  transition  probability  density  pDt(t),  tlx(to),  t© 3 
of  the  Markov  process  generated  by  the  ltd  differential  equation  (9bf,  assuming  that  tha 
initial  probability  density  ie  known.  Tha  measurements  Eq.  (1(A)  ere  not  yat  considered. 

The  Pokker-Plenck  equation  in  lta  moat  simple  form  traces  beck  to  the  work  of  Einstein 
/1 9/  on  Brownian  motion  in  1905.  Further  research  on  diffusion  processes  and  tha  deri¬ 
vation  of  tha  corresponding  partial  differential  equation  was  given  by  Pokker  /20/  1914, 
Planck  /21/  1917  end  Kolmogorov  /2 2/  in  1931. 


This  differential  equation  reads 


2l  -is — 


4  Z 


-2[p(5  a  sT>  tj: 


3x^7x~ 


With  p*«  pfx(t),  t  lx (to)  ,  to  ] 

and  a  delta  impulse  as  initial  condition  for  the  density)  which  neons  that  the  initial 
value  x(toi  •  a  is  given. 

With  tHe  introduction  of  the  so  called  forward  diffusion  operator 


£  »C<‘>*i3  1  «  32it-HGQ  GT)m] 


of  the  diffusion  process  lx(t),  t  *  to )  generated  by  the  ltd  differential  equation  (9b), 
the  Fokker-Planck  «quation~can  be  written  as 

H  •  *(p>  .  04) 

An  analytical  solution  of  this  equation  is  possible  only  in  a  few  staple  cases  where  the 
process  model  ie  linear,  1.  a. 

f  t  *  tt) ,  t)  -  Fit)  *X  (t) 

and  the  matrix  G  in  Eg. (9b)  la  independent  of  the  state  vector  x (t) , 

Example >  To  apply  the  Fokker -Planck  equation,  we  demonstrate  the  transition  of  the  pro* 
liability  density  function  for  a  simple  example  taken  from  /J4/. 

Given  a  first  ordsr  linear  process  model  as  indicated  in  Fig. 5.  We  are  looking  for  the 
evolution  of  p  [x(t),  t  Ixlt^j-Xo}  an  solution  of  the  Fokker-Planck  equation. 

Eq.  02)  has  the  fora 
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How,  the  Wiener  process  is  Gaussian,  and  aa  we  know  normally  distributed  ttoecesstw  main  nor¬ 
mally  distributed  while  passing  through  a  linear  tystent  thus,  the  Gaussian  assumption 
for  the  transition  probability  pM,»>  (i,tiXo)  is  straight  forward.  This  yields  ftam  Eq.tlS) 
two  ordinary  differential  equations  for  the  propagation  of  mean  and  variance  with  the  so¬ 
lution 

x  •  x  wtp£-e(t-t  )*  , 

Q  °  H*a) 

4it}  I.  varix)  »§;{«-  axpl-fa lt-t0O  •  (Ub) 

Fig. 9  Illustrates  the  evolution  of  the  corresponding  transition  probability  density, 
ths  time  history  of  the  msan  value  x  describes  the  characteristic .motion  of  the  system 
while  the  variance  increases  dua  to  the  diffusion  from  taro  to  cs«(t-*«*)*  0/2*. 


3.4  the  Kuthner-fitratonovich  equation 

this  equation  represents  ths  theoretical  solution  of  the  nonlinear  filtering  problem  in 
continuous  time. 

Mow  the  system  model  Eq. (9b)  is  considered  with  the  observation  Eq.(tOb)  included  and  the 
equation  of  evolution  for  the  probability  density  p  Cx,t  lit  3  of  the  state  x  conditioned 
on  a  realisation  tt«*C«(*)t  to»T*tJ  of  the  observation  process  is  look*?  for. 

The  first  derivation  of~thls  equation  was  given  by  Stratonovlch  /IS/ in  i960,  Wc  unfortuna¬ 
tely  wee  slightly  errornoua  by  omitting  some  terms.  Kushnsr  /IS/,  /i6/  obtained  the  exact 
equation  in  1*64.  Buoy  /It/  developed  the  same  equation  in  1965  using  the  so  called  repre¬ 
sentation  theorem. 


The  conditional  probability  density  of  the  state  given  the  observations  satisfies  the 
Kushner-Stratonovich  equation 


p[x,t+dt| lt,dz  ( t) ]  -  p[x,t|Zt]  -  dp  =■  £(p)dt+[h-h]TR_1  (dz-h  dt)p 


(17) 


This  is  a  stochastic  partial  differential  equation?  it  is  stochastic  because  of  the  Wie¬ 
ner  process  contained  in  the  differential  observation  dz  and  partial  due  to  its 

Fokker-Planck  part.  Analytical  solutions  of  this  equation  practically  do  not  exist  but  it 
can  be  used  for  deriving  the  exact  equations  for  the  moments  of  the  probability  density; 
this  is  for  instance  important  to  obtain  minimum  variance  estimates. 

In  view  of  Eq.(17)  the  principal  behavior  of  the  nonlinear  estimator  may  be  summarized 
as  follows; 

-  The  conditional  probability  density  p  [x,t|Zt]  changes  as  a  result  of  the  dynamics  of 
the  process  model  and  due  to  the  observations. 

A 

-  The  measurement  information  is  used  in  form  of  the  residual  (d*-hdt;  . 

This  residual  is  weighted  with  the  matrix  [h-n J*R"1 .  In  case  of  stronger  noise  (  ||£.ll  large) 
we  have  a  weaker  influence  of  the  measurements.  If  the  measurements  are  useless 
(R"i-»0)  we  obtain  prediction  according  to  the  Fokker-Planck  equation. 

-  The  estimator  is  nonlinear  because  the  nonlinear  vectors  h  =  h(x)  and  f (x)  as  well  as 
G(x)  require  nonlinear  processing  of  the  measurements. 


2.5  Minimum  variance  estimation 

With  regard  to  the  realization  of  practical  nonlinear  filters  we  are  interested  in  charac¬ 
teristic  values  of  the  conditional  probability  density  function  p  [x,t  |  ZtJ. 

As  mentioned  in  paragraph  2.1  the  minimum  variance  estimate  is  the  conditional  mean  which 
represents  the  first  moment  of  the  conditional  density.  Thus  we  are  looking  for  the  equa¬ 
tion  of  evolution  of 

x(t,jt)  j«  B(x(t,) |Zfc}  ;  t,  >  t  .  <18«> 


In  addition,  we  need  the  second  moment,  the  conditional  covariance  matrix 


£<fci 


1 1)  E{[x(t1)-x(t1jt)]  [x(t1)-x(t1!t)]'r.Zt>  i»  2(xT(t1  ; t) x ( t ,  It) }  (18b) 


where 

x(t1 jt) :*  x(t1)-x(t1 |t)  (18c) 

is  the  estimation  error.  This  is  because  the  matrix  P  is  a  measure  of  the  accuracy  of  the 
estimates,  since  the  mean  of  P 


EfPft^t)}  -  EtEfxU,  !t)xT(t1  ;t)  |ZtH  *  E(x(t1  |t)xT(t1  |t) ) 


(18d) 


contains  the  variances  of  the  components  of  the  estimation  error  vector  (Ibc)  as  diago¬ 
nal  elements. 


Kushnei  /1 6/  proposed  in  1964  to  develop  equations  for  the  moments  of  the  conditional  den¬ 
sity,  which  would  yield  a  system  of  ordinary  stochastic  differential  equations  instead  of 
the  partial  stochastic  differential  equation  (17).  Bucy  /1 7 /  obtained  these  equations  for 
the  first  and  second  moment  in  the  sualar  case  and  Bass,  Norum  and  Schwartz  /26 /  treated 
the  general  vector  case  in  1966, 


The  conditional  mean  (i.  e.  the  minimum  variance  estimate)  and  the  conditional  covariance 
matrix  satisfy  the  ordinary  stochastic  differential  equations 
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dx(t)  -  f  [x(t)  ,t]dt+[x(t)h  -xit)h4]R(t)  (dz(t)-hdt.' 


(19a) 


[dP(t|t)]i;j»[(^x;j-fiXj)+(:tif;j-xif ix^h-xj}) TR(t)  (jQ^-hXjJjdt 


tCxix^h-^Xjh-xixjVxjQ1t2xi<tj£lTR(t)  •  (di(t)-hdt) 


A.  .  /v 


-1 


(19b) 


1*/ 


with  given  initial  conditions 

x(to),  £(to|to)  -  P(to>; 
and  the  more  simple  notation 

x  (t)  t*»  x  (tit) . 

For  technical  realization  of  a  minimum  variance  estimator  we  need,  however,  further  approxi¬ 
mations  (which  will  be  discussed  in  para, 3)  since  Eq.(ISi)  requires  knowledge  of  the  whole 
conditional  density j  this  is  quite  obvious  because  the  expectation  operations  in  (19)  e.  g. 
for  the  f  vector  are  defined  by 

m  & 

f (x) E  {f (x)  I  Zt}  »  j...  j  f (x)pCx(t) ,t| Zt]  dxi...dxn  (20) 


2.6  The  linear  minimum  variance  estimator 

We  now  consider  as  a  special  case  a  linear  system  model  which  we  may  write  like  Eq.(9a), 
(10a)  using  white  noise  instead  of  che  Wiener  process, in  the  form 

x(t/=>  F(t)  x (t)  +  G (t)w(t)  ,  tttQ  (21a) 

y  (t)  *=  H  (t)  x  (t)  +  v  (t)  ,  t  a  to  (2ib) 

The  optimal  minimum  variance  estimator  for  this  system  model  which  is  easily  obtained 
from  Eq.(19),  is  given  by 

dx  ^ 

“  F  ( *-)  X  (t)  +  K  ( t )  ( v  it)  -  H  (t)  x(t)  )  (22a) 


dP  (t) 

-ir~ 


F  (t)  P ( t)  ♦  PUJF^t)  +  G(t)£(t;GT(t)  -  Plt)HT(t)R  1 (t)H(t)P(t) 


(22b) 


Ktt)  P(t)KT(t)R  *  (t)  I  t  >  t  ,  (22c) 

initial  conditions  x(to),  P(t0). 

Those  are  the  equations  of  the  weil  known  Kalman-Bucy  filter  which  goes  back  to  Kalman  and 
Buoy  /10/, 1961  in  the  continuous  case  and  Kalman  /  9/,  1960  in  the  discrete  case. 

It  is  not  our  purpose  to  discuss  the  optimal  linear  estimator  in  this  contribution  in  more 
detail  because  our  mein  objective  is  nonlinear  estimation.  However,  we  will  give  an  im¬ 
pression  of  the  structure  of  this  linear  filter. 

This  will  be  used  later  se  s  reference  for  the  understanding  of  the  local  nonlinear  filter 
approx imat ions . 

Fig. 6  shows  the  signal  flow  of  the  data  processing  in  the  Kalman-Bucy  filter.  It  has  the 
structure  of  s  control  loop  where  the  "plant*  equals  the  system  model  under  consideration. 
The  controller  is  of  (matrix)  prqcortioanl  type  (K(t))  and  updates  the  system  model  in  the 
computer  using  the  residuals  (y-£  ).  The  time  variable  Kalman  gain  K(t)  is  obtained  as 
solution  of  the  matrix  Rloastl  differential  aquation  (22b)  for  P,  which  may  be  solved  off¬ 
line  since  P  does'  neither  depend  on  the  observations  nor  on  the~aatlmata  fi. 

The  stationary  solution  of  this  aquation  yields  s  constant  gain  for  the  filter  which  is 
then  equivalent  to  the  Wiener  filter  /?/,  /fl/. 

Finally  it  should  be  emphasized  that  the  linear  minimum  variance  filter  la  the  exact  solu¬ 
tion  of  the  general  estimation  problem  provided  that 

-  the  ays tea  model  is  linear 

-  and  the  white  noise  w(t),  v(t)  are  normally  distributed  and  uncorrelated. 

In  this  cess,  the  conditional  density  p  [ x,t  |  Xtl  1*  Gaussian  which  only  requires  first 
and  second  momenta  (fi  and  P)  for  its  representation.  K»  we  know  Eq.(22)  for  fi  and  P  con¬ 
tain  no  approximations,  i.  a.  the  evaluation  of  these  momenta  is  exact. 


2.7  The  Bayesian  recursive  estimation  equations 

The  development  of  the  recursion  relations  for  the  conditional  probability  density 
p  Cx(tk)  |  YtJ  1°  the  time-discrete  case  is  a  much  simpler  teak  than  in  the  contlnuoia 

case. 

This  la  due  to  the  feet  that  we  do  not  need  stochastic  integrals  and  stochastic  differen¬ 
tial  aquations  but  sums  and  difference  equations,  moreover ,  the  discrete  white  noise  is 
physically  meaningful. 

We  consider  the  discrete  system  model,  Eq.(ll).  The  recursion  equations  for  filtering  (up¬ 
dating  after  measurements)  and  prediction  (between  measurements)  can  be  obtained  by  using 
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Bayes'  theorem  of  probability  theory. 

a)  Filtering  (calculation  of  the  a  posteriori  density  at  t-t^) 


normalization: 


#  k  -  0,1,2,... 


(23a) 


c(tkJ  s-  P[Z(tk)  !Xfcjt_1]  -  [  Pv(tk)  Cz(tk)-h[x(tk)]] -PLx(tk)  |Yt^  Jdx(tk)  (23b) 


k-l 


Initial  conditions:  p  rx(to)  JY*.  3”  P  fx(to)] 

~  "o-1 


b)  Prediction  (calculation  of  the  a  priori  density  in  the  absence  of  measurements) 


p[x(tk+1)  |Yt  ]  -  j  p[x(tkH):x(t:t):.p[x(tk)tYt  ]dx(tk) 


(23c) 


Prw(r,  )fetV,i-2-jE<V3]  -p^(tk>  :it,;^(V 


—  -  X 


Analytical  solutions  of  Eq. (23)  are  available  only  for  linear  systems  with  Gaussian  ini¬ 
tial  -conditions  and  disturbances  which  yield  the  Kalman  filter. 


For  the  general  nonlinear  case  we  have  to  look  for  numerical  solutions  of  these  equations 
which  will  lead  us  to  global  approximations  of  nonlinear  filters  (see  para. 4).  There,  the 
main  problem  is  the  amount  of  storage  capacity  needed  for  storing  of  the  multidimensional 
conditional  probability  densities  and  the  calculation  of  the  nonlinear  convolution  inte¬ 
grals  in  Eq.(23),  A  good  survey  on  the  methods  of  density  approximation  is  given  by  Soren¬ 
son  ,'27/  in  1974. 


Eq. (23)  can,  however,  be  used  to  develop  the  exact  equations  of  the  conditional  mean  x  and 
covariance  matrix  P  for  the  nonlinear  minimum  variance  estimator. 

2.8  Continuous-discrete  estimation 

This  mode  of  estimation  is  of  special  importance  because  in  many  applications  the  under¬ 
lying  mathematical  model  of  the  system  has  continuous  dynamics  while  the  observations  are 
usually  taken  at  discrete  time  instants. 

The  system  model  for  continuous-discrete  estimation  is  given  by  Eq. (9b)  end  Eq.(llb). 
Obviously,  the  filtering  equation  is  Eq. (23a)  of  the  disorete  estimation  problem  and  pre¬ 
diction  over  one  time  Interval  is  accomplished  by  the  Fokker-Planck  equation  (12). 


U 


3,  LOCAL  APPROXIMATIONS  OP  NONLINEAR  FILTERS 

After  having  available  the  exact  solution  of  tha  general  astination  problem,  w*  are  now 
looking  for  practical  recursive  algorithms  which  can  be  directly  implemented  on  s  compu¬ 
ter  to  give  us  on-lins  estimates  of  the  state  vector  x. 

For  this  purpose,  we  have  to  approximate  the  density  functions  as  well  as  the  nur. linear 
system  functions. 

One  frequently  used  approach  to  these  approximations  is  carried  out  in  tha  following  steps: 

i)  approximation  of  the  conditional  probability  danslty  function  p  [x(t),t  I  it  ]  by  the 
momenta  of  the  distribution 

11)  consideration  of  the  first  and  second  moment  (x  and  P) ,  i.  e.  minimum  varlanoo  estima¬ 
tion,  and  neglecting  or  approximation  of  hlghar  order  moments 

Hi)  approximation  of  the  nonlinear  system  functions  t  [x(t),  t}  ,  h  [x(t),  t],  G  [x(t),t] 
by  series  expansion  around  spproprlsta  rsference-poXnts  reap,  trajectories.'*  ~ 

This  approximation  la  called  local  approximation  since  tha  resulting  filtering  algorithms 
aro  only  applicable  in  the  surrounding  of  the  reference  point. 


3.1  First  order  filter  approximations 


Obviously  first  order  approximations  are  the  simplest  ones  and  therefore 

most  often  used  in  practical  apollciM&n*.  In  the  twquwl  «e  wDl  pi'rssnt  some  of  tha  wore 

important  filter  algorithms  which  have  lean  developed  -Inca  the  beginning  of  the  sixties. 


3.1.1  The  continuous  minimum  variance  filter (extended  Kalman-Bucy  filter) 

Tha  underlying  system  model  la  given  by  the  (ltd)  stochastic  differential  equations  (9b) 
and  (10b' .  Me  search  for  an  approximation  for  the  (exact)  minimum  variance  filter  equations 
(19)  assuming 
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1)  the  conditional  probability  density  p  [x(t),t  I  Z^]  is  almost  symmetric  and  concentra¬ 
ted  near  its  mean.  Thus  odd  central  moments  are- negligible. 

ii)  The  system  functions  f  (x) ,  and  G(x)*g*GT(x)  are  expanded  in  Taylor  series  around  the 
conditional  mean  2  up“to  terms  of  first  order. 

This  expansion  e.  g.  for  f (x)  yields 

r  ,  ..  of  fx  (t)  ,  tj 

f[x(t),t]  *  f  [x(t)  ,tj  +  - - 


[x(t)-x(t)]  + 


x*x 


and,  taking  the  conditional  expectation  we  obtain 

f|x(t),t]  :=»  E(f  [x(t)  ,t]  i  Z  }  =  f[x(t),t] 

since  the  first  central  moment  E  { x -x  |  z^}  is  zero  due  to  assumption  i. 
in  a  similar  way  we  obtain 


(245 


(25a) 


h[x(t)  ,t]  s  h[x(t)  ,t] 


and 


[g  2  GTj  s  G[x(t)  ,tjg(t)G  [x(t)  ,tj  . 

Bearing  in  mind  that  \T 


in  nun  tnac  \T  \T 

r  .m/«h  \  -  / 3h  \ 

r  Av.T  '  £Ti  r,  r-  T  .  .  *  X  X1 1  —  |  j  ■  0  +  P(t | t)  I  —  1 

US  *1  il  3  "  C £r3  *  [x  hT (x) ]  +  [-  -  [tej  J  ~  -  \dxj 


(25b) 

(25c) 

(25d) 


we  immediately  obtain  wi"i  Eq.  (25)  the  differential  equation  for  2  from  Eq.(19a)>  and 
similarly  tne  equat  jn  for  the  covariance  matrix  P#  taking  into  account  that  P(tjt)  does 
not  depend  immediately  on  the  observations  (the  factor  before  Cdz-ndtl  in  Hg.(T9b) becomes 
zero),  hs  a  consequence,  the  conditional  covariance  matrix  PttltT  equals  the  unconditional 
matrix  P(t) . 


Thus  we  have  th-  equations  for  the  f;rs»  order  minimum  variance  filter 


r 


G(t>  -  *  sitHdito  -  h[Sio,t]ati 


ar ' 


Ljnti  >f  [*  tt! .  .1  »fTU<tt.O 

•  -  -  -'-I -  fl.)  4  pit)  — - -  • 

»h<“  "  ”  >it*i 

»hT[£ui,t)  ..  »£(*<*».•) 

I-  riu  — - - *  ttt  — - pui  » 

>i(U  ■  ij!(U 

l*<U  •  £U» - •*  '(U  . 

*  IJtl) 

I'.UUl  co«d.iB<V  •  5**0'  1  *  i  • 


(26a) 


(26b) 


(26c) 


For  technical  realisation,  both  differential  equations  have  to  be  so-ved  on-line  since  the 
gain  matrix  k  depends  on  the  conditional  mean  9  .  This  is  differ  int  fro*  the  (linear)  Eal- 
man-Bucy  filter  where  K  can  be  calculated  off-line  without  taking  any  obaarvation. 

Apart  fro*  this,  the  firat  order  minimum  variance  filter  is  very  eimiler  to  tha  Kalmai- 
Hucy  filter  which  ia  eaay  to  aee  by  ccetparing  tha  block  diagrams  of  both  filters.  Pi',. 7 
and  Pig. 6. 

Actually,  the  filter  equations  for  tha  minimum  variance  filter  (26)  can  ba  gained  using  the 
error  model  of  the  eyetee  equations  (9b).  (10b)  and  the  linear  Kalman-Bucy  filter  aquations 
(22) .  The  error  4x  (t)  is  defined  ee  the  derivation  of  state  vector  x  fra*  the  condi¬ 
tional  mean  x,  i,"e. 


4x<fc‘ x(t)  -  5(t) 


(27) 


Now  we  have  with  Eq.  \9b) 

d[4xU)}  -  dj3x(t)-x(c)j-(f|j{(t)  ,tj  -f  fx(t)  ,t]  )dt  +  C[x(t)  ,tjd|(t) 

and  in  view  of  Eq.  (24)  and  (9a)  with  G(x)  a*  g(x)  we  may  writs 
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•  Sx(t)  +  G[x(t)  ,tl  w(t) 

x-x  (28a) 

which  in  fact  is  a  linear  stochastic  differential  equation  for  the  error  Jx .  In  an  analo¬ 
gous  way  we  obtain  the  linearized  measurement  equation  ” 


ox  (t) 


•  3f 
3x 


<5  y  (t) 


ib 

&  X  x-x 


S  x(t>  +  v(t) . 


(28b) 


Now  the  Kalman-Bucy  filter  equations  (22)  are  directly  applicable  to  the  error  model  (28) 
and,  taking  into  account  that 

Sx  «  E  {x-x  I  tj  -  0 

He  obtain  equations  (26),  the  first  order  minimum  variance  filter. 

Because  of  this  possibility  of  developing  the  minimum  variance  filter  by  an  extension  of 
the  linear  filter  theory  to  a  linearized  nonlinear  system,  this  filter  is  also  well  known 
as  extended  or  modified  Kalman-Bucy  filter. 


3.1.2  The  linearized  Kalman-Bucy  filter 

The  assumptions!  and  ii  for  the  approximation  are  the  same  as  for  the  minimum  variance 
filter  witn  the  exception  chat  the  linearization  of  f  (x) ,  h(x)  and  G(x)  £  G1  (x)  is  now 
carried  out  about  a  known  nominal  trajectory  x  (t) .  TKis  trajectory  can  be  specified  by  the 
differential  equation. 


dx(t)  “  f[x(t),t]dt,  x(tQ)  ■  x(tQ);  t  >  tQ  .  (29) 

If  the  first  order  approximation  of  the  deviation  from  the  nominal  trajectory 

5x(t)  -  x (t)  -  x(t)  (30) 

is  small  enough  we  obtain  an  appropriate  linear  system  model  for  this  deviation  in  accordance 
with  Eq. (28)  and  hence  by  application  of  the  Kalman-Bucy  filter  (Eq. (22) )the 
equations  of  the  linearized  Kalman-Bucy  filter. 


(31a) 


(31b) 


(31c) 


(31d) 


Remarks t 

I)  Since  the  nominal  trajectory  5(t)  is  usually  not  a  constant,  the  linearised  Kalman- 
Bucy  filter  la  time  variable  ~ 

II)  the  filter  gain  matrix  £(t),  however,  can  now  be  calculated  beforehand  (off-line) 
without  taking  observations.  This  is  the  difference  to  the  extended  Kalman-Bucy 
filter  where  we  need  as  a  reference  for  the  linearisation  the  conditional  naan  8 (t) 
which  wa  only  obtain  by  on-line  filtering. 

ill)  The  filter  is  ueeful  especially  in  aerospace  applications  where  the  nominal  trajecto¬ 
ry  x(t)  la  of tan  known.  In  general,  it  la  not  a  simple  task  to  define  this  trajectory 
properly.  However,  if  the  deviation  £x  becomes  too  large  the  linearisation  conditions 
are  no  longer  valid  and  uaaleaa  estimates  result. 


*  rju*8(o.at»8U)Uy(ti-8U)4ju>Mt 
•  ru»ru>  ♦  pmfui  - 
-  Sditt’mg-'iuijtumi  ♦ 


htti  » 


Jilt)  •  gtU  •  tit* > 

|inuul  co«d.i*8l*0!  *  o.  £**„>•  1  i 


,d»f  uitMwui  £ltl  t«  ■ 


ni(gtu.t) 

9u»,.  — 


gttl  f  g(ilU,») 


gUl.gUI 


W  I 


3.1.3  The  continuous-discrete  extended  Kalman-Bucy  filter 

Because  of  the  practical  use  of  the  continuous-discrete  mode  of  estimation  we  will  now 
present  the  estimation  equations  for  this  filter. 

The  system  model  consists  of  the  (continuous)  nonlinear  stochastic  differential  equation 
(9b)  and  the  (discrete)  observation  equation  (11b). 

The  development  of  this  estimator  is  straight-forward  using  the  well-known  discrete  Kalman 
filter  equations  (which  we  did  not  give  in  this  contribution)  for 

updating  (filtering)  of  the  estimate  after  an  observation  on  the  one  hand  and  the  con¬ 
tinuous  extended  Kalman-Bucy  filter  for  prediction  on  the  other  hand. 

The  latter  equations  are  obtained  by  setting  R-1  in  Eq. (26a) ,  (26b)  equal  to  zero.  Thus 
summing  up,  we  have  the  equations  of  the  continuous-discrete  extended  Kalman-Bucy  filter 

i)  Filtering  (updating  at  the  instant  tjj  of  an  observation) 


(32a) 


(32b) 


(32  c) 


ii)  Prediction  (between  obaervations  t*  i  t  < t*+1 ) 


(33a) 


(33b) 


3.1.4  The  iterated  extended  Kalman-Bucy  filter 

There  ere  several  possibilities  of  improving  the  estimates  of  the  extended  Kalman-Bucy 
filter  by  local  Iterations.  Ths  basic  idss  for  thsss  algorithms  is  ths  reduction  of  the 
estimation  errors  by  an  itarativs  re-1 inear ire tion  of  the  system  nonlinear ltiee  h(x)  and/ 
or  f  (x)  about  improved  reference  values. 

Ha  will  demonstrate  this  for  the  measurement  nonlinearity  h(x) . 

Xn  the  filtering  equations  (33)  of  the  continuous -discrete  extended  Kalman-Bucy  filter 
the  reference  fi  for  the  evaluation  of  h  (B)  and  the  Jacobian  matrix  B(x)  la  the  predic¬ 
ted  atate  vector  x (t^Jt^-i )  baaed  on  tEe  information  of  the  observation  )  at  the 

instant  tv-i.  in  view  of  the  filtering  aquation  for  the  linearised  Kalman-Bucy  filter  with 
reterencevalue  J?(tk)  (tee  e.  g.  /24/  p.183)  we  may  write  the  corresponding  equation  (32a) 
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of  the  extended  Kalman -Buoy  filter  in  the  form 

2(tkiV  *£(tkltlc-1)  +  ,1,^]  .  {v(t)  -  h  [xtt^I  tJc_1 )  #  tk]  - 

(34) 

"  Hg(tk|tjt_i),tk]*Cx{tkltk.1)  -  2£ {t)c  1  tk_1 ) ]  > 


s*  xft^) 


Now,  the  updated  estimate  ^(t^lty)  on  the  average  will  be  better  (i.  e.  closer  to  the 
actual  value  x(t^))  than  the  predicted  estimate  •  Hence  we  relinearize  about 

—  ^tfcltv ) and  will  obtain  an  improved  updated  estimate  xlttltO  by  processing  the  observa¬ 
tion  yitjt)  once  again  through  the  filtering  equation  (34)  with  x (tk) t.) .  For  this 
purpose  the  gain  matrix  K  (Eq.(32c))  has  to  be  recomputed  as  well  using  x  (t^ it^) . 

The  iteration  can  then  be  repeated  by  linearization  about  x (t?  It.  )  which  yields  x Itl  It.  ) . 
Generally  the  iteration  procedure  will  be  terminated  when  a  further  improvement  is  notK 
possible,  i.  e.  i  ui  <i->>  I 

Is^kl*),*  •  5*^1  Si*  I  i  £r  *  o  .  (35) 

Bence  we  sum  up  the  equations  of  the  iterated  extended  Kalman-Bucy  filter  with  iteration 
of  the  measurement  model. 

i)  Filtering  (updating  at  the  instant  of  an  observation  y(t]()) 


u*n  .  ..»» 

Ktkklik»  •  I fcii- 1*  *  5tii,‘kllk>**kl ' 

11)  .  ii> 

•t)tHk)-h(i(tkltk),lk}  -  j»CiUhltk)- 

.11) 

*(8'SlV»,*S,tkl‘kOt  * 

l  *  < . (l-tii  JM,Uk|tk)i«  gUk|tk.|) 

k  •  0,1,1,...  BU><lkltk''*  fl'lk^k’ 

.11)  r.  <1> 

g(«iUklik).ik)  •  EUklv,)flTUttklt,_).ik)- 

.  in 

_  .in  -t 

•  h  Cb<\IV*S,1*S'V1 


(36m 


(36b) 


Covariance  matrix t  evaluation  only  once  after  having  terminated  the  iterations,  i,  e. 

iq.daw 


.|i»klV 


U-BlaVV**fc)9U‘HlV,*k)l' 


in 


•  stgu^lt^)  ,1^]  i* 


(36o) 


11)  Prediction  (between  observations  tvdtet^^)  Bq,  (33)  holds,  starting  the  integration 
with  the  final  iterated  filter  estimates  x^qytjfrom  Eq.(36e)  and  £(tk|tk> 
from  Eq. (36c) .  “  * 

The  Iterated  filter  considerably  reduces  the  influence  of  the  measurement  nonlinearity 
h (x)  on  the  estimation  quality.  This  was  demonstrated  by  Denham  and  Pines  /32/  in  1966. 

If  the  system  nonlinearity  f(x)  is,  however,  strongly  non  linear,  further  improvement  may  be 
achieved  by  Iterating  the  prediction  equations  as  well.  The  resulting  estimator  was  first 
developed  by  wishner,  Tabacsynski,  and  Athens  /29/  in  1 960 r  it  is  known  as  "single  stauo 
iteration  filter-,  vhila  Jaswlnski  called  it  -iterated  linear  filter  smoother"  (/23>. 
p.290).  The  discrete-time  version  of  this  estimator  is  called  discrete  conditional-ton 
iteration-filter  (Saga  and  Malsa  /30/,  p.470).  Usually,  thasa  Iterated  filter  algorithms 
converge  very  faatt  the  main  Improvement  la  alraady  often  reached  efter  one  or  two  itera¬ 
tions  /29/.  Thus,  thay  represent  a  vary  useful  tool  for  nonlinear  filtar  applications. 


3.2  Higher  order  filter  approximations 


As  already  pointed  out,  first  order  approximations  have  the  advantage  of  a  comparatively 
small  amount,  of  computational  burden  .  Moreover,  these  filters  can  be  developed  by  an  ex¬ 
tension  of  the  linear  filtering  theory.  Thus,  deeper  insight  in  the  treatment  of  sto¬ 
chastic  differential  equations  fe .  g.  the  It6  stochastic  calculus)  is  not  necessarily  re¬ 
quired.  As  a  consequence  -  with  regard  to  nonlinear  filter  approximations  -  the  extended 
Kalman  filter  type  estimators  have  found  wide  spread  applications. 

On  the  other  hand,  smaller  estimation  errors  may  result  if  second  partial  derivatives  of 
the  nonlinear  system  functions  f(x),  h(x)  are  included  in  the  estimation  algorithms. 

There  exist  a  lot  of  different  higher  order  approximation  filters;  as  an  example  we  will 
give  in  the  sequel  two  important  types,  the  continuous  second-order  minimum  variance 
filter  and  the  continuous-discrete  modified  Gaussian  second-order  filter. 


3.2.1  The  continuous  second-order  minimum  variance  filter 

This  filter  was  developed  by  Bass,  Norum  and  Sclwartz /S6  /  and  independently  by  Jazwinski 
/31  /  in  1966. 

The  system  model  is  again  Eq. (9b) ,  (10b). 

Assumptions: 

i)  The  conditional  probability  density  function  p  ['x(t),  t|  ZtH  is  almost  symmetric 
and  concentrated  near  its  mean.  Thus,  odd  central  moments~ars  negligible. 

ii)  The  system  functions  f (x) ,  h(x)  and  G(x) g  gTx)  are  expanded  in  Taylor  series  around 
the  conditional  mean  up“to  terms  of  second~order . 

The  assumption  i  is  the  same  as  for  the  first  order  minimum  variance  filter.  However,  even 
central  moments  of  fourth  order  which  will  appear  now  (due  to  the  second  order  series  ex¬ 
pansion)  are  neglected,  i.  e. 


[tXt-Si)  (Xj-Xj)  (X^-X^)  (X^-X^)] 


a  0 


(37) 


The  expansion  of  the  nonlinear  functions  and  expectation  operation  e.  g.  for  f (x)  yields 
f  [x(t),t  J  **  f  [x(t),t]  +  J  ,  p(t|t)  (38) 


wit):  the  definition 


and  the  Hessian  matrix 


(39a) 


(39b) 


we  get  after  some  calculation  with  Eq.$37)  from  Eq.(19a),  (19b)  the  equationa  of  the  con¬ 
tinuous  second-order  minimum  varlanca  filter. 


r  t  [x{  t)  ,  t]  ■* 

ditt)  -  |£[x(t)  ~  -  >  P(t|t>jdt  ♦ 

,  f  .  >Jh[x(0,t]  , 

+;(t)|di(t)-^h[*(t),t]+^  - - iP(t|t)jdtj 

dP(t|t)  -  {p[i(t!.t]P(tlt)+P<t|t)FT[*(t),t]  - 
-P<tU)HT(x(t),t]R',(t>  H(*(t),t]P(tlt)  ♦ 
♦G[i(t),t]g(t)0T[i(t).t]  + 

♦  ’  iij  <G[i(t).t]g!t)0TIx<t).tJ)iP(t|t)}dt  - 
,  T  . 

-  i  pit| t) t— — — — j —  >  P(t|tn  r  No- 

*  ”  a* 

,  r  ,  »ah[i(t),t]  1  , 

j - -p -  i  P(tttljdt) 

K(l)  •  P(t|t)HT[i(t),t}R''(t) 

UttUl  oood..  5<V  *  £Uo>>  1  i  eo 

»U[i(t),t] 

cUfiortion  i  P  »  -- - r™ —  •  B**  - : - -  • 

la  i; 


(40a) 


(40b) 


(40c) 


Comparing  this  filtar  with  the  first-order  filter  (Sq. (26))  we  aee  in  which  way  the  second 
order  terme  are  now  added  for  better  approximation  of  the  nonlinear  function!  f  and  h. 

This  ia  demonstrated  also  by  Fig, 6  in  comparison  with  Fig, 7.  Moreover,  the  equation 
(40b)  for  the  conditional  covarianoe  matrix  P(tlt)  is  now  a  stochastic  differential  equa¬ 
tion  since  it  contains  the  observations. 


3.2.2  The  continuous-discrete  modified  Gaussian  second-order  filter 

This  filter  was  given  by  Jaswinski  /32/,t966  and  Independently  by  Fisher  /33/,  1967  and 
Athens,  Mishner,  Bertolini  /34/  in  1968.  It  has  the  advantage  that  no  stochastic  differen¬ 
tial  equation  is  included  in  the  estimation  algorithm.  Thus,  it  is  easy  to  implement  on 
a  digital  computer. 


Now  the  continuous-discrete  system  model,  Eq. (9b)  and  (11b)  applies. 

Assumptions i 

1)  The  conditional  probability  density  function  p  [£<0 1*  (approximately)  Gaussian. 
Thus  odd  central  moments  are  negligible  and  fourth  central  moments  will  be  approximated 
by  elements  of  the  covariance  matrix  P. 

[(Xj-Xj^  (x^-x j )  (Xfc-Xjj)  (*!-«!>  «tn_t3  “ 


"  Pjk‘pU  *  pjl’pik  *  pkl*plj  * 

ii)  The  nonlinear  ayatam  functions  f{x),  h(x)  and  G(&)  rasp.  G(x) 
lor  striss  around  tha  conditional  mean  J  (tttfcjjj ,  tk.^  st'^ 
ordar. 


(41) 

T 

Q  G(x)  are  expanded  in  Tay- 
up“to  terms  of  sscond 
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ill)  The  filter  equations  for  x(tkltk)  and  P(tkltk)  at  an  observation  will  be  developed 
by  assuming  a  power  series  in  y(tkltfc-T)  which  is  carried  to  the  first  order  only. 
The  covariance  equation  will  then  be  modified  by  omitting  the  immediate  influence 
of  the  measurement  y(tk)  on  P(tkltfc) ,  i.  e.  the  second  term  in  the  power  series  for 
P(tkltk)  is  dropped.  ” 

The  result  of  assumption  iii  is  an  updating  (filtering)  equation  of  the  form 
x(tk|tk)  -a(tk)  ♦  Ktt^)  Ii(tk>-£(tlclt3c_1)3 


—  {tkl tTt)  "  £(tk). 


(42) 


These  are  in  fact  linear  regression  equations  for  the  vector  case,  which  yield  the  best 
minimum  variance  estimate  xitjdtjj). 

The  general  formulation  for  a,  K  and  C  can  be  found  in  Farison  /SS / .  From  Eq.  (42)  and 
/3 5 /  we  finally  obtain  after-some  calculation  the  filtering  equations  of  the  modified 
Gaussian  second-order  filter,  making  use  of  Eq.(38),  (39)  and  (41). 

The  prediction  equations  can  be  derived  from  the  continuoB  minimum  variance  filter  (40) 
by  omitting  the  terms  which  characterize  the  influence  of  the  observations.  Since  we  often 
have  additional  known  input  signals  u(t)  in  the  system  (besides  the  stochastic  inputs 
w(t),  v(t)}  we  will  take  them  into  account  in  the  prediction  equations,  supposing,  that 
now  in~Eq. (9b)  we  may  write 

f  S"  f  [x(t)  ,u(t)  ,t]  *  f  [x(t)  #t]+B[x(t)  ,t]u(t)  (43) 

Resulting,  we  obtain  the  continuous-discrete  modified  Gaussian  second-order  filter, 
i)  Filtering  (updating  at  the  instant  t*  of  an  observation  £{tk) 


;ttkuki  «  jttk|tk.,i  ♦  8»v{*«V  ' 

mkitk>  *  (i  -  KUk>j5Uk>)rltk|tk.,» 

K(tkt  •  £ttkUk.,>HTl*k>l!IUkl£ltkUk.,>MT«tk)  ♦] 


1  -  c.i.i,. .. 


Initial  coftd.1  1 

S'V 

dtluutloaai  HUkli»  — IgfljjT1 — ' 


S*S‘W»‘ 

H<S* •’  8t|ukt»k.,),»k) 


"■»  s  ;nt; 


l.).H.l->  l  ) 


*.i  *  t . . 

V*  *i<H<Vi»« 
V  VH'V*1 


(44a) 


ii)  Predlcitlon  (between  observations  t^  4.  t  <^+1 ) 


dxltlt.  ) 

-g~-  -  * 


♦  3 


1  3  JC  (x  1 1  It^)  ,t.k] 


1  fltk|tkl  * 


,  '  - - = -  .  PltltJ 

1  >i7(t|tk) 


»  rit)P(t|t,.i  ♦  p(t|tfc)rT(t)  * 


dp(t!tk) 

--3T - T  E'‘"-k'i. 

♦  (o[*(ti,t]s(t)oT(x(U,t]  |tk] 

definition  irtth  «< — JJTO -  * 

m[x(U, t]u(t>> 


■3xrcr 


x>i(t|tk>; 


(45a) 


(45b) 


m  A 

Remark*  the  approximation  of  (G  g  G  ]  la  possible  by 
T 

i)  expanding  G  g  G  up  to  second  order  and  taking  then  the  expectation  operation. 

This  yields  a  Torn  as  in  Eq.  (40b) . 

T 

ii)  expanding  G  up  to  second  order,  generating  the  product  G  Q  G  and  taking  the  expecta¬ 
tion 

lli)  expanding  0  only  to  first  order,  generating  the  product  GQG  and  taking  the  expecta¬ 
tion. 


The  procedure  i  is  more  convenient  than  ii  while  ill  is  of  advantage  with  regard  to  the 
existence  and  uniqueness  of  the  solution  /J6/. 


The  structure  of  this  modified  Gaussian  second-order  estimator  is  illustrated  in  Pig. 9. 
If  we  omit  the  blocks  containing  the  second  order  terms 


a2f 
^  ,£' 


*2(B  U) 


3x‘ 


i  P 


a2h 

— »  i  p 

3x2  “ 


(46) 


we  get  the  continuous-discrete  extended  Kalman-Bucy  filter. 


As  mentioned,  the  extended  Kalman-Bucy  filter  has  often  been  applied  due  to  its  simple 
structure  and  low  oanputaticn&l  burden.  A  good  survey  of  early  nonlinear  filter  applications  is  given  e.g. 
by  Sorenson/37/  in  1973. 

One  popular  field  of  application*  of  nonlinear  filtering  is  the  combined  estimation  of  states  and  parameters 
in  linear  and  nonlinear  systems.  In  this  case,  a  dynamical  modal  for  the  unknown  parameters 
is  assumed,  the  parameter  equations  are  added  to  the  system  model,  and  the  "state  variables" 
of  this  augmented  model  are  estimated.  However,  the  application  of  only  a  first  order  filter 
ms y  cause  biased  estimate*  /3  B/  as  a  result  of  emitting  higher  coder  terms  like  (46)  in  the  filter  approxi¬ 
mation.  That  is  why  modifications  of  the  ax tended  Kalman  filter  have  been  propoaed  /39/.  On 
the  other  hand,  aacond  order  filters  such  as  the  modified  Gaussian  second-order  filter  have 
been  succeefully  applied  for  parameter  estimation  problems  /40 /,  /41/,  /42/, 


It  should  be  noticed  however,  that  second-order  filters  are  generally  less  robust  than 
first  order  ones  end  that  they  have  a  smeller  region  of  convergenoy.  Therefore  the  initial 
conditions  should  not  be  too  bad  and  a  special  initialisation  procedure  like  a  least  squares 
parameter  estimator  will  be  of  advantage  /43/. 


Apart  from  the  filters  derived  in  the  preceding  paragraph,  there  exist,  of  course , var lous 
other  approximations  which  can  not  be  diecussed  in  this  paper  but  are  contained  to  a  large 
extent  in  standard  textbooks  on  estimation  theory  /23/,  /24/,/30/,/44/) .  Me  just  mention 
stochastic  approximated  f liters (/45/,  /«6  /)  where  stochastic  approximated  polynomials  are 
substituted  for  the  Taylor  saries  expansion  of  the  nonlinear  systems  functions  ftx)  and 


‘“V*"  * 
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4.  GLOBAL  APPROXIMATIONS  OP  NONLINEAR  FILTERS  ( BAYES- LAW-CALCULATION ) 

The  only  consideration  of  the  first  and  second  moments  of  the  density  as  we  did  in 
the  local  filter  approximations  of  paragraph  4  is  of  questionable  value  if  the  a  priori 
density  p  £x(to),toj  and/or  the  noise  processes  jw(t)J  ,  {v(t3  are  not  Gaussian,  i.  e. 
asynmetrical  or  even  multimodal. 

In  these  cases,  the  implementation  of  the  Bayesian  recursive  estimation  equations  (23) 
may  bo  necessary.  This  requires  the  approximation  of  the  complete  conditional  densities. 
As  a  result  the  validity  of  the  estimator  is  not  restricted  to  certain  reference 
points  in  the  state  space,  hence  we  speak  of  global  approximations  of  nonlinear  filters. 

Assuming  the  time-discrete  nonlinear  system  model  (Eq.(11))  we  obtained  the  corresponding 
Bayesian  estimation  equations  (23)  which  have  to  be  calculated  in  the  following  steps. 

i)  t^tp;  evaluation  of  the  filter  density  p  fx(to)  I  Xto  ]  after  the  first  observation 
y(t^)  according  to  Eq.  (23a) . 

-  calculation  of  the  product 

Pv(t0)U(to)'^fc(to)j]*  Pfe^olj 

-  normalization  by  solution  of  the  nonlinear  convolution  integral  (23b) 

ii)  evaluation  of  the  a  priori  (prediction)  density  p[x (t1 ) |^(tQ]jby  performing  the 

nonlinear  convolution  in  Eq.  (23c)  with  -  ~  ,  >-n 

—  O 

ill)  t"ti : calculation  of  the  new  filter  density  p  [x(ti)  |  Yti  1  using  the  observation 
y(t«)  as  under  i)  etc.  ”  ~ 

For  the  practical  solution  of  the  estimation  problem,  one  has  first  to  define  appropriate 
finite  collections  of  points  for  the  approximation  of  the  a  priori  density  p  [x  (t0)J . 

In  the  two-dimensional  case,  we  can  illustrate  this  by  a  grid  in  the  xi-X2-plane  where 
the  gridpoints  are  the  references  on  which  the  approximation  can  be  based.  This  grid  can 
be  maintained  or  redefined  for  every  sampling  interval  (floating  grid);  the  latter  is 
often  numerically  more  effective  /47/.  With  known  grid,  the  method  of  approximating  the 
density  has  to  be  established  as  well  as  the  method  of  numerical  integration. 


4,1  Orthogonal  Series  expansion  approximation 

The  approximation  of  the  density  function  by  orthogonal  functions  or  polynominals  has 
frequently  been  proposed  in  the  past.  Sorenson  and  Stubberud  / 48/  used  the  Edgeworth- 
seriea  which  consists  of  Hermits  polynomials.  However,  the  coupling  of  a  Hermlte  poly¬ 
nomial  with  a  Gauss-Hermite  quadrature  /49/»  /SO/  seems  tc  be  more  encouraging. 

Moreover,  a  Fourier  series  expansion  may  be  of  use  for  applications  with  periodic  proba¬ 
bility  density  functions  (e.  g.  in  phase -modulation  problems)  /Si/. 

However,  the  Inherent  disadvantage  of  all  these  approximations  by  orthogonal  series  ex¬ 
pansions  is  the  fact,  that  the  resulting  density  functions  are  not  really  density  func¬ 
tions.  They  can,  for  instance,  assume  negative  values  which  finally  yields  divergency 
of  the  estimator.  In  order  to  avoid  this  phenomenon,  the  number  of  terms  in  the  series 
have  to  be  increased  which  unfortunately  augments  the  computational  burden  at  the  same 
time.  Therefore,  another  approaah  has  been  suggested j  it  is  the 


4.2  Gaussian  sum  approximation 

In  this  esse  the  densities  in  the  Bayes-lsv  calculator  are  approximated  by  non-orthogonal 
functions  i.  e.  a  weighted  sum  of  Gaussian  probability  densities.  This  has  been  proposed 
by  Alepach  and  Sorenson  /52/,  /53/.  For  example  a  density  p(x)  is  approximated  by 

<1 

*>a<£>  -& l  Yii'Ei*  (47.) 

i»l 

with  the  definition 

Njjis,  8) »  •  (21?)‘n/2(det  B)_1/2e*p  {  -  j(x-s)  V’ (x-a) }  (47b) 

and  the  nonnegative  weighting  coefficients  characterised  by 

q 

2  ai  -  1  ♦ 
i«i 


(47c) 
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This  approximation  is  motivated  by  the  fact  that  pa  converges  uniformly  to  p  for  a  large 
class  of  densities. 

The  mean  values  represent  the  grid  points  for  the  approximation.  These  grid  points 
are  selected  uniformly  over  the  region  where p $0 is  significantly  different  from  zero. 

The  covariances  Pi  are  defined  to  be  diagonal  (CTj*-!)  and  Ol is  selected  to  minimise  the 
deviation  between  p  and  pa.  The  coefficients  *i  are  chosen  to  be  proportional  to  the  values 
of  the  density  function  at  the  grid  points  $i,  and  the  number  of  terms  g  has  to  be  increased 
until  a  suitable  approximation  is  obtained.- 

The  quality  of  the  Gaussian  sum  approximation  is  illustrated  in  Fig. 10  where  a  uniform 
distribution  is  approximated  by  this  method  /54/. 

Now  in  view  of  the  filter  realization  using  Gaussian  sums  the  proceeding  can  be  the  follo¬ 
wing  t 

i)  the  a  priori  density  p[x(to)]is  approximated  by  Gaussian  sums. 

ii) the  evaluation  of  the  filter  density  requires  the  calculation  of  the  product 

Pv(tQ)  [xtoHl  C*tto)l]  •  f>[x<to>]. 

The  result  is  unfortunately  no  longer  a  normal  distribution.  Therefore, 

Ui)  the  density  pv  [ y-fc (x) ]  will  be  linearized  around  each  grid  point}  than  an  extended 
Kalman  filter  can  be~applled  at  each  grid  point  to  calculate  xitto  |  to)  and  Pi (to  t  to), 
the  terms  of  the  Gaussian  sum  approximation  of  the  a  posteriori  density  p  Qc(toT  |  to] 

iv)  After  that  the  prediction  density  p[x(ti)  |  tQJ  is  evaluated  in  a  similar  way  as  under 


The  result  of  the  Gaussian  sum  approximation  for  realisation  of  Bayes-law  calculators  essen¬ 
tially  requires  the  parallel  operation  of  so  many  extended  Kalman- f liters  aa  we  have  terms 
in  the  Gaussian  sum. 

In  view  if  Fig.lOit  should  be  noticed  that  nearly  50  (t)  extended  Kalman  filters  are  neces¬ 
sary  for  the  implementation  of  a  Gaussian  sum  approximate  filter  with  uniform  a  priori  pro¬ 
bability  density. 


4.3  Point  mass  approximation 

This  approximation  may  be  regarded  ae  a  special  case  of  the  Gausnian  sum  approximation 
where  the  different  probability  densities  at  the  grid  points  are  Dlrec 4 -impulses  with  gi¬ 
ven  area  (the  point  masses).  Hence,  the  resulting  density  Is  always  positive. 

For  example,  tha  approximation  of  the  a  posteriori  density  can  be  written  aa 

q  r  .  i 

pCx(tk|Yt  3  •  2  Vj  <«»> 

*  i-1 

with  the  q14  grid  points  *i  (1-1,..., q)  and  the  corresponding  point  masses  mj. 

This  approximation  has  been  given  by  Buoy  /SS/.  For  reduction  of  computet ional  burden,  e 
‘floating  grid*  has  bean  suggested  in  /4?/.  The  grid  is  centered  at  the  actual  position  of 
the  conditional  mean  x  and  the  eigenvectors  are  used  to  define  the  principal  axes  of  the 
grid.  With  the  grid  points  given,  the  Bayesian  recursion  relations  are  readily  evaluated, 
which  Is  essentially  equivalent  to  using  s  rectangular  integration  rule  to  accomplish  tha 
numerical  quadratures . 


4.4  Spline  approximation 

In  this  approximation  the  grid  points  are  the  meah  points  for  the  interpolation  of  tha  pro¬ 
bability  densities  with  multidimensional  cubic  apllns  functions.  This  approximation  has  been 
suggested  by  0e  Flguelredo  and  Jan  /S6/  in  1971  for  realisation  or  discrete-time  nonlinear 
filters. 

the  probability  densities  approximated  by  splines  assume  no  negative  values  end  the  numeri¬ 
cal  treatment  of  the  spline  based  filters  is  comparatively  simple. 
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SUMMARY 

This  chapter  presents  a  unified  approach  to  derive  estimation  algorithms  for  discrete  and 
continuous  uonlinear  systems  with  and  without  delays,  corrupted  by  white  noise  as  well  aa  non-white 
noise  processes. 

In  the  case  of  continuous  systems ,  filtering  algorithms  are  derived  for  nonlinear  systems 
without  delays,  iai>edded  in  white  noise,  correlated  noise  and  noise  free  processes.  The  filtering 
equations  obtained  for  nonlinear  systems  with  white  noise  processes  are  exact,  but  for  non-white  noise 
processes  the  results  obtained  are  approximate. 

In  the  case  of  discrete-time  systems,  nonlinear  estimation  algorithms,  that  directly  yield 
the  flxad-lag,  fixed-point  and  fixed-interval  smoothing  and  the  filtering  algorithms,  are  derived  for 
nonlinear  delayed  systems  with  measurements  corrupted  by  white  noise  and  correlated  noise  processes. 
The  derivation  is  straightforward  and  clearly  indicates  the  close  links  between  three  different  class¬ 
ification  of  smoothers  and  the  filtering  estimator. 

for  systems  with  polynomial,  product-type  or  state-dependent  sinusoidal  nonlinearities,  the 
proposed  algorithms  can  be  practically  realised  without  the  need  of  approximation  under  the  assumption 
that  the  estimator  errors  are  Gaussian.  Such  an  assumption  is  significantly  different  from  the  most 
commonly  used  assumption  that  the  state  la  Gaussian. 
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SECTION  Is  GENERAL  INTRODUCTION 


1.1  INTRODUCTION 

The  Importance  of  linear  estimation  theory  aa  represented  by  Kalman-Bucy  filter[l]  is  now 
well  recognized.  It  has  found  extensive  applications  in  aerospace  systems,  such  as  Apollo  and  space 
shuttle;  guidance  and  navigation  systems;  econometrics,  seismology  and  meteorology,  biomedical, 
communications,  and  many  other  practical  scientific  and  engineering  problems. 

However,  dynamic  system  models  and  measurement  models  for  the  majority  of  realistic  control 
problems  are  inherently  nonlinear.  Most  of  the  work  in  nonlinear  estimation  is  very  theoretical,  some 
no  more  than  a  philosophy  of  approach  rather  than  a  procedure  leading  to  the  derivation  of  practical 
estimations. 

One  of  the  main  lines  of  attack  to  the  nonlinear  estimation  problem  is  the  probability  approach 
pioneered  by  Stratonovich  [2],  and  subsequently  taken  up  by  Kushner  [3],  Wonham  [4)  and  Bucy  [5].  The 
truly  optimal  nonlinear  filters  for  systems  corrupted  with  additive  white  noise,  were  given  in  Kushner 
(.3, 6],  however,  their  exact  solutions  required  infinite  dimensional  systems  which  are  practically 
Impossible  to  realize  except  in  trivially  simple  cases.  For  practical  realization,  extensive  work  has 
been  carried  out  to  approximate  the  nonlinear  filters.  One  group  of  papers  [7-9]  attempts  to  obtain  the 
numerical  solution  using  the  so-called  Bayesian  point  of  view.  They  assume  to  have  a  completely  valid 
probability  description  of  the  system,  so  that  Bayes  rule  can  be  applied  to  obtain  a  recursive  description 
of  the  a  posterioH  probability  density  function.  However,  in  many  realistic  problems  the  mathematical 
model  of  the  system  contains  uncertainty  that  cannot  be  priorly  modeled  by  probability  distribution 
functions.  Furthermore,  the  Bayesian  approach  has  the  disadvantage  of  imposing  a  rather  severe  comput¬ 
ational  burden  for  even  simple  systems.  Another  group  of  methods  [10—143  essentially  approximates  the 
mean  and  variance  of  the  a  posteriori  density  functions  based  on  perturbation  relative  to  a  prescribed 
reference.  The  majority  of  these  techniques  employ  the  Taylor's  series  expansion  of  the  dynamic  system 
and  measurement  nonlinearities,  neglecting  second-  and  higher  -  order  terms.  Recently,  Sunahara  [15] 
proposed  to  replace  the  nonlinear  functions  by  quasi-linear  functions  via  stochastic  linearization.  In 
general,  methods  based  on  Taylor's  series  expansion  suffer  from  the  defect  of  replacing  global  distribut¬ 
ion  properties  of  a  function  by  its  local  derivatives  aggravated  by  corruption  of  noise  processor.  Thus 
it  is  questionable  whether  the  more  sophisticated  approximation  [16]  provide  useful  Improvements  relative 
to  the  widely  applied  first-order  approximation  known  as  the  extended  Kalman  filter. 

In  the  areas  of  nonlinear  smoothing  estimation,  Leondes  et  al.  [17]  derived  the  exact  functional 
equations  for  the  smoothing  density  functions  and  the  smoothed  estimates;  but  their  solutions  are  pro¬ 
hibitive  except  in  trivially  simple  cases.  Other  works  on  nonlinear  smoothing  were  presented  by  Kailath 
and  Frost  [18],  Lalniotls  [19]  and  Lee  [20], 

Aa  far  as  state  estimation  problems  for  systems  with  time  delays  are  concerned,  Kwakemaak 
[21]  used  the  method  of  orthogonal  projection  to  derive  filtering  equations  for  linear  continuous  systems 
with  multiple  time  delays,  which,  when  solved,  also  yield  smoothing  estimates.  Priemer  and  Vacroux  [22, 
23]  later  considered  the  estimation  problems  in  linear  discrete  systems  containing  multiple  delays  in 
the  message  model.  Farooq  and  Mahalanabis  [24]  rederived  the  estimation  algorithms  of  Priemer  and 
Vacroux  using  the  state  augmentation  technique,  however,  it  was  pointed  out  in  [22]  that  the  augmentation 
of  state  vectors  has  the  effect  of  increasing  the  dimensions  of  the  system,  and  thus  lead  to  a  filter 
that  is  computationally  inefficient.  Biswas  and  Mahalanabis  [25]  presented  fixed-lag  smoothing  algorithms 
for  the  continuous  systems  with  time  delays  by  first  discretizing  the  continuous  systems  and  then  employ¬ 
ing  the  state  augmentation  technique.  This  was  further  extended  to  the  fixed-interval  smoothing  problem 
by  Faroog  and  Balssubromanlan  [26],  Approximate  smoothing  and  filtering  equations  were  also  derived  by 
Yu  et  al  [27]  for  a  general  class  of  nonlinear  functional  differential  systems. 

1.2  SCOPE  OF  THIS  CHAPTER 


This  chapter  is  devoted  to  the  derivation  of  nonlinear  estimation  algorithms  for  discrete  and 
continuous  nonlinear  dynamic  systems  with  and  without  delays  in  the  message  models,  corrupted  by  white 
Gaussian  nolee,  correlated  noise  and  colored  noise  processes. 

8ectlon  2  follows  the  presentation  of  (26,  30]  and  deals  with  continuous-time  nonlinear  systems 
without  delays  corrupted  with  additive  white  noise  as  well  as  non-white  noise  processes.  The  basic 
approach  makes  use  of  the  matrix  minimum  principle  together  with  the  Kolmogorov  [31]  and  Kushner  [3,6] 
aquations  to  minimize  the  error-variance,  taken  to  be  the  estimation  criterion.  However,  the  exact 
algorithms  derived  in  such  a  manner  require  Infinite  dimensional  systems  to  realise,  which  la  computation¬ 
ally  impossible.  In  order  that  the  estimation  algorithms  can  be  physically  realized,  It  is  assumed  that 
the  conditional  probability  density  functions  of  the  estimator  errors  are  Gaussian.  Techniques  are 
presented  to  show  how  one  can  exactly  evaluate  expectations  of  polynomial,  product-type  or  state-dependent 
sinusoidal  nonllnearletes  under  the  above  assumption. 

For  the  purpose  of  assessing  the  performance  of  the  proposed  minimum  variance  filter  and  to 
compare  it  with  various  other  approximate  finite  dimensional  filters.  Liang  [32]  selected  various  types 
of  nonllnaar  ayatama,  which  wars  simulated  on  a  digital  computer.  Hla  results  clearly  indicate  the 
superiority  of  the  proposed  minimum  variance  filter  over  those  of  other  filters  investigated,  and  theor¬ 
etical  explanations  are  also  given  for  the  apparent  poor  performance  characteristics  of  the  various  other 
filters  considered. 

Section  3  follows  the  presentation  of  [26,33]  to  derive  dlacreta-tlma  filtering  and  smoothing 
algorithms  for  nonllnaar  tlne-dalayed  systems  Imbedded  In  white  Gaussian  noise  processes.  The  main  tech¬ 
nique  makes  ust  of  the  matrix  minimum  prlnclpls  to  dsrlve  the  optimal  values  of  the  coefficients  In  the 
estimation  algorithms  under  the  requirements  that  the  estimates  ba  unbiased.  The  resulting  algorithms 
can  ba  recursively  evaluated  under  the  assumption  that  tha  probability  density  function*  of  the 
estimator  arrora  are  Gauealan.  Examples  are  included  to  illustrate  the  use  of  the  proposed  estimation 
algorithms,  tn  particular,  they  provide  better  Insight  aa  to,  how  one  can  properly  substitute  for  tha 


discrete-time  Indices,  in  order  to  arrive  at  the  filtering,  fixed-interval  smoothing ,  fixed-point 
smoothing  and  the  fixed-lag  smoothing  algorithms.  Results  pertaining  to  linear  problems  are  directly 
deduced  from  the  nonlinear  estimation  algorithms,  they  agree  well  with  those  derived  in  the  literature, 
using  other  optimization  techniques. 

Section  4  deals  with  discrete  nonlinear  time-delayed  systems  imbedded  in  correlated  noise 
processes  [2f>,34].  The  derivation  as  well  as  the  presentation  follow  that  of  Section  3.  Similar  to  that 
of  Section  3,  the  linear  estimation  algorithms  for  these  problems  can  be  obtained  from  the  nonlinear 
estimation  algorithms. 

Section  S  summarizes  results  presented  in  Sections  2  to  4. 

SECTION  2 

EXACT  AND  APPBOXINA’re  KTNIKDM 
VARlFNai- FILTERING- FOR~NONLINEAR 


CONTINUOUS  SYSTEMS 


2.1  INTRODUCTION 

This  section  is  devoted  to  the  estimation  problems  of  nonlinear  continuous  systems  without 
delays  corrupted  by  (1)  additive  white  Gaussian  noise,  (2)  correlated  noise,  and  (3)  noise-free  processes. 

In  Section  2.2,  the  noise  processes  are  assumed  to  be  Gaussian  white,  the  basic  approach  makes 
use  of  the  matrix  minimum  principle  to  minimize  the  error-variance  cost  function,  which  is  obtained  from 
the  exact  conditional  probability  deuslty  function,  as  presented  in  Kuahner  [3,6]  and  Kolmogorov's 
equations  [3].  Therefore,  it  is  not  unexpected  that  the  exact  nonlinear  filtering  equations  derived  for 
white  noise  problems  closely  resemble  those  of  Bass  et  al.  [13],  except  the  argument  of  the  expectations 
have  been  transformed  from  the  state  x(t)  into  its  estimator  error  x(t).  Section  2.3  shows  that  for 
filtering  problems  with  polynomial,  product-type  or  state-dependent  sinusoidal  nonlinearities,  the  filter¬ 
ing  algorithms  can  be  fully  realized  without  any  other  approximations  under  the  assumption  that  the 
conditional  probability  density  functions  of  the  estimator  errors  are  Gaussian. 

In  many  practical  situations,  nonlinear  dynamic  systems  are  imbedded  in  non-white  noise 
processes.  Therefore,  Sections  2.4  and  2.S  deal  with  more  general  nonlinear  estimation  problems,  they 
are  respectively.  Imbedded  in  correlated  noise  and  noise  free  processes,  and  the  colored  noise  problem 
can  be  considered  as  a  special  case  of  the  noise  free  estimation  problem. 

The  estimation  algorithms  derived  for  non-white  noise  processes  are  auboptlmal,  since  the  change 
in  probability  density  function  due  to  the  differential  measurements  6z  is  neglected.  In  the  special  case 
that  the  system  and  measurement  models  are  linear,  the  resulting  algorithms  are  optimal  and  agree  well 
with  those  of  the  literature  [35,36], 

2.2  OPTIMAL  MINIHUM  VARIANCE  CONTINUOUS  NONLINEAR  niTTOHC  WITH  WHITE  NOISE  PROCESSES 

Consider  the  class  of  nonlinear  systems  described  by  the  stochastic  differential  equation  [37] 
f[x(t),t]  +  G[x(t),t]w(t)  (2,1) 

with  the  measurement  given  by 

y(t)  -  h[x(t),t)  +  v(t)  (2.2) 

Where  x(t)  and  y(t)  are  the  n-dimenslonal  stats  and  m -dimensional  measurement  vectors,  f  and  h 
are,  respectively,  n-and  m-dlaenalonal  nonlinear  vector  valued  functlona,  and  G  Is  a  vector  valued 
matrix. 

The  random  voctors  v(t)  and  v(t)  are,  statistically  independent  zero-mean  white  Gaussian  noise 
processes  euch  that  for  all  t,  t  >  tQ 

Cov(w(t),  w(t)}  -  ¥w(t)  4(t-t) 

Cov(v(t),  v(t)}  -  yt)  fi(t-T)  (2.3) 

end  Cov(w(t) ,  v(t)}  •  0 

where  i(.)le  the  Dirac  delta  function,  and  the  variances  ¥  (t)  and  ¥  (t)  are  non-negative  definite  and 
positive  definite,  respectively. 

The  Initial  state  vector  x(t  )  ■  x  is  a  aero-mean  Gaussian  random  process,  independent  of  v(t) 
end  v(t)  for  t  >  t0>  with  a  positive  definite  variance  matrix 

Ver(x(t0),  x(to)>  -  Vjt(to) 

In  the  typical  filtering  problem,  it  is  required  to  compute  x(t) ,  the  unbiased  estimate  of 
x(t)  conditioned  on  the  eat  of  neaeureatenta 

T(t)m  [y(e)/t0  <  a  <  t], 
such  that  the  coet  function  for  t  >  tQ 

J(T)  -  E{[x(T)  -  a(t))T  M(t)  lx(T)  -  «(t)J/Y(t)}  (2.4) 

la  minimi rad.  Bare  M(T)  ia  an  arbitrary  symmetric  positive  definite  matrix,  T  the  matrix  transpose  and 
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E  denotes  the  expectation  operator  conditioned  upon  the  set  cf  measurements  Y(t) . 

The  filtering  algorithm  is  assumed  to  satisfy  the  general  nonlinear  differential  equation 

i(t)  *i[«(t),  t]  +  K(t)  y(t)  (2.5) 

where  i.[8(t),  t]  and  K(t)  are  yet  unknown.  Then  the  estimation  problem  is  to  determine  the  time 
varying  nonlinear  vector  valued  function  i[£(t),  t]  and  the  gain  algorithm  K(t)  such  that  the  cost 
function  of  Equation  (2.4)  is  minimized. 

In  fact,  the  nonlinear  filtering  equation  may  be  assured  to  take  various  forms,  however,  once 
enough  information  concerning  £(t)  and  y(t)  are  Included  in  the  estimator  model,  the  resulting  filtering 
algorithms  would  be  unique.  For  example,  other  dynamic  filtering  equations  such  as 

£(t)=  £(x(t)  ,  t]  +  K(t)  (y(t)  -  E[y(t)  ] }  (2.6) 

can  also  be  assumed.  Comparing  the  structures  of  Equations  (2.5)  and  (2.6),  it  is  obvious  that  an 
extra  term  -K(t)  E(y(t)l  is  included  in  Equation  (2.6),  however,  the  resulting  filtering  algorithm  using 
either  one  of  the  above  estimator  models,  would  result  in  exactly  the  same  nonlinear  filtering  algorithm. 

Now,  let  x(t)  denote  the  estimator  error  defined  by 

x(t)  -  x(t)  -  $(t)  (2.7) 

using  Equations  (2.1),  (2.2),  and  (2.5),  the  derivatives  of  Equation  (2.7)  becomes 
2<t)  -  ftx(t)  +  S(t),  t)  +  G(x(t)  +  i(t),  tl  w(t) 

-  l(*(t),  t]  -  K(t)  (h[x(t)  +  i(t),  t]  +  v(t)>  (2.8) 

%  Siucf  <(t)  la  required  to  be  an  unbiased  estimate,  it  therefore  requires  the  expectations  of 
both  x(t)  and  'u(t)  be  zero.  Hence,  if  the  expectations  of  both  sidesof  Equation  (2.8)  are  taken,  it  is 
necessary  that* 

iWt),  t]  -  t(x(t)  +  «(t),  tl  -  K(t)  (h[x(t)  +  x(t),  tl>  (2.9) 

where 

ftx(t)  +  x(t) ,  t]  -  E{f[x(t)  +  x(t),  t]/V(t)> 

h(x(t)  +  x(t) ,  tl  -  E(h(x(t)  +  x(t),  tl/Y(t)} 

Then  the  estimator  error  can  be  shown  to  satisfy  the  relation 

x(t)  »■  f*tx(t),  *(t),  t]  +  G*[x(t) ,  x(t),  tj  w*(t)  (2.10) 

where 

f*[x(t),  x(t),  tl  ■  f  (x(t)  +  x(t) ,  tl  -  f(x(t)  +  x(t),  tl 

+  K(t)  (hlx(t)  +  x(t) ,  t)  -  h(x(t)+  x(t) ,t) } 

and  G*[x(t),  x(t) ,  tl  *  (C(x(t)  +  x(t),  tl  -  K(t)l 

Furthermore,  Equation  (2.2)  can  be  rewritten  as 

y(t)  -  h[x(c)  +  x(t) ,  tl  +  v(t)  (2.11) 

now  the  filtering  problem  of  Equations  (2.1)  and(2.2)  has  been  transformed  into  that  of  Equations 
(2.10)  and (2. 11). 

let  ♦[x(t)l  ba  s  twice  continuously  differentiable  function  of  the  vector  x(t);  by  definition 
of  the  conditlonel  expectation  operator 

dE(«lx(t)l)  -  *(x(t)l  dp(x(t),  t/Y(t)]  d  x(t)  (2.12) 

where  p(x(t),  t/Y(t)l  te  the  conditional  probability  density  function. 

(text,  the  chense  In  p(x(t),  t/Y(t)l  due  to  the  dynamic  equations  of  (2.10)  snd(2.11)  must  he 
computed,  tt  can  be  shown  that  (371 

fip  «  p(x(t  +  6t),  t  +  dt/Y(t) ,  6y)  -  p(x(t),  t/Y(t),  dy) 

♦  p|x(t),  t/Y(t),  dyl  -  p(x(t),  t/Y(tYl  (2.1J) 

where  the  first  two  terms  arm  simply  the  change  due  to  tha  dynamic  aquation  of  (2.10)  and  the  last 
two  terms  art  due  to  tha  differential  measurements  dy.  These  two  changes  are  given  by  Kolmogorov  and 
Kuahnar *t  equation  {5,6] .  Therefore, 

ilatenl -  E(L  ♦tx(t))}  ♦  K(*(x(t)Hhlx<t>  ♦  i(t),  tl  -  hix(t)  ♦  i(t),  till 
Yv(t)‘l(y(t)  -  h(x(t)+x(t),  tl)  (2,14) 


where 


L<(x(t)]-E  f}[x(t),  x(t),  tl.3tL«jtjJ._  l  (G*(x(t) ,  x(t),  t] 

1  1.3-1 
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*,*00  G*T[x(t).i(t).t]}  4 


a  ♦t«(Q] 

3xi  xj 


then  the  error-variance  equation  ia  obtained  from  Equations  (2.10),  (2.11),  (2.1A)  and  2.(15)  by 
setting  4>[i(t) )  »  x(t)  x  T(t). 


for  which 


dV^(t) 


-  E{x(t)  f*T[x(t),  x(t) ,  t]  +  f*(x(t),  x(t),  t]  xT  (t)}  +  G(x(t)  +  x(t),  t] 


yt)  GT(x(t)  +  x(t),  tl  +  K(t)  »v(t)  KT(t)  +  E{i(t)  3P(t)  hT[x(t)  +  ^(t),  t] 

-  V|»(t)  hT[x(t)  +  x(t),  t]}  »v_1(t)  {a(t)  -  h[x(t)  +  x(t),  t]}  (1 

where  it  can  be  shown  that 

E{x(c)  f*T[x(t),  x(t),  t]}«  E(x(t)  fT(x(t)  +  x(t),  t]  -  x(t)  hT[x(t)  +  x(t) ,  t]  KT(t)> 

Now  the  estivation  problem  takes  the  fora  of  an  optiaal  control  problem  in  which  K(t)  is  the  only 
variable  available  for  manipulation  such  that  the  cost  function  of  Equation  (2.4)  is  minimi red. 

To  use  the  matrix  minimum  principle  to  solve  this  problem,  a  symmetric  positive  definite 
coetate  matrix  P(t)  ia  defined,  and  the  Heal Itonian  is  given  by 

H  -  trace  (\N(t)  PT(t)} 

Using  the  concept  of  gradient  matrices  [38]  che  necessary  condition  which  the  optimal  K(t) 
must  satisfy  la  obtained  as 

K(t)  -  B(x(t)  hT[x(t)  +  x(t) ,  t])  tv~l  (1 

note  that  K(t)  is  Independent  of  the  costate  matrix  P(t)  and  the  weighting  factor  M(t). 

Then  the  filtering  estimate  and  the  error-variance  equation  become 

a 

x( t)  rn  f(x(t)  +  x(t),  t]  +  E{x(t)  hT(x(t) 

+  i(t),  t])  Yv"l(t)  (y(t)  h(x(t)  +  i(t),  t))  (3 


dVj(t) 


E{x(t)  fT(x(t)+i(t),  t]+f(x(t)  +  £<t>,  t]  xT(t)} 


■b  E{0(x(t)  +  x(t),  t]  Yw(t)  GT[x(t)  ♦  xU),  t])  -  Evx(t)  hTlx(t)  +  xu),  ci > 
fv’l(t)  E(h(x(t)+  i(t>,  tl  xT(t>)+  B(x(t)  xT(t)  hT(‘x(t>  ♦  £(t),  t] 

-V£(t)  hTlx(t)  +  i(t),  tl)  tv‘l(t)(y(t)  -  h[x(t)  +  x(t) ,  t])  (2.19) 

raspactlvaly. 

It  should  ba  noted  that  before  the  above  algorithms  can  be  physically  realised,  a  nuaber  of 
difficult  expectations  must  be  evaluated,  they  involve  Infinite  dimensional  systems  except  in  trivially 
simple  coses.  Also  notice  that  the  algorithms  derived  here  closely  rsssablt  the  exact  aquations  due  to 
Bass  al  [11].  Tha  only  difference  is  that  the  argument  of  the  expectations  have  been  transformed 
into  x(t).  Such  s  transformation  is  particularly  significant  for  filtering  problems  vlth  polynomial, 
product-type  or  state-dependent  sinusoidal  noollneerltlee.  Since,  in  such  cases,  the  filtering  algorithms 
can  be  obtained  without  any  further  approximations,  under  the  eaeuaptlon  that  the  probability  density 
functions  of  the  estimator  errors  are  Gaussian. 

Furthermore,  the  above  derivation  can  be  attended  to  filtering  problems  with  non-whits  noise 
processes.  However,  In  such  cases,  the  change  in  p[x(t) ,  t/Y(t)}  due  to  the  differential  measurement* 

4y  is  neglected,  and  Equation  (2,14)  becomes  simply 


t  t  til  x(t)J) 


It  is  noteworthy  to  mention  that  when  such  an  approximation  is  made,  the  retulte  obtained  for 
the  filtering  problem  corruped  by  white  notes  processes  era  equlvaltnt  to  that  of  the  stochastic 
linearisation  due  to  Suuhara  (IS).  In  such  a  case  the  random  forcing  term  in  the  error-variance 
equation  of  Equation  (2.19)  is  neglected,  whereas  the  filtering  algorithm  of  Equation  (2.18)  remains 
unchanged. 

When  Equation  (2,20)  Is  used  In  pices  of  Equation  (2.U)  to  obtain  the  error-variance  aquation 
sad  the  nonlinear  functions  are  approximated  by  second-order  Taylor  aeries  expansions,  tha  resulting 
algorithms  axe  commonly  called  modified  minimum  variance  filter  (18). 
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In  order  to  test  the  performance  of  the  proposed  minimum  variance  filter  and  to  compare  It 
vlth  various  other  approximate  nonlinear  filters  [10-13,  15-16],  Liang  [32]  selected  various  types  of 
nonlinear  systems,  the  stochastic  filtering  equations  are  transformed  to  Stratonovlch's  forms,  and  then 
simulated  on  a  digital  computer.  The  simulation  results  obtained  from  the  proposed  filtering  algorithms 
are  compared  to  various  other  approximate  nonlinear  filters.  His  results  Indicate  the  superiority  of 
the  proposed  filter  over  those  of  other  filters  Investigated. 

2.3  EVALUATION  0?  EXPECTATIONS 


The  purpose  of  this  section  is  to  Indicate  how  polynomial,  product-type  or  state-dependent 
sinusoidal  nonlinearities  can  be  evaluated  without  the  need  of  approximation  under  the  assumption  that 
the  estimation  errors  are  Gaussian. 

Polynomial  or  Product-Type  Konlinearitlea 

For  the  sake  of  completeness  In  presentation.  It  is  worth  mentioning  that  the  expectations  of 
higher  order  power  terms  can  be  easily  evaluated  using  the  following  Lemma. 

Lemma  li  Let{x(t),  tet}  be  a  zero-mean  Gaussian  process.  Then  all  odd  order  moments  of  x  vanish,  and 
the  even  order  moments  can  be  expressed  In  terms  of  the  second  order  moments  using  the  following  formula 
(39] 

E(x(t.).,.x(t  )}  »  EB[x(t^  )x(t.  )]...E[x(t1  )x(t.  )] 

1  l2  n-1  n 

where  the  sum  Is  taken  over  all  possible  ways  of  dividing  the  n  points  Into  n/2  combinations  of  pairs. 

The  number  of  terms  In  the  summation  is  equal  to  1.3.5. ..(n-3)  (n-1). 

It  should  also  be  noted  that  Lemma  1  can  be  rewritten  as 
K{x(tj')...x(tn)}"  EfxUjJzktj)}  E{x<tj)  ...x(tn>}+  E{x(tj)x(tj) ) 

E{x(t2)x(t4)  ...x(to))  . ..+E[x(t1)x(tn))E{x[t2)x(t j)  ...x(tnl)) 

which  Implies  that  nth  order  momenta  can  be  obtained  from  the  expectations  of  n-2th  order  mementa.  This 
Is  e  rather  useful  formula  In  the  evaluation  of  the  expectatona  of  higher  order  power  terms. 

Nonlinearity  Involving  State-Dependent  Slnuoaolds 

In  prectlcal  application  of  estimation  techniques,  one  often  encounters  nonlinear  terms  Involv¬ 
ing  state-dependent  sinusoids.  Fortunately,  the  expectations  of  such  typeof  nonlinear  functions  can  be 
rigidly  obtained. 

Lswm  2t  Let  Xj  end  Xj  be  jointly  normally  distributed  random  variable.  Then 
E  (  cos(x1  ♦  Xj))  -  la  <  exp  (JfXj+Xj)  -(Vu+ 

end  E  (  sln(x2  +x2)>  •  Im  (  exp  (Jfx^j)  -  Vu*  VJ2*  2VJ2)/2J) 


The  above  relationships  are  clearly  seen  from  the  definition  of  the  characteristic  function 
^  (u)<»  E  [e^u  x)«  exp  <JuTx  -  W^w) 


where 


sod 


UT»  {Uj,  Uj,...^,...^] 

xT«  l*l,  Xj . V'V1 

Vm  »[cnv  {*t«  Xj>  ] 


lemma  3t  If  an  operatnr  (u)  la  dadoed  such  that 

W"»-J 

them  we  could  easily  da  rive  the  following  relationships! 

(I)  where  e^  is  the  kth  wait  vector, 

(II)  \  ejV^)  c,  (•.)-  *(** 


where  C  <•_)•  exp  (Jx  -  Vm) 

•  ■  Q  J 

(til)  «  \*t  eJ*n)-  «VV<W!  , 

II 

sad 

(lw)  t  (xj  tj.-.x^e1**!}*  {(OjS  IjHOj-a  I2)  ...(dB+yi 
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where  we  heve 


odd  nuaber 

it  <J.»0 
i»l  1 


Using  these  identity  relationships,  one  could  easily  calculate  the  expectations  of  all 
Gaussian  distributed  state-dependent  sinusoids.  Foe  example  j 

*  y 

Btx^  x2  cos  Xj}-  l  V12+  (Xj+  j  VjjX* j  +j  V2J>]  Re{exp(jXj - 

■  {(V12  (  x3  x2  -  V31  V2J)]coa  x3  -  (x:  V23  +  XjV^j)  sin  x^}  e'V33//2 

and 

E  {XjXj  sin  Xj  WlVjj  +^xi*2  "  V31V23^  *in  *3  +^*lV23  **2V31^  co*  *3^ 

Konllnearitv  Involving  State-Dependent  Relays 

Section  2.2  noted  that  the  lmplementst ion  of  nonlinear  filtering  algorithm  requires  the 
evaluation  of  the  expectations  of  nonlinear  vector-valued  functions  f  and  h,  as  well  aa  products  of 
these  functions  and  estimation  errors  of  states. 

For  a  nuaber  of  applications,  f  and  h  say  sisply  be  some  forme  of  state-dependent  relays. 
Expectations  of  some  of  these  state-dependent  relays  are  tabulated  in  (40]. 

Illustrative  example t  Consider  a  nonlinear  system  with  message  and  measurement  models  described  by 

x(t)-  -  sin  x(t)>  u(t) 
and  y(t)«  xJ(t)+  v(t) 

respectively.  Where  u(t)  and  v(t)  arm  tero-mcan  white  Gaussian  noise  processes,  with  variances  F  (t)  and 
Fw(t),  respectively.  u 

% 

Assuming  that  x  (t)  is  a  Gaussian  process,  then  the  expectations  of  the  nonlinear  function#  can 
be  evaluated  as  follows 

E(*J(t))«  E((x(t)  ♦  x(t))5]  »  3V%(i 

* 

Elite  x(t)]  •  K(ein(x(t)  ♦  x(t)l)  •  i 

E(x(t)  sin  «(t))«  -  coe  «t(t)  V%(t)  « 

x 

and  *2  3  .  • 

E(s  (O  x  (t)J  «  9V%  (t>  *CO  ♦  Vjt] 

X  X 

hence,  Equations  (2-1#)  and  (2.19)  become 
I  ,  -V  (0/2  , 

m(t)»  -elm  x(t)  m  x  ♦(JVf%(t) 


““  ‘  -¥..(t)/2  . 

V^(t>»  e  x  coe»(t) 

x  » 

♦  (bV^ft)  x(t) )  ▼  ‘Sotyd) 

t 

respectively. 

Mots  that  in  the  evalsatton  of  the  expectation*  the  only  assumption  needed  Is  the  Gaussian 
assumption  of  the  estimator  error,  whereas  ail  other  finite  dlmeasimsi  algorithms  delete  eon*  of  the 
higher  order  terms  of  the  error-variance.  Wham  the  nonlinear  functions  to  Equations  (2.11)  and  (2.19) 
atm  appfoxlenud  by  Taylor  series  t span* let.* ,  the  result*  obtained  can  be  identified  with  «srtesi  other 
approximate  nonlinear  furring  algorithme  <c  the  literature  (10,  12,  13,  14). 

2.4  Ml  Ml  MIX  VAX1AMCE  CtWtm  COyTlMWO  MGMMMtAE  Him 

Consider  the  general  nonlinear  tueaagt  model 


.3 

t)  x(t>  ♦  x  (t) 

-VO/2 

sin  x(t)  e  a 

-yo/2 

X 


)  xS(t> 


.2  .  .3 

♦  JVN(t)  a  JV%(t)  x(t)  •i(()| 

s  x 


2  J  J  -1 

♦V  (t)  -  ( JV  (t)  ♦  3VJti  X  <»>]V  A(t) 


v  '  ’  V 

X  J  X 

-  JVN(t)  x(t)  -  i  (0) 


x(t)  -  f|x(l).t]  e«(x(t).tl  w(t)s  flx(t).t)  u(t)  (2.21) 

with  measurement  givtfl  by 

y(t)»  hU(t).t(*  v(t)e  s(t)  (2.22) 

where vu(t)  mi  t(t)  atm  know*  Input  time  functions,  w(t)  end  v(t)  arm  correlated  no 5s*  processes  with 
mean  {^(t)  sod  ^(t).  respectively,  end  also 


Oov(w(t) ,  v(t))  mf  CO  #(t-t) 


(2.23) 


i-0 

f. i  all  other  prior  statistics  follow  that  of  Section  2.2 

Following  the  development  of  Section  2.2  and  neglecting  the  change  of  p[3f(t),  t/Y(t)]  due 
to  che  differential  measurements  5y(t),  then  the  filtering  algorithm  is  given  by 

x(t)  =  f[x(t)  +  x(t),t]  +  G[x(t)+  x(t),  t]  uw(t)  +  Etx(t)+  x(t),  t]  u(t) 

+  K(t)  {y(t)  -  uv(U  -  z(t)  -  h[x(t)+  x(t) j  t]}  (2.24) 

where  the  gain  algorithm 

K(t)  -E{*(t)  hT[x(t)  +  x(t),t]+  G(x(t)+  x(t).t]Vw<t)}’l'v"1(t)  (2.25) 

and  the  error-variance  equation  is  given  by 

V^(t)  =E{x(t)  fT(x(t)+  x(t),t]  +  f[x(t)+  x(t) , t J  xT(t)}+  E{G[x(t)  +  x(t)  ,  t] 
x 

fw(t)GT[x(t)  +  x(t)  ,t]}  -  K(t)  yy(t)  i’.T(t)  (2.26) 

In  the  particular  case  that  f  and  h  are  linear,  G  and  E  are  independent  of  the  state  variable, 

namely 

f[x(t),t]»  F(t)  x(t) 
h(x(t) ,t)=  H(t)  x( t) 

G[x(t) ,t]=  G(t)  and 
E[x(t),t]=  E(t) 

Then  Equations  (2.24)-(2.26)  are,  respectively, 

A  A  A 

x(t)  =  F(t)  x(t)  +  G(t)ytf(t)  +  E(t)  u(t)+  K(t)  (y(t)  -Uy(t)  -  z(t)  -  H(t)  x(t)} 

K(t)  =[\(t)  HT(t)+  G(t)  '*,wv<t))’f  _1(t) 

X 

and 

\(t)  °F(t)  V^(t)  +  Vjt)  FT(t)  +  G(t)  Yw(t>  GT(t)  -  K(t)¥v(t)  KT(t) 
xxx 

These  algorithms  agree  welt,  with  the  general  continuous  Kalman  filter  [36]. 

2.5  MINIMUM  VARIANCE  CONTINUOUS  NONLINEAR  NOISE-FREE  FILTERING 

Consider  the  continuous  nonlinear  message  model  of  Equation  (2.1)  with  the  nolae-free  measure¬ 
ment  model  given  by 

y(t)-  h[x(t),  t]  ..  (2.27) 

where  w(t)  is  tero-mean,  white  noise  with  non-negative  definite  variance  ¥y(t) .  h(x(t),t)  is  assumed  to  be 
continuously  differentiable  in  t  and  has  continuous  second  mixed  partial  derivatives  with  respect  to  the 
elements  of  x  and  also  considered  to  be  of  full  rank,  otherwise  an  equivalent  y(t)  of  lower  dimension 
can  be  used. 

Since  y(t)  is  noise  free¥  (t)  is  non-negative  definite,  when  y(t)  ie  differentiated,  acme  of 
its  elements  may  not  contain  any  white  noise.  Therefore,  each  element  of  y(t)  has  to  be  differentiated 

d^i  "  ft  hl  +  l^15t^tr,lcelG'l’wGT(l^2Ui>1 

and  Equation  (2.1)  la  used  to  substitute  forthe  derivative  of  the  state  variable,  until  white  poise  it 
obtained  in  the  derivatives  of  each  element  in  y(t). 

The  signals  obtained  can  be  arranged  into  two  sett.  In  the  firat  set 
y^t)-  hjjx(t) ,  *tj+  N(x(c),t]w(t) 

which  comprises  all  derivatives  ofy<t)  that  contain  linearly  independent  white  nolee. 

A 

It  ie  assumed  that  the  filtering  estimate  ?(t)  le  given  by 

x(t)  -  i(x(t),t]  +  K^t)  y1(t)+  Kj(t)  y;(t) 

since  In  place  of  Equation  (2.27),  there  are  two  seta  of  mesauraaents. 

Here,  y.(t)  lr  considered  ae  a  known  input,  it  dost  not  contain  any  new  information,  and  follow¬ 
ing  the  development  of  Section  2.2,  the  following  filtering  algortha  le  obtained 

£<t)-  f(i(t)+  £<t),tl  •>  Kj(t){y2(t>  -  h2ll(t)  *  i(t),  fw(t).  tD 


with  the  gain  algorithm 

Kj(0»  E{x(t)  h2T(i(t)+  i(t),>rw(t),  t]  +  C[x(t)  +  i(t).tiyt)  NT[2(t)+  x(t),t]) 

(E{N(x(t)  +  x(t),t]Vw(t)  NT[£(t)  +  xM.t]})'1 
and  the  error-variance  equation  is  given  as 

T 

\(t)  -  E(x(t;  fT[£(t)  +  x(t),t]+  f[x(t)+  x(t),  t]  x  (t)>+E{G[x(t)+  x(t),t] 

X 

\(t)  GT[x(t)  +  i(t).tl)  -  K2<t)E{N[x(t)  +  x(t^t]Vu(t)NT[x(t)  +  x(t)  ,t]>K2T<t> 

In  the  case  that  the  vector  valued  functions  are  linear,  the  results  presented  here  agree  well 
with  that  of  Bryson  and  Johansen  [351* 


SECTION  3 

MINIMUM  VARIANCE  FILTERING  AND  SMOOTHING 


FOR  DISCRETE_NONLINEAR_DEUYED_SYSTEMS_Um  ADOITIVE_WHITE_NOISE 


3.1  INTRODUCTION 

this  section  deals  with  discrete-time  filtering  and  smoothing  estimation  of  nonlinear  systems 
with  multiple  delays  Imbedded  in  additive  white  noise  processes.  In  general,  the  filtering  algorithm 
enables  one  to  estimate  present  values  of  the  variables  of  interest  using  present  data,  whereas  the 
smoother  allows  one  to  estimate  past  values.  A  typical  smoothing  problem  is  the  post-flight  estimation 
of  the  flight  path  of  a  missile  based  on  tracking  system  measurements  during  the  entire  duration  of  the 
flight.  If  the  estimates  of  the  missile's  position  and  velocity  at  one  particular  flight  point  are 
desired,  the  estimates  can  be  based  upon  all  the  measurements  recorded,  Including  those  made  before  and 
after  that  particular  flight  point. 

In  the  estimation  problems  for  linear  systems  without  delays  involving  additive  white  noise 
processes,  ntnerous  papers  have  been  written  to  deal  with  filtering  and  smoothing  estimation,  however, 
most  of  them  are  merely  rederivation  of  earlier  recursive  algorithms  presented  by  Carlton  [41],  Rauch 
[42]  and  Bryson  and  Frasier  [43],  or  reformulation  of  problems  using  various  estimation  techniques. 

However,  Kelly  and  Anderson  [44]  pointed  out  that  the  algorithms  for  both  discrete  and  continu¬ 
ous-time,  linear,  flred-lag  smoothing  given  in  [43  ,  18,  42,  45)  may  be  unstable,  and  therefore  impractical. 
To  be  more  explicit,  although  the  fixed-lag  smoothing  equations  are  bounded-input  and  bounded-output  stable, 
realliatious  of  these  In  [42,45]  contain  a  subsystem  which  is  unstable  in  the  sense  of  Lyapunov,  In  [44], 

It  Is  pointed  out  that  the  apparent  culprit  is  an  uncontrollable  and  unstable  block  In  the  smoother  state 
equations  which  can  be  removed  without  affecting  the  input-output  characteristics. 

Tot 23 ] ,  a  computationally  stable  smoothing  algorithm  is  derived  for  linear  discrete  systems 
containing  tima  delays,  using  the  method  of  orthogonal  projection.  The  smoother  for  linear  discrete 
systems  without  delays  can  be  considered  #a  a  special  caae  of  the  above  problem,  with  time  deley  index 
setting  to  aero.  Tie  results  of  (23)  are  rcderlved  in  a  simple  manner  In  (24),  using  the  state  augment¬ 
ation  technique.  However,  the  smoothers  derived  in  such  s  manner  ere  of  nN  dimensions,  where  n  Is  the 
oidar  of  the  meeaege  model  and  N  la  the  amount  of  the  fixed-lag. 

In  this  section,  the  mattix  minimum  principle  is  applied  to  nonlinear  discrete-time  systems 
Involving  time  doleya,  with  measurement  sequence  imbedded  In  additive  white  noise  processes.  The  resulting 
dynamic  discrete  estimation  algorithms,  as  reported  In  [28,  46),  ere  recursive  In  nature  and  dlractly 
yield  tha  f Ixad-lntarval ,  fixed-lag,  ftxad-potnt  smooching  and  tha  filtering  algorithms.  The  derivation 
te  straight  forward  and  above  the  dost  links  between  the  smoothing  and  filtering  eatimatlon  algorithms. 


In  Section  3.2,  the  problem  statement  Is  presented.  Section  3.3  presents  the  derivation  of  the 
nonlinear  smoother.  Sections  3.4  to  3.7,  provide  handy  seta  of  refetenee  equations  respectively,  for  tha 
fixed-lag,  fixed-point,  fixed-interval  smoothing  end  the  filtering  algorithms.  Section  3.6  show*  the 
applicability  of  tha  presented  algor! tha*  to  linear  eatimatlon  problem*. 


THE  FROjLjH  STATEMENT 


end 


Tha  massage  and  measurement  models  for  the  discrete  nonlinear  time -delayed  systems  are  given  by 

x(k*l)«  |  f ,[x{k-n.),  k-o.)  ♦  Gls(k) ,kj*(k)  (3.1) 

3«o  3  3  3 

y(k)  -  h(m(k) ,k)  4  v(k>  (3.2) 


Here  s  the  etete  la  an  n-vector,y  the  measurement,  an  m-vectortw  the  random  Input,  an  r-vector;  v  tha 
measurement  noiee,  an  m-voctort  G,  a  nonlinear  state  dependent  n  x  r  matrix ik-  0,1,  ...  In  the  discrete 
tin*  index.  Tha  nonlinear  vector  valued  functions  f j  and  b  are,  respectively ,  n  end  m  -  dims  as local . 

Tha  integer  quantities  Oj  represent  time  delays  which  ere  ordered  euch  that 

VW  *  *  * 


The  random  vectors  v(k)  and  v(k)  are  independent  zero-mean  white  Gaussian  sequences,  for  which 
E{w(k)  wT(j)}=  fw(k)  4fcj 

E{v(k)  vT(j)}-  ¥y(k)  ^ 
and 

E{w(k)  vT(j) }=  0 

for  all  integers  k  and  j,  where  E{.)  denotes  the  expectation  operator,  6„  the  Kronecker  delta,  and  f 

jK  V 

and  y  are  a  x  m  and  r  x  r  positive  definite  matrices,  respectively. 

The  initial  states  x(0)  and  x(-a.)  for  J«l,  ...»  1  are  zero  mean  Gaussian  random  vectors, 
independent  of  v(k)  and  w(k) ,  with  a  positive  definite  covariance  matrix 

E{x(-aJ)xT(-aJl)}«  Vx(  a.y  a£) 
for  j,i  -  0,  ....  L. 

The  smoothing  problem  is  to  obtain  x(k-i+l/k+l) ,  the  unbiased  smoothing  estimate  of  x(k-i+  1) , 
with  0  <_  1<  k+1,  conditioned  on  the  set  cf  measurements 

Y(k+1)  -  {y(0) ,  y(l) ,  ....  y(k+l) } 
such  that  the  cost  function 

J(k+1)«  Trace  (M(k)  Vjk-i+l/k+1)  ]  (3.3) 

x 

is  minimized.  Here  K(k)  is  some  symmetric  non-negative  definite  weighting  matrix,  and  V^(k-i  +l/k+l)  is 

x 

V^(k-4  +l/k+l)-  E,  , ,  [  (x(k-i+l)  -  x(k-i-*-l/k+l)} 
x  *  1 

(x(k-t+l)  -  x(k-l+l/k+l)}T)  (3.4) 

where  E^f.)  denotes  the  expectation  operation  conditioned  on  the  set  of  measurements  Y(k+1) . 

3.3  THE  DERIVATION  OF  NOKLIKEAR  SMOOTHING  ALGORITHMS 


The  smoothing  algorithm  is  assumed  to  be  constrained  by  the  nonlinear  differential  equation 


x(k-l+l/k+l)+  J  b^xOt-t-aj/k),  k-t-a^  ]+  K*+1y(k+l) 


(3.5) 


Here,  the  assumption  of  linearity  in  innovations  Is  made. 
k-i-Oj ]  and  ere  yet  to  be  determined. 


L  * 

The  nonlinear  functions  r  b,[x(k-t-a./k) , 
3-0  J  J 


In  feet,  the  smoothing  equation  may  taka  various  forma {however,  it  la  essential  that  enough 
information  concerning  xflt-t-a^/k) and  y(k+l)  art  Included  In  tha  aatimator  noded.  for  example,  it  can 
be  shown  that  other  dynamic  equations  such  as 

li(k-£+l/k+l)  -  j  bj(x(k-t-<ij/k),  k-l-Oj  1+ g£+ly(k*l/k)  (3.6) 

A# 

where  y(k+l/k)“  y(k+l)  -  E.  (y(k+l)  Kwuld  result  in  exactly  tha  same  smoothing  algorithm  as  tha  one 
constrained  by  Equation  (3.5). 


Since  the  system*  considered  hare  are  non-linear  vitb  multiple  time  delays,  it  is  realistic  to 
assume  that  the  smoothing  estimate  is  a  linear  combination  of  theiva  of  nonlinear  functions  of 
x(k-l-a,/k) ,  to  account  for  the  tins  delay  characteristics,  and  the  present  measurement  y (k+1) .  Bare 
x(k-t-«j/k)  is  assumed  to  have  made  optimum  use  of  all  the  measurements  up  to  y(k). 

the  problem  formulated  in  such  a  manner  may  therefore  lead  to  a  smoother,  optimal  with  respect 
to  the  imposed  constraints,  but  not  Identical  to  the  truly  optimal  one. 


,  «,  Now  tha  estimation  problem  is  to  determine  tha  time-varying  nonlinear  vector  function 

r  b.(x(k-l-o./k),  k-t-o.J,  end  the  elgorithn  K,  .  such  chat  the  trace  of  V. (k-l+l/k+1)  is  minimised. 

J«o  J  }  3  x 

let  x(k-l41/k+l)  denote  the  amoothlng  error  defined  by 

x(k-i+l/k+i)  -  x(k-Ul)  -  x(k-l+l/k+l)  (3.7) 


Then  we  obtain 

*(k-U>l/Ml)«  i:  f,[x(k-l-o) ,  k-t-a.l  ♦  0(x(k-i),k-l  Jw(k-t) 
3-o  1  1  1 


-  £  b,U(k-t-o,/k), 

S-o  J  1 


k-tH|J)-«J+l{b{»<k61),Jefll  ♦  v(kn» 


In  order  that  x(k-tai/k+l)  la  an  uablaeed  emoo thing  estimate,  it  is  necessary  that 

i«o  kjtntk-t-Oj/k).  k-t-^J-jj^  fJ(x(k-t-oJ),k-l-aJ/k)-  *lkn  h(*(k+l) ,  kH/kl 


(3.B) 


(3.9) 


where 


f  j  [xttc-k-Oj) ,  k-i-a^/k]  »  E^  {fj[x(k-E-  ctj) ,  k-t-a^]} 
and 

h[x(k+l),  k+l/k]  »  Ej.  {h[x(k+X) ,  k+1]} 

Substituting  Equations  (3.9)  into  (3.5),  the ^smoothing  algorithm  becomes 
x(k-i+l/k+l)  -  x(k-i+l/k)  +  K^+1{y(k+l)  -  h[x(k+l),  k+l/k]} 

for  k  •  0,1,  ....  and  0  ±  k+1. 

Where  x(k-i,+l/k)=  j  f  [x(k-E-a.) ,  k-t-a,/k] 
j=o  J  J  J 

Substitution  of  Equations  (3.10)  into(3.7),  yields 

5t(k-i+l/k+l)»  x(k-i+l/k)  -  Kj+1(&(x(k+l)/kl  +  v(k+l)} 


(3.10) 


(3.11) 

(3.12) 


where 


h(x(k+l)/k]  »  h(x(k+l) ,  k+1]  -  htx(k+l),  k+l/k] 


(3.13) 


Therefore,  the  smoothing  error-variance  equation  is  given  by 

V  (k-f*l/fctl)«  V%(k-i+l/k)  -  B^xft-Ul/k)  frT(x(k+l)/k]}  k£+1T-k£+1 
x  x 

A  T  n  fltp 

Ek{&t*(k+l)/k]  x  (k-t+l/k)}  +k£+1  »v(k+l) 

+  Kj+1Ek(S'(x(k+l)/k]  &T[x(k+l)/k]}^1  <3-14> 

i 

Here,  K.  is  the  only  variable  available  for  manipulation.  The  necessary  condition  for 
minimising  the  trace  of  (k-l+l/k+1)  and  subject  to  the  constraint  of  Equation  (3.14)  is  provided  by  the 

Mtrlx  minimus  principle  *(29],  for  which  is  considered  as  the  control  variable. 

The  necessary  condition  can  now  be  otained  from  the  condition 


.  Trace (Vjk-t+l/k+1)  ]  »(0] 


where  (0]  is  the  null  matrix,  and  the  result  is 

k£+1  «E^ (x(k-t+l/k)  ftT[x(k+l)/k])  Ifv(k+1)  +  E^ftW+D/k]  frT[x(k+l)/k]rX 

and  Equation  (3.14)  la  reduced  to 

VJk-l+l/k+1)  ■  V^k-t+l/k)  “l^+jLEk  (fi[x(k+l)/k]xT (k-i+l/k) } 

Using  Equations  (3.1)  snd(3.11),  there  la  the  relation 

x(k-t+l/k)-  ^  ^[x(k-i-aj)/k]  +  Q(  <(k-l)  ,k-t]w(k-i) 


where 


and 


and 


(3.15) 

(3.16) 

(3.17) 


si'tjWk-t-Oj),  k-t-ay  -  f j(x(k-*raJ) ,  k-t-^/k] 

«k(i(k-t+l/k)8ft(a(k+l) /k] )-  Ek(^J  Kk-t-Oj)  /k$T tx(k+i)/k] } 

Than  the  following  recursive  smoothing  algorithms  can  also  be  derived: 

V  (k-t*l»k/k)«  itt  >,[x(k-l-a,)/k]xT(k/k)) 
i  *  j-o  3  3 

V^(k-t+l ,k-m+l/k)m  J-0Ek{?1(k-t-<»1)/k]  V  ^(x(k-m-o^/k])+  C[x(k-1),  k-t]  fv(k-l)  CT(x(k-«)  ,k-m] 
4  k-l,k-ei 


(3.18) 


V^(k-l+l  ,k-»+l/k+l)  •  V%(k-t+i,  k-m*l/k)  -  kJ^tfUtt+D/k]*  (k-m+l/k)} 


(3.19) 

(3.20) 


for  0  <  l,m<y+l. 


For  k<0,  there  is  no  input  to  the  smoother  and  therefore  one  can  set  x(-a./-l)  to  zero  for 


j  »  0,1 . .  which  in  turn  leads  to 


y 


x(-ct,/-l)  ■  x(-a.) 


j 


y 


and 


V“°j.'V‘1)  =  Vaj*at} 


for  j,l  =  0,1,  ...»  L. 


Since  the  snoothing  estimator  is  unbiased,  the  expectations  that  are  in  Equations  (3.16)-(3.20) 
can  be  replaced  by  the  following 

T 

Ek{x(k-i  +l/k)  fr  [x(k+l)/k]  }=  E^xOcr-l  +l/k)hT[x(k+l) ,  k+1]} 

T  T 

Ek(ft'[x(k+l)/k]&  [x(k+l)/k]}  =  Ek{h[x(k+1),  k+l]hT[x(k+l) ,  k+1]}  -  h[x(k+l)  ,k+l/k]h  [x<k+l>  ,k+l/k] 

(3.21) 

and  T 

Ek{?i[x(k-2-ai)/k]  £  jlxCk-i-a^/k]}  =  E^f^xtt-i-Oj) ,  k-t-a^] 

A  aT 

f*[x(k-i-a.j),  k-fc-o^]}  -  f1(x(k-2-a1),  k-l-a^k]  f  Wk-l-Oj),  k-2-a  /k]  (3.22) 


It  should  be  noted  that  in  the  cose  of  nonlinear  systems  the  above  expectations  require 
infinite  dimensional  systems  for  their  realization,  the  problem  of  obtaining  a  good  approximation  to  the 
above  expectations  is,  therefore,  of  practical  importance. 

For  practical  realization,  it  is  assumed  that  the  conditional  probability  density  functions  of 
the  smoothing  error  x  are  Gaussian,  then  the  expectation  can  be  obtained  without  any  further  approximation 
for  systems  with  polynomial,  product-type  or  state-dependent  sinusoidal  nonlinearities. 


To  evaluate  the  above  expectation,  one  can  make  uae  of  Equation  (3.7)  to  replace  the  state 
variables  by  the  sum  of  their  respective  estimators  and  the  estimator  errors.  For  example,  in  the 
scalar  cane  of 

h[x(k+l) ,  k+1]  «*  x3(k+l) 


it  can  be  shown  that  ^  32 

B^xtk-i+l/k)  &  [x(k+l)/k]}-  Ek{x(k-t+l/k)  [x  (k+l/k)  +  3  x  (k+l/k) 

A  a2  as3 

x(k+l/k)  +  3  x(k+l/k)  x  (k+l/k)  +  x  (k+l/k) J) 

*2 

-  Vjk-t+1,  k+l/k)  x  (k+l/k)  +  3  Vjk-l+1, k+l/k)  VJ k+l/k) 

X  XX 


Notice  that  in  the  evaluation  of  ths  expectation,  the  only  assumption  needed  is  the  Gaussian 
assumption  of  the  estimator  error.  And  it  can  be  easily  shown  that  if  the  nonllnearltles  were  expanded 
in  terms  of  first  or  second  order  Taylor, a  series,  the  last  term  in  the  proceeding  equation  would  have 
been  droppod. 

Similarly,  it  is  obtained  from  Equations  (3.21)  and  (3.7) 

T 

Ek{5»lx(k+l//k]  5>  lx(k+l)/k]}  -Ek(x6(k+1)>  -  Ek(x3(k+1)}2  -  (15  V3(k+l/k) 

2  ^  ^  * 

+  A5  V  2 (k+l/k)  x  (k+l/k)  +15  Vn (k+l/k)  x  (k+l/k)+  x  (k+l/k)] 

.  * 

-  [x  (k+l/k)+  3  V  (k+l/k)  x(k+l/k) ]2  -  15  V3(k+l/k) 
x  x 

>.  a2  *4 

+  36  V*(k+l/k)  x  (k+l/k)  +  9  V  (k+l/k)  x  (k+l/k) 
x  x 

The  preceeding  evaluation  is  rather  straightforward.  In  ths  partlculsr  cssa  that  higher-order 
terms  do  not  improve  system  accuracy,  one  can  delete  the  higher -order  terms  and  approximate  ths  expectat¬ 
ions  up  to  the  arbitrary  order,  ea  desired. 

3,4  TOE  jjWMWA*  FIXED- LAG  SHOOTtllHC 


Replace  k+1  sad  l  by  k  sad  N,  respectively,  where  H<X,  from  Equations  (3.10) ,(3.11)  sad  (3.16) 
-(3.20)  one  would  then  obtain  the  following  recursive  nonlinear  fixed-lag  smoothing  algorithms! 

x(k-«/k)  «  x(k-N/k-l)+  X?J{y(k)  -  h(x(k) ,  k/k-U)  (3.23) 

A  |  A 

x(k-N/k-l)m  t  f  Jx(k-R  -1  -  a.),  k-N-l-a,/k-U  (3.24) 

J-°  J  T  j  j  t 

kJ  -  Ek_l(l(k-ll/k-l)  &  (x(k)/k-UH»v(k)  +  Ek_l(&(x(k)/k-l]  &  (x(k)/k-l])]“2 


(3.25) 


(3.26) 


Vjk-N/k)  -  V^k-N/k-l)  -  K?  EL  ,{ ti  (x(k)  /k-l]  xT(k-N/k-l)  } 

XX  * 

T 

Vjk-S,  k-m/k)  »  VJk-t,  k-m/k-1)  -  Ek_1{ft[x(k)/k-l]x  (k-m/k-1) } 


L  L 

Vk-l,  k-m/k-1)  *=^50  B^fjMk-l-t-a^/k-l]  fJT[x(k-l-m-aj)/k-l]}+  G[x(k-l-i)  ,k-l-2] 


^(k-l-A)  G1  [x(k-l-m)  .k-l-m]^ 


Also  ue  have 


V  (k-K,  k+l/k)  -  E.  {x(k-N/k>  j  2  (x(k-a  )/k]}  (3.28) 

x  *  j=o  j  J 

L  L  T 

Vfk+l/k)-!  s  K.  #.(x(k-a,)/k]  %  U(k-a,)/k]}+ G[x(k),  k)  ¥  (k)  GT[x(k),k]  (3.29) 

x  i=o  j=o  *  j  3 

3.5  THE  NONLINEAR  FIXED-POINT  SMOOTHING 

Setting  t  *  K  -  1W-1  where  k+1  >  H,  from  Equatiow  (3.10)  ,(3.11)  and  (3.16)-(3.20)  one  would  then 
have  the  following  recursive  non-linear  fixed-point  smoothing  algorithms: 
x(N/k+l)  »  x(N/k)  +  k£^+1  (y(k+l)  -  h(x(k+l),  k+l/k]) 

T  T 

K^~”+1  =  *  f^xOJ/k)  ft  (x(k+l)/k]}  (fv(k+l)  +  Ek{S'[x(k+l)/k]  ft  [x(k+l)/k])]-1 

VjN/k+1)  =  V%(N/k)  -  k£“”+1  Ev(h[x(k+l)/k]  x  T(B/k) } 

X  X 

L  L  T 

V .(H/k)  =  Z  l  EL  {f.[x(S-l-a.)/k]  t(x(u-l-a.)/k]}+ G[x(H-l).N-l]'l'  (N-l)  GT(x(N-1),H-1] 
x  i=o  j=o  *  1  1  3  3 


L  T 

V  (N,  k+l/k)  =  E.  (xtN/k)  l  t  [x(k-a.)/k]} 
x  j=o  j  3 


V%(kH/k)  and  V^(k-1,  k-«/k)  am  respectively,  the  same  *s  Equation*  (3.28)  and  (3.27)  and 

x  x 

also  fron  Equation  (3.20),  it  can  be  shown  that 

T 

\<H,  k-«/k)  -  \(N.  k-m/k-1)  -  k£"N  E^_^{ft[x(k)/k-l]x  (k-m/k-1)} 

3.6  THE  NONLINEAR  nXKP-lHTERVAI,  SHOOTHISG 

Setting  k+l"N ,  t<*N-k,  where  H>k,  Equations  (3.10)  ,(3.11)  and  (3. 16)— (3.20)  would  yield  the 
following  recursive  nonlinear  fixed-interval  smoothing  algorithms: 

x(k/N)  »  *(k/H-l)  +  K^"k  (y(S)  -  h(x(H) ,  N/K-XJ) 

A  L  A 

x(k/K-l)  -  Z  f.tx(k-l-a,).  k-l-ct./H-l] 

J-o  J  3  3 

T  T 

l£'k  ■  E^&k/H-l)  ft  (x(N)/H-l]>  iy»>  +  R^ftfxOO/H-l]  ft  (x(H)/N-l]}f 1 

T 

V  (k/»  -  V^k/H-l)  -  {ft(x(K)/H-l]  x  (k/H-1)} 

XX  It  1 

LIT  T 

\(k/»-l>-  Z  l  EB_1(ftIx(k-l-<J1)/S-l)  f  j[x(k-l-cij)/H-l]]  +  0(x(k-l),k-l)tw(k-l)CT(x(k-l), 

X  1*°  j"°  V-l  1 


-  I  M 

\(k,  E/X-l)-  Ejj_^{x(k/H«l)  t  ^  (xOi-l-a^/N-l]} 


The  nonlinear  filtering  algorithm  can  be  easily  obtained  from  Equations  (3.19)  ,(3.11)  and 
(3.16)-(3.20) ,  by  setting  1*0. 

x(k+l/k+l)  »  s(5t+l/k)  ♦  ^+l(y(k+l)  -  h[x(k+l) ,  k+l/k]) 

A  L  A 

x (k+l/k)  -  Z  f.(x(k-o4),  k-a./k] 


K^-Ej^Oc+l/k)  £  [x(k+l)/k]}  [yk+l)+  ^{&[x(k+l)/k]  ¥rT[x{k+l)/k]}]-1 

and  T 

VJk+l/k+1)  -  V^(k-tl/k)-K°. +.Ek{fr[x(k+l) /k]  x  (k+l/k)} 

X  X 

whereas 

(k+l/k)  is  given  by  Equation  (3.29) 
x 

3.8  ESTIMATION  IK  LINEAR  DISCRETE  SYSTEMS 

In  order  to  provide  an  insight  into  the  structure  of  the  smoothed  estimate ,  consider  the 
particular  case  of  linear  systems  and  measurements  with 

L  L 

Z  f,[x(k-a.),k-a, ]*■£  F,(k)  x(k-a.) 

j  o  J  J  J  j«0  j  j 

h[x(k),k]  B  H(k)  x(k) 

and 

G[x(k) ,k]  =  G(k) 

Then  the  linear  fixed-lag  smoothing  algorithm  can  be  directly  obtained  from  Equations  (3.23) 
to  (3.28),  respectively  as  the  following 

x(k-N/k)  -  x(k-N/k-l)  +  K^{y(k)  -  H(k)  x(k/k-l)> 

x(k-N/k-l)  -  Z  F  (k-N-1)  x(k-N-l-ct,/k-l) 
j»o  3  2 

K?  -  V  (k-N,k/k-l)  HT(k){H(k)  V  (k/k-1)  HT(k)+  ¥  (k?)-1 
*  x  x  v 

V  (k-N/k)  =»  V  (k-N/k-1)  -  K?  H(k)  V  (k.k-N/k-l) 

3f  2  ^  5 

and  I,  L 

V  (k-i,k-m/k-l)  «  Z  Z  F.  (k-l-l)  V  (k-l-t-e. .  ,k-l-m-a. /k-1) 

%  i-o  j-o  1  J  A  J 

Fj(k-m-l)  +  G(k-l-t)  yk-l-t)  GT(k-l-£)6k_£(k_, 
h  T 

Also  V(k-K, k+l/k)  -  Z  V^(k-N,k-a./k)  F  (k) 

X  J»«0  X  3  2 

and  finally, 

L  I.  _  _ 

V^(k+l/k)  -  ^  ^  Pt(k)  V, (k-a^ ,k~Oj /k) F*(k)  +  G(k)  ¥y(k)  CT(k) 


It  can  be  easily  Identified  that  the  structure  of  the  above  linear  fixed-lag  smoother  la 
almply  the  stable  fixed-lag  smoother  Introduced  by  Fn.aasv  and  Vacroux  [23].  In  the  same  manner,  one 
can  Identify  the  linear  fixed-point  amoot'  ,r  for  linear  systems  without  delays  with  that  of  Biswas  and 
Mahalanabls  [47],  Similarly,  the  results  presented  in  Section  3,6  and  3.7  can  ba  easily  applied  to  yield 
the  linear  fixed-interval  smoothing  end  the  filtering  algorithms,  respectively. 

Thus,  a  unified  tprroach  to  obtain  the  filtering  sal  smoothing  algorithms  for  linear  as  well 
as  nonlinear,  delayed  as  vel ,  as  nondalayed  systems  has  been  presented. 

The  reaults  pr-asntad  In  this  settloo  can  ba  extended  to  continuous  time  problems  through  a 
formal  limiting  procedure  [48],  The  presented  approach  can  also  be  extended  to  nonlinear  distributive 
eystama  with  or  without  i -slays. 


SECTION  4 


4.1  INTRODUCTION 


MINIMUM  VARIANCE  FILTERING  AND  SMOOTHING  FOR  NONLINEAR 
SYSTEMS  WITH  CORRELATED  NOISES 

s  ssa  K  o  a  8B=:c3«  ess  tt 


In  Section  3  the  nolee  processes  considered  are  assumed  to  be  Gaussian  white  and  mutually  Indep¬ 
endent,  however,  In  practical  situations  such  assumptions  are  often  Invalid,  since  in  physical  systems, 
independent  white  noise  processes  simply  do  not  exist.  It  is  therefore  natural  to  extend  the  estimation 
technique  developed  in  Section  3  to  more  realistic  problems,  where  message  noise  processes  are  correlated 
with  measurement  noise  processes. 

Recently,  Raja  Rao  and  Mahalanabis  [49]  derived  estimation  algorithms  for  linear  systems  with 
delay  imbedded  in  correlated  noise  processes,  however,  their  results  appear  to  have  a  fundamental  mistake 
in  the  procedure  given,  which  leads  to  self-contradictory  results;  this  is  reported  in  [50], 

In  this  section,  estimation  algorithms  are  derived  for  nonlinear  discrete  delayed  systems  with 
measurements  imbedded  in  correlated  noise  processes.  The  derivation  assumes  that  the  smoothing  estimator 
introduces  new  data  in  a  linear  additive  fashion  and  makes  use  of  the  matrix  minimum  principle  to  minimize 
the  error-variance  cost  functional.  The  resulting  estimation  algorithms  as  reported  in  [28,51],  can  be 
easily  applied  to  yield  the  fixed- lag,  fixed-point,  fixed- interval  smoothing  and  filtering  algorithms, 
by  properly  substituting  the  time  indices. 

4.2  THE  PROBLEM  STATEMENT 


Consider  a  discrete  nonlinear  time-delayed  system  modeled  by 

x(k+l)  -  £  f,[x(k-a,),  k-n.]  +  G[x(k),  k]w(k)  (4.1) 

j-o  j  j  j 

vith  measurement  given  by 

y(k)  -  h[x(k) ,  k]  +  v(k)  (4.2) 

where  the  state  x  is  an  n-vector;  the  measurement  y  an  m-vector;  the  state  noise  sequence  w  an  r-vector; 
the  measurement  noise  v  an  m-vector;G,  a  nonlinear  state  dependent  nxr  matrix.  The  nonlinear  vector 
valued  functions  f  and  h  are,  respectively,  n  and  m  dimensional,  a. represents  a  time  delay  sequence 
ordered  such  that  ^  ^ 

oQ  <  »!  <  o2  <  ...  < 

The  noise  sequences  w  and  v,  are  zero-mean  white  Gaussian  with  non-negatlva  definite  covariance 
T  and  positive  definite  Vv,  respectively.  Also 

B(v(k)  vT(J)}  -  Tw  <k)  \  j 

for  ell  integers  k  and  j,  where  tw  is  non-negative  definite. 

The  initial  states  x(0)  and  x(-o.),  for  j  “  1,  ...,  L  are  sero-meen  Gaussian  random  vectors, 
which  are  independent  of  v(k)  and  w(k),  with  a  positive  definite  covariance  matrix 

K{x(-<*j)  xT  (”«t))  -  V^cyaj) 

for  J,  i  -  0,  1 . L. 

The  smoothing  problem  is  to  obtain  the  unbiased  smoothed  estimate  8(k-l+l/k+l)  of  the  state 
x(k-t+l) ,  where  0  <  t  £  k+1,  conditioned  on  the  set  of  measurements 

Y(k+1)  -  (y(0),  y(l) . y(k*l)> 

such  that  the  following  error-variance  coat  functional  is  minimised i 

J(k*l)  -  Trace  [H(k>  VJk-l+l/k+1)]  (4.3) 

x 

where  H(k)  is  a  symmetric  positive  definite  weighting  matrix,  end  V^(k-t+l/k+l)  is  defined  by 

x 

V^k-i+l/kH)  -  Rk+1[(*(k-i+l)  -  8  (k-i+l/k+1) )  (x(k-W-l)  -  8(k-t+l/k+l) )T)  (4.4) 

4.3  TO  DERIVATION  OF  THE  SMOOTHER 

With  reasoning  similar  to  that  of  Section  3,2,  the  smoothed  estimate  is  assumed  to  be  constrain¬ 
ed  by  the  nonlinear  dynamic  equation 

8(k-i+l/kfl)  -  b^tOt-l-Yj/k),  k-t-Yj)  +  y(fcfl)  (4.5) 

L  . 

^  b^JS  (k-l-Y^/k)  ,k-i-Y^]  and  are  yet  to  be  determined. 


where 


(4.6) 


Since  It  Is  required  that  the  smoothed  estimate  be  unbiased,  It  Is  necessary  that 

L  L  —  ,  . 

E  bj  [x  (k-l-Yj/k),  k-l-Yj]  -  f  j[x(k-t-aj) ,  k-t-a^/k]  -K£+1  h[x(kfl),  k+l/k] 

where  f.j[x(k-t-a.j) ,  k-.l-Oj/k]  =  yfj  [x(k-l-a^) ,  k-l-Oj]}  (4.7) 

and  h[x(k+l),  k+l/k]  -  E^hlxOc+l),  k+1]} 

With  Equation  (4.6)  substituted  Into  (4.5),  the  smoothed  estimate  becomes 

x(k-l+l/k+l)  -  x (k-l+l/k)  +  K^yfle+l)  -h[x(k+l),  k+l/k]}  (4.8) 

for  k  *  0,1,2,  ...,  and  0  £  1  <  k+1. 

L  - 

where  x(k-t+l/k)  -  E  f  [x(k-l-a,)t  k-l-a./k]  (4.9) 

J  J  J 

The  error  state  equation  is  simply 

x(k-l+l/k+l)  -  x(k-t+l/k)  -  K^+1(ft[x(k+l)/k]  +  v(k+l)}  (4.10) 

where  ft[x(k+l)/k]  »  h[x(k+l),  k+13  -  h[x(k+l),  k+l/k] 

Then,  the  error-variance  equation  is  given  as 

T  T 

Vjk-t+l/k+l)  -  (k-l+l/k)  -  K^.1Ek{S(x(k+l)/k]  x  (k-l+l/k)}  -  Ek(x(k-i+l/k)  ft  [x(k+l)/k]} 

T  T 

•  (Kk+l)T  +  *£+1  +  K£+iyft[x(k+l)/k]  ft  tx(k+l)/k]}(ktkfl)1’(4.11) 

The  necessary  condition  for  mlnlmlxing  the  trace  of  V  (k-l+l/k+1) ,  can  now  be  obtained  from 
the  condition  x 


- - Trace  [V.  (k-l+l/k)]  -  [0] 

8Kk+l  * 

Hence,  the  gain  algorithm  la  given  by 


*  *  ip 

kJ+1  -  Ek(x(k-t+l/k)  ftT(x(k+l)/k]}  (Tv(k+1)  +  yft(x(k+l)/k]  ft  [x(k+l)/k]}]“1 

(4.12) 

and  Equation  (4.11)  beconea 

Vjk-t+l/k+1)  -  V^k-l+l/k)  -  K^+1Ek{ft[x(k+l)/k]  x1  (k-l+l/k) } 

(4.13) 

x 

and  VJk-l+1,  k-sH-l/k+1)  -  Vjk-M-1,  k-m+l/k)  -K^yftlxOt+D/kJx  (kns+l/k)} 

(4.14) 

for  0  <  i,  e  <  k+1. 

How,  there  remains  the  problee  of  evaluating  V  (k-l+l/k)  in  Equation  (4.13). 
stractlng  Equation  (4,9)  from  Equation  (4.1)  would  x  easily  yield 

Even  though  aub- 

x(k-l+l/k)  -  j„0  ^Ix(k-l-aJ)/k]  +  G(x(k-t),  k-t]  w(k-l) 

(4.15) 

where  by  definition 

fjtxOt-t-y/kl  -  fJ(x(k-l-oJ),  k-l-y  -yxfk-l-y,  k-t-o^/k] 

it  Is  teen  that  the  error-variance  equation  cannot  be  obtained  by  taking  the  expectation  of  Equation 
(4.13)  multiplied  by  its  cam  transpose,  since  the  expectation 

yxOc-t-y/kl  wT(k-t)} 

can  not  he  explicitly  evaluated. 

On  the  other  hand,  following  tha  derivation  presented  above,  the  unbiased  estimate  of  the  state 
x(k-t+l)  can  be  obtained  as 

*  1  *  «  * 
x (k-l+l/k)  -  fJ(x(k-l-oJ),  k-l-Oj/k-i]  +  x*(y(k)  -  h(x(k),  k/k-lj) 


(4.16) 


which  In  turn  leads  to 


x (k-t+l/k)  -  5  „  f,[x(k-i-a,),  k-l-a,/k-U  K*{ft[x(k)/k-l]  +  v(k)}  +  G[x(k-i)  ,k-£]w(k-i)(4.i7) 
j-0  J  J  J  K 

The  optimal  value  of  the  matrix  can  be  determined  from  the  necessary  condition  that 
— -  WOO  V.  (k-i+l/k>]  -  [0]  (4.18) 

*4 

Since  V  (k-t+lk)  is  the  expectation  of  Equation(4.17)  multiplied  by  its  own  transpose,  then 


using  Equation  x  <4.18),  the  result  is 


a  41  »\»  YT  r  1 

-[Ek_1(  E  fjtxCk-t-Ojl/k-l]  h  (x(k)/k-l]>  +  G[x(k-i),  k-i]  ^(k)  4k,k-iJ 


[>fv(k)  +  E^_^{h[x(k) /k-1]  h  tx(k)/k-U)l' 


and  V (k-l+l/k)  is  simply 


V  (k-t+l/k)  -  I  E.  .{f  [x(k-i-a.)/k-13  f.  {x(k-t-a.)/k-l]) 

5  i,J-o  *-x  i  x  3  j 

+  G[x(k-i),k-ll'i'u(k-l]GT(x(k-i),k-l] 

-  x^H'vw(k-i)GT[x(k-t),k-l]  «k(k_i 

+Ek-ltklx(k)/k~1]  L0  ^jlx<k-t'aj)/k-lll)  (4‘2 

also  V  (k-l*l,  k-m+l/k)  ■£,  E.  ,{f .tx(k-t-o  )/k-llf]'[x(k-m-a.)/k-l])  +  G[x(k-1)  ,k-l]1!  (k-l) 

5  i,J-o  *t-l  i  i  j  j  w 

•  5k-i,k-mGTlx(k“B)  ^-«]-<ktYvw(k'm>Sk,k-mGT(x(k_n)  •kH*l 
L  T 

+  ^{htxCW/k-ll  ^J^Ixtt-m-a^/k-l]}!  (4.21) 

for  0  <  l,  ■  <  k+1. 

When  the  discrete  time  index  k<0,  there  is  no  input  to  the  smoother  and  therefore  8(-a  /-l) 
la  set  to  aero  for  j  ■  0,  1,...,  L,  which  results  in  3 

x(-«j/-l)  -  *(-«‘j> 


V°1  ,-ot/-l)  -  V  {«,  ,o  ) 
x  4  J 

for  j,  l  ■  0,  1,  ...,  t. 

On  the  other  hand,  since  the  smoothing  estimator  la  unbiased,  the  expectations  that  are  in 
Equations  (4.12)  to  (4.14),  and  (4,19)  to  (4.21)  can  be  replaced  by  Equations  (3.21)  to  (3.22)  and  also 

Ek_1(litx(k-4-«1)/k-l)  Mk-m-a^/k-lJ)  -  Ek_1(f1[x(k-t-o^ f *Cx(k-m-Oj) ,  k-m-<^]} 

-  f1Cx(k-l-«1),  k-t-^/k-il  f J [ x (h-m-o^ ) , k-m-o^ /k- 1  ] 


(4.22) 

It  should  be  noted  that  In  the  case  of  nonlinear  systems  the  above  expectations  require  Infinite 
dimensional  ays terns  to  realise. 

Therefore,  as  an  approximation,  It  la  assumed  that  the  conditional  probability  density  functions 
of  the  smoothing  error  £  are  Gaussian.  Again,  it  Is  important  to  note  that  under  such  an  assumption,  the 
algorithms  presented  In  thla  section  can  be  phyeically  realised  without  any  further  approximation  for 
eystene  with  polynomial,  product-  type  or  etatm-depmadaat  sinusoidal  non  linearities. 
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Also  notice  that  three  different  types  of  smoothing  and  the  filtering  algorithms  all  follow 
immediately  from  Equations  (4.8),(4.12)-(4.14),(4.16)  and  (4.19)-(4.21)  with  the  following  substitutions: 


The  Replacement 

The  Replacement 

for  k+1 

for  i 

The  filtering  estimation 

k+1 

0 

The  fixed-lag  smoothing 

k 

N 

The  fixed-point  smoothing 

k+1 

k-N+1 

The  fixed-interval  smoothing 

N 

N-k 

4.4  THE  NONLINEAR  FILTERING 

When  i  is  set  to  zero,  Equations  (4.8) , (4.12)-(4. 13) ,(4.16)  and  (4.19)  to  (4.20)  become  the 
filtering  algorithms. 


Namely: 

x(k+l/k+l)  -  x(k+l/k)  K^tyfk+l)  -  fi[x(k+l)  ,k+l/k] } 
i(k+l/k)  -  j!_o  fj[x(k-oi)),k-a1/k-U  +  K^{y (k)-h[x(k)  ,k/k-l]} 

K^+i  -  Ek(x(k+l/k)  h  T(x(k+l)/k]>  fvv(k+l)  +  ^{hlxOttD/k;  ?'T[x(V.+l)/k]}]'i 

T 

V^k+l/k+1)-  V,(k+l/k)  -  K?  K.  (h[x(k+l/k)]x  (k+1)} 

X  X 

V  (W-l/k)  -  E  E.  ,{f .  tx(k-o. )/k-l]  f,T(x(k-a.)/k-l]>  +  G(x(k>,kJ  ?  (k)  GT(x(k),k} 
x  i»j“0  K  i  1  1  1  .1  w 

->w(W  <»T[x(k) ,k]  +  Ev._1(h(x(k)/k-ll  ^  f*  [x(k-a,)/k-lj }] 

and  finally 

*k  “  tW*  0VX<k"‘,j)/k“1]  ^T[x(k)/k~1l)  Gt*<k>.k) 

.  ’»wv(k)ntv(k)  +  Ek_1(h(*(k)/k-i]  hVw/k-inr1 

4.5  ESTIMATION  IN  LINEAR  DISCRETE  SYSTEMS 


in  this  section,  general  linear  estimation  algorithms  are  obtained  for  discrete  delayed  syatama 
corrupted  by  correlated  noise  processes.  The  algorithms  can  be  easily  converted  to  yield  the  fixed-lag, 
fixed  point,  fixed-interval  smoothing  and  the  filtering  estimators  together  with  their  respective  error- 
variance  equations.  This  is  done  by  making  the  proper  choice  of  discrete  Indices . 

Tor  linear  discrete  ayetems,  we  have 

£  1, 

J-oVx<k"V‘k“*jl  •  fj(k)  x  (k-o^) 


h(x(k),k)  •  «(k)  x(k) 
end 

G(x(k) ,k]  -  0(k) 

Then  the  general  linear  estimation  algorithms  are  as  follows « 
S(k-l+l/k+l)  •  S(k-l+l/k)  +  (y(fc+l) 

-  H(k+1)  t(k+l)/k)) 

1  j 

*<k-m/k>  -  I  yj(k-t)  fitk-l-Oj/k-l)  ■HCjiyfk)  -H(k)i(k/k-l) ) 

)£.,•  V  <k-t+l,k+i/k>  HT(k+l)  (T  (k+1)  +  UT(k+l)  V  (k+l/k)  HT(k+l)]_1 

x  * 


V  (k-l+l/k+1)  -  Vjk-t+l/k)  -  K^..  a(k+l)  V  (k+l.k-l+l/k) 

X  X  I 

VJk-t+l/k)  -  k  T  (k-DV^tk-t-^dt-f-O./k-l)  r!(k-l)  ♦  C(k-l)  Tw(k-1)  CT(k-l) 
x  1.J-0  1  x  J  J 


-k5[»  (k-t)GT(k-i)  +H(k)  l  V  (k,k-t-a./k-l)  F*(k-a  )] 

*  j-o  x  3  J  3 

“  fs  F. <k-l)v  (k-i-a. ,k/k-l)  HT(k)-KS(k-l)  Y  (k)«.  .  .] 
k  j,0  j  \  J  ■  wv  k,k-£ 

.  [Yv(k)  +  H(k)  V^(k  /k-l)HT(k)]-1 

X 

and  V  (k-t+1,  k-afl/k)  -  S  F.  (k-l)V  (k-i-a.  ,k-m-a./k-l)  F,(k-m)  +  G(k-l)f  (k-i)6.  GT(k-m) 

x  1,  j-0  X  1  J  J  W 

“  lck['fvw(k“")5k,k-BGT<k‘B)  +  ^.0H(’°  V^Ck.k-o-Qj/k-l)  F*(k-m)] 

It  can  be  easily  shown  that  in  the  special  case  of  discrete  systems  without  delay,  the  results 
obtained  for  the  filtering  algorithm  agree  well  with  those  In  the  literature  [36].  The  results  presented 
In  this  section  can  also  be  extended  to  continuous  time  smoothing  problems  and  nonlinear  distributed 
parameter  systems  with  or  without  delays  [32].  The  same  basic  approach  can  also  be  applied  to  derive 
estimation  algorithms  for  nonlinear  systems  corrupted  by  colored  noise  [31]. 

SECTION  5 
CONCLUSIONS 


A  unified  approach  Is  presented  to  derive  estimation  algorithms  for  discrete  and  continuous 
nonlinear  systems  with  and  without  delays,  corrupted  by  white  Gaussian  noi3e,  correlated  noise,  colored 
noise  as  veil  as  noise  -free  processes. 

In  Section  2,  nonlinear  filtering  algorithms  were  derived  for  continuous  nonlinear  systems 
without  delays  corrupted  by  white  noise,  correlated  noise  and  noise-free  processes.  The  main  technique 
Involves  the  application  of  the  matrix  minimum  principle  together  with  the  Kolmogorov  and  Kushner 
equations  to  minimise  tho  error-variance,  taken  to  be  the  estimation  criterion.  The  filtering  equations 
obtained  for  nonlinear  systems  with  white  noise  processes  are  exact,  but  for  non-white  noise  processes 
the  results  obteined  are  approximate. 

For  systems  with  polynomial,  product-type  or  state-dependent  sinusoidal  nonllnearltles,  the 
proposed  algorithms  can  be  evaluated  without  the  need  for  approximation  under  the  assumption  that  the 
estimator  errors  are  Gaussian.  Such  an  assumption  la  significantly  different  from  the  moat  commonly 
used  aseumptlon  that  the  state  is  Gaussian. 

Nonlinear  estimation  algorithms  for  discrete  nonlinear  delayed  systems  with  measuresents 
corrupted  by  white  noise  end  correlated  noise  processes  are  rasper tlvely,  derived  In  Sections  3  and  6. 

The  estlmetlon  elgorithme  directly  yield  the  fixed-leg,  fixed-point  end  fixed-interval  smoothing  and 
filtering  elgorithme,  with  proper  substitution  of  the  dlecrete  time  indices.  The  proposed  technique 
mtkes  use  of  the  concept  of  the  gradient  matrix,  the  minimum  principle  to  derive  the  optimal  values  of 
the  coefficients  In  the  estlmetlon  algorithms  under  the  requirements  that  the  estimates  be  unbiased  and 
the  error-vsrlancee  minimised.  The  derivation  le  straight-forward  end  clearly  Indicates  ths  does  link 
betweon  three  different  classifications  of  smoothers  end  the  filtering  estimator. 

The  reaultn  obtained  are  exact  end  optimal  with  respect  to  the  imposed  constraints  on  ths 
dynamic  equations  of  the  estimators,  minimising  the  error-verieneee.  The  algorithms  derived  can  be  fully 
implemented  In  a  digital  computer  under  the  aseumptlon  chat  the  probability  density  functions  of  the 
estimator  errors  ere  Gaussian.  For  systems  with  polynomial,  product-types  end  state-dependant  sinusoidal 
nonllnearltles,  no  further  approximation  le  needed  to  evaluate  the  expectations  Involved  In  the  elror  • 
ithme.  They  ere  expected  to  be  computationally  efficleut,  aince  no  augmentation  of  state  variable*  1* 
Involved. 

The  results  can  also  be  applied  to  various  special  caste  of  nonlinear  ae  well  as  1  Incur  yv.- li¬ 
st  Ion  problems jfor  example)  The  estimation  problems  for  linear  or  nonlinear  system*  without  d«il«v<»;  .  ,id 
the  linear  estimation  problems  of  time-dtlaytd  systems  corrupted  by  white  noise  ea  well  as  n,-i 
nolee  processes,  etc. 

For  linear  estimation  problems  the  results  obteined  are  Identified  with  published  results  In  the 
literature.  In  the  particular  case  of  linear  time-delayed  system*  corrupted  by  additive  white  noise,  the 
results  ere  stable,  but  ths  stability  behaviour  of  the  nonlinear  estimator*  Is  yet  to  be  Investigated. 

The  result*  obtained  In  Sections  3  end  A  can  be  further  extended  to  continuous  time  ayetems  and 
nonlinear  distributed  parameter  systems, 
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SUMMARY 

The  purpose  of  this  chapter  is  multifold.  One  purpose  is  tc  provide  an  overview  sur¬ 
vey  of  the  alternative  decentralized  filtering  techniques  that  have  evolved  over  the  last 
decade  and  to  indicate  the  current  status  of  each  approach.  This  aspect  is  important  as 
a  preliminary  step  in  performing  engineering  by  allowing  the  selection  of  the  approach 
that  best  fits  the  constraints  imposed  by  the  specific  application.  Several  contribu¬ 
tions  that  are  provided  herein  advance  the  state-of-the-art  for  two  decentralized  filter¬ 
ing  approaches  (viz. ,  SLU  and  SPA)  as  formulated  here  in  discrete-time  by  specifying  and 
summarizing  mechanization  equations  (with  rationale) ,  by  analytically  establishing  sta¬ 
bility  of  these  estimation  algorithms,  and  by  providing  tables  that  allow  quantification, 
of  the  computer  burden  upon  implementation  in  terms  of  required  memory  allotment  and  al¬ 
gorithm  cycle  times.  Thus  a  complete  view  of  these  two  approaches  to  decentralized 
filtering  is  provided  here.  Current  applications  and  likely  future  application  areas 
for  decentralized  filtering  are  identified.  A  primary  consideration  was  the  proper  peda¬ 
gogical  approach  to  simply  explain  somewhat  obtuse  prior  material  to  make  it  easily 
accessible  to  many  levels  of  readers  (with  a  variety  of  backgrounds  and  primary  inter¬ 
ests)  to  demonstrate  that  decentralized  filtering  do  is  in  fact  have  a  firm  theoretical 
foundation. 


l  PRELIMINARIES 


1.1  Outline  of  the  Chapter 


The  appropriate  discrete-time  models  to  be  used  for  decentralized  filtering  are  pre¬ 
sented  in  Section  1.2  with  certain  drawbacks,  that  apparently  were  previously  ignored, 
being  identified  and  resolved.  A  brief  overview  survey  of  alternative  decentralized 
filteiing  formulations,  their  current  status,  salient  features  and  advantages/disadvan¬ 
tages  is  provided  in  Section  1.3.  A  particular  drawback,  encountered  for  several  decen¬ 
tralized  filtering  approaches,  of  requiring  the  objective  application  to  be  reexpressed 
in  an  "output  decentralized"  form  is  discussed  in  Section  1.4.  A  recent  approach  for 
rigorously  handling  several  filters  (with  nested  state-variable  system  models)  operating 
at  differing  measurement  utilization  rates  for  possible  parallal  processing  implementa¬ 
tion  is  described  in  Section  1.5. 


Two  particularly  well-developed  and  appealing  approaches  to  decentralized  filtering 
are  the  Surely  Locally  Unbiased  (SLU)  filter  and  the  Sequentially  Partitioned  Algorithm 
(SPA) .  The  SLU  approach  is  completely  described  in  Section  2  and  the  SPA  is  described  in 
Section  ?.  An  analytic  proof  of  the  stability  of  both  of  these  filters  is  offered  in 
Section  4  as  a  significant  technical  contribution  of  this  investigation.  Present  and 
anticipated  future  applications  of  decentralized  filters  are  indicated  in  Section  5. 


I*2  Appropriate  Discrete-Time  Models  for  Decentralized  Filtering 

Consider  .he  following  collection  {S,,  i-l,2,...,N)  of  N  interconnected  dynamical 
subsystems  (as  In  Refs.  8-14) «  1 


(n^xl) 

8  ;  it(t)  -  ri(t)*1(t)+Lu(t)ul(t)'**i  (tj 


u.2-1) 


having  discrete  measurements  available  to  the  subsystem  of  the  fornu 
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where  P  .  io  the  projection  operator  from  Rn 

X/  1 


N 

n  =»  £  n.  (1.2-3) 

i-1  1 


ni 

to  R  and  where  the  vector-valued  interaction  input  is  represented  (by  using  the  histor¬ 
ically  standard  notation  of  weighting  matrices  popularized  in  Ref.  21  on  p.  122)  by 


•ft*11  tpi*®*  (Pjxrij)  . 

V*1  4  [luu»  |  Lu(Uj  ...|  o  (1.2-4a) 
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Notice  that  u^(t)  has  no  direct  component  of  x^(t). 

The  process  and  measurement  noises  w^,  (t>  and  v^tt)  are  assumed  (as  in  Refs.  8-14)  to 

be  independent,  zero  mean,  white  Gaussian  noises  having  associated  covariance  matrices 
Q'^(t)  and  R^(t),  respectively,  and  uncorrelated  with  the  Gaussian  initial  condition 


XjlOMNlO,^) 


(1.2-5) 


Also  w^(t)  and  v^(t)  are  assumed  to  bo  uncorrelated  with  the  noise3  and  initial  condi¬ 
tions  of  other  subsystem*  (viz.,  w^(t),  v^(t),  and  x^(0)  for  jj*i). 

The  behavior  of  an  entire  interconnected  linear  system  is  summarized  at  each  time 
instant  t  by  the  n-dimensional  full  state  vector 

x(t)$|x][(t),  Xjft) . x£(t)]T  (1.2-6) 


of  which  each  user  subsystem  ha*  an  n^-vector  valued  subset  x^tt).  Tho  projection 
operator  introduced  in  Eq.  1.2-2b  operates  on  x(t)  to  produce 

PK#i(x(t)l  •  * j ( t)  (1.2-7) 


end  so  can  be  represented  as  a  matrix  premult ipiying  x(t)  in  Eq.  1.2-7  of  the  following 
form 


(n^nlf^i*^)  :,»»!,  fn.xn. 
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The  differential  equation  that  describes  the  time  evolution  of  the  aggregate  state  of 
Eq.  1.2-6  in  continuous- time  is 


x(t)»  F(t)x(t>  ♦  w'(t) 


(1.2-9) 


where 


v(t)»  (*'*(!), 


wf(t) . w'T(t)JT 
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and 
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The  appropriate  transition  matrix  for  Eq.  1.2-9  is  ♦(t,t>  which  satisfies 
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The  extjt  solution  of  Eq.  1.2-9  is  of  the  form 

x(t}«  ?(t,f)x(T)'*  f t  l(t,s)  wields  (1.2-14) 

J  T 

end  the  corresponding  exact  discrete-time  representation  is  (p.  171  of  Ref.  19): 

s(k*l)  -  ♦(k+l,k)x(k)  ♦  w{k)  (1.2-15) 

where  t  *  (k*l)A,  x«k‘A  while  the  time  step  6  has  been  suppressed  and  the  exact  equiva¬ 
lent  to  continuous  white  noise  w* (t^)  is  denoted  by  w(k)  to  be  an  n- vector- valued 

Gaussian  whits  discrete-time  process  having  the  following  statistics t 


t[w(k^U*(^l>^s)  E|w'(s)jds  -2 


(1.2-16) 


Q(k)4*[w(k)wT(k)j  »(<K»l)a,s)rf»s)|T«k^»*»id* 

ejw(k)wy (j)j »0  for  kej 


(1.2-17) 

(1.2-18) 


th 


The  exact  difference  equation  that  describes  the  tins  evolution  of  tbs  1  subsystem 


is 


(1.2-19) 


x^k+1)  -  Px  i  x(k+l) 

hence  by  substituting  Eq.  1.2-15  into  Eq.  1.2-19  yields  the  following  results 

x.tk+D-P  *(k+l,k)x(k)  +  P  .  w(k)  (1.2-20a) 

A  ~  X#  1  — 


■iij(k+l,k)  x,  (k)  +  E  1.  .  (k+l,k)x.  (k)  +  w.  (k) 
11  1  j«l  3  1 

i+i 


(1. 2-20b) 


(1.2-20C) 


Equivalent  to  Eq.  1.2-17  (p.  171  of  Ref.  19),  Q(k)  evolves  in  time  according  to  the  fol¬ 
lowing  differential  equation 


<j(t,tk) 

-  F(t)Q(t,t)t)+Q(t,tk)FT(t)+5' (t) 

(1.2-21) 

where 

■Q'(t) 

-  diag (Q^ (t)  ,Q'2(t) .... ,0'jj (t) ) 

(1.2-22) 

Taking  a  different  tack,  the  exact  discrete-time  representation  of  the  i***1 
system's  solution  of  Eq.  1.2-1  is 

local  sub- 

(k*l)-»u  (k*l.k)  *t  <k) 

imp 

♦  u ( (k*l  W , •) Lu  t*»Ul n**1  ’*♦  u  ( <k*lU  *)»[  (*)d« 
k*  *  n 

(1.2-23) 

Applying  the  usual  simplifying  assumption  (as  in  Eq.  4-124  of  Ref.  19) ,  that  the  inputs 
are  constant  over  the  Interval  of  integration,  yields  the  following  subsystem  evolution 
equation 

xjL  (k+l)«4ii  (k+l,k)xi  (k)fLu  ik;  Ui  (k) (k)  (1.2-24) 

where 

tu  (k,^i4 !  (k*i)A,s)Lli  (s)ds  (1.2-25) 

J  k* 

and 

wt(k)  4  j’(k*'l^l{{k4-i)A,s)w[(s)  ds  (1.2-26) 

J  ka 

with  w^(k)  having  the  following  statistics s 


Ef^tk)]  -  0  (1.2-27) 

gji.i  (1.2-28) 

I u 

E(4lCs)wiT(3)l»Oi  for  3#k  (1.2-29) 

( {3l (in«  « j)|  f'l*1**  ,ai  (1.2-30) 

Notice  that,  in  general,  the  global  solution  of  Eq.  1.2-1S,  as  projected  to  the  1th 
subsystem  as  in  Eq.  1.2-20c  (although  similar  in  fora),  can  be  quite  different  from  the 

1th  local  subeystea'a  solution  of  2q.  1.2-24  unless  both  of  the  following  conditions  hold 

♦  tl(kel,k)  -fu(kel,k) 

and 


(1.2-31) 


(1.2-32a) 


f  4  ((k+llA^lL.dlds  u,  (k)  -E  ,  <k+l,k)x.t  (k) 
J  kA  ii  1  j-1  13 

j*i 

A  condition  equivalent  to  Eq.  1.2-32a  (via  Eq.  1.2-4b)  is 


^k+1)  A  ,  N  N 

J  j'  *^^((k+l)A,s)L^^(s)ds  j  j  (k)-  x  ^  OO  (k+l.k)}^  (k)  (1.2-32b) 


•j 

j*i 


j#i 


A  sufficient  condition  for  Eq.  1.2- 32b  to  be  satisfied  is  for  the  following  to  holds 


Jk+IH 


*ii(Ot+l)A,s)Lii(s)dsjLij(k)-Lii(k)Lii(k)»ii5(k+l,k)  (1.2-33) 


for  j  *  1,2, .. . ,N/{i} . 


In  general,  even  the  milder  condition  of  Eq.  1.2-32  is  not  satisfied  since  *,,(k+l,k) 
satisfies 


(1.2-34) 


with  boundary  condition 


(1.2-35) 

while* ?££(k+l,k)  satisfies 

It  *U{t,r)mV x.l  It  *l.t 

(1.2-36*) 

P(t)  T(t,r)  P*  £ 

(1.2-36b) 

"P*’i^lP*'ltFi(t)P*-i  +  Lii(t)VU)ly<t*T)IxT,l 

(1.2-36c) 

•Pt(t)  *lt(t,T)  +  l  I,  •jltt,T> 

ijt 

(1.2-36d) 

with  boundary  conditions 

hi  (T*T>"Int 

(1.2-37) 

?lj(t,T)*0  for  j  ■  1,2, 

(1.2-38) 

(Dotation  for  the  index  j  in  Eq.  1.2-38  indicates  all  values  from  1  to  N  are  taken  on 
except  for  the  value  currently  held  by  l.)  Obviously,  the  solution  of  nqs.  1.2-34  and 
-35  coincides  with  the  solution  of  Bqs.  1.2-36d,  -37,  and  -38  (as  required  for  the  con¬ 
dition  of  Eq.  1.2-31  to  be  satisfied)  for  the  special  esse  when 

I>££(t)  *»  0  for  i*»l,2,...,H 

(1.2-39) 

since  then  the  aggregate  of  the  local  state-variable  solutions  provided  by  eaoh  subsystem 

is  identical  to  what  would  be  obtained  by  solving  the  aggregated  n  order  system  of 
essentially  decoupled  subsystems  with  the  only  allowable  coupling  occurring  in  the  meas¬ 
urements,  (In  Refs.  IS  and  15,  it  is  demonstrated  that  for  the  application  of  decentral¬ 
ised  estimation  to  the  interconnected  structure  of  the  JTXDS  RelNav  net,  the  fairly 
stringent  structural  constvdbec’of  Eq.  1.2-39  is  in  fsot  satisfied.) 

The  issue  of  specifying  conditions  for  exact  correspondence  for  decentralised  estima¬ 
tion  apparently  has  not  been  considered  in  Refs.  8-14,  21,  28,  30,  and  31,  but  were 
raised  within  this  investigation  for  completeness.  However,  Ref,  54  does  address  the 
problem  somewhat  by  advocating  the  use  of  "splitting  methods"  to  obtain  a  solution  to 
the  problem  of  large-scale  linear  least  squares  estimation  (in  a  Hilbert  space)  of  a  non¬ 
dynamic  system,  the  approach  of  Ref.  54  utilises  two-level  hierarchical  coordination  (by 
communication  of  a  supremal  coordinator's  decisions)  in  an  iterative  fashion  to  several 
lower  level  decentralised  local  estimators.  In  this  manner,  the  proper  solution  to  the 
aggregate  static  problem  is  to  be  converged  upon  asymptotically.  Ref.  54  also  provides 


suggested  extensions  for  the  use  of  splitting  methods  and  a  coordinating  supremal  con¬ 
troller  in  implementing  the  classical  optimal  regulation  control  of  a  time-varying  linear 
dynamic  system  with  a  standard  quadratic  coPt  function  to  be  minimized.  Perhaps  duality 
can  be  fruitfully  exploited  to  obtain  useful  generalizations  to  optimal  estimation  of 
these  techniques  for  optimal  decentralized  regulation. 

Tentatively,  the  discrete-time  model  for  decentralized  estimation  to  be  used  here  can 
be  recapitulated  from  Eqs.  1.2-24  and  1.2-2d  as 

Si:  xiac+lV»ii(k+l,k)xi(k)+LiiOc)ui(k)+wi(b)  (1.2-40) 

with  local  measurements  available  to  the  ittl  subsystem  of  the  form 

•tv-lwvi +  VvwU  (tk,+^(v,*Hi(tk,£(v^i(v  <i-2-4i) 

where  Px  ^  and  L^t^)  are  defined  in  Eqs.  1.2-8  and  1.2-4,  respectively,  and  the  statis¬ 
tics  of  the  process  noise  w^f*)  are  provided  in  Eqs.  1.2-27  to  1.2-30. 


1.3  Brief  Survey  and  Status  of  Alternate  Approaches  to  Decentralized  Filtering 

The  implicit  motivation  underlying  all  hierarchical  approaches  in  systems  theory  is 
the  pervading  idea  that  it  is  generally  easier  to  handle  several  lower  order  subsystems 
than  one  aggregate  system  of  high  order  (Ref.  22) .  The  fundamental  idea  is  to  decompose 
the  large  system  into  subsystems  and  then  manipulate  the  smaller  subsystems  in  such  a  way 
that  the  objectives  of  the  overall  system  are  met. 

In  the  case  of  decentralized  large-scale  system  applications  that  deal  exclusively 
with  the  specification  of  adequate  deterministic  control  inputs,  the  objective  is  to 
cause  the  aggregate  of  local  subsystem  control  solutions  to  also  be  the  global  solution. 
This  objective  is  frequently  accomplished  by  coordination.  That  is,  the  globally  optimum 
rolution  being  homed-in  upon  asymptotically  by  the  aggregate  of  locally  optimum  solutions 
is  frequently  accomplished  via  a  central  controller  (viz,,  supremal  controller).  This 
general  approach  of  utilizing  a  supremal  coordinator  involves  “interconnection  con¬ 
straints*  being  routinely  imposed  us  a  further  requirement  to  be  satisfied  by  the 
assorted  controls  provided  by  the  individual  infimal  subsystems. 

For  applications  of  decentralized  control,  a  few  of  the  more  common  approaches  for 
implementing  coordination  (Ref.  22)  arei 

•  The  Prediction  Principle  (Ref.  23),  where  the  supremal  controller  predicts  a  value 
Tor  the  interconnection  variables,  provides  it  to  each  local  subsystem,  and  allows 
each  local  subsystem  to  proceed  autonomously  with  its  own  local  optimization  calcu¬ 
lations.  Eventually,  the  predicted  interconnection  variable  is  checked,  updated, 
and  reissued. 

•  The  Balance  Principle  (Refs.  23  and  24),  where  each  infimal  controller  treats  its 
Interconnection  variable  as  one  of  the  controls  to  be  specified,  then  the  aggregate 
of  calculated  decentralised  infimal  controls  is  cheoked  for  conformity  with  the 
original  interconnection  variable. 

•  Use  of  Penalty  Functions  (Ref.  25),  where  the  interconnection  constraint  is 
adjoined  to  'the  standard  cost  function  via  a  penalty  function. 


Remark  1 . 3- 1 i  While  conceptually  useful,  Ref.  22  indicates  on  p.  62  that  the  Balance 
Principle  has  practical  limitations  since  it  may  give  rise  to  singular  control  problems 
that  cannot  be  solved  by  standard  iterative  techniques. 

Remark  1.3-2 t  An  unfortunate  disadvantage  of  penalty  functions  is  that  they  are  somewhat 
loose  and  do  not  force  exact  adherence  to  interconnection  constraints. 


In  stark  contrast  to  the  decentralised  control  problem,  the  decentralized  estimation 
problem  has,  in  general,  no  mechanism  for  enforcing  interconnection  constraints,  since  no 
control  is  involved.  Historically,  the  mathematical  structure  of  both  decentralised  con¬ 
trol  and  decentralized  estimation  was  examined  in  exacting  detail  by  Pearson  in  Ref.  21 
where  the  following  observations  were  madei 

«  Significant  computational  simplifications  accrue  in  the  control  problem  with 
quadratic  cost  function  when  ell  the  subsystems  are  linear  (pp.  152-3  of  Asf.  21); 

•  Techniques  exist  for  analytically  proving  proper  coordination  via  iteration  between 
aggregates  of  linear  subsystems  vie  a  contraction  mapping  argument  (pp,  182-8  of 
Ref.  21) ,  but  this  approach  is  appropriate  only  over  a  very  brief  time  interval 


•  An  approach  to  decentralized  filtering  simplifies  the  computational  coordination 
requirements  when  a  suboptimal  rule  is  used  (p.  182  of  Ref.  21) . 

During  a  critical  examination  on  pp.  533-4  of  Ref.  13  (as  motivation  for  offering  an 
alternate  decentralized  filtering  technique)  it  is  noted  that  to  solve  an  implementation 
of  Pearson's  decentralized  filter  "in  practice  would  require  knowledge  of  the  sequence  of 
observations  over  the  entire  time  interval  k=0  to  k=NQ  which  is  inconsistent  with  the 

tenents  of  sequential  estimation  since  these  are  unavailable  a  priori. "  While  possibly 
"useful  for  parameter  estimation  (i.e.,  identification),  Pearson's  decentralized  estima¬ 
tion  approach  is  not  of  much  significance  for  state  estimation." 

The  approach  pioneered  by  Sanders  in  Ref.  8 — of  restricting  the  local  subsystem's 
filter  to  be  of  the  so-called  purely  Locally  Unbiased  (SLU)  class — appears  to  follow 
through  on  the  predictions  of  Ref.  21,  where  a  suboptimal  rule  was  called  for  to  simplify 
the  computational  burden.  Similarly,  Shah's  Sequential  Partioned  Algorithm  (SPA) 
approach  to  decentralized  filtering  reported  In  Refs.  13~and  14  also  utilizes  a  simplify¬ 
ing  suboptimal  rule.  A  return  to  examine  these  two  decentralized  filtering  approaches  in 
more  detail  occurs  in  Sections  2  and  3.  These  two  approaches  along  with  the  approach  of 
Spyer  (Refs.  32  and  33) ,  as  discussed  below,  are  emphasized  herein  as  being  perhaps  po¬ 
tentially  more  useful  (from  the  viewpoint  of  offering  sufficient  supporting  rigor  and 
ease  of  engineering  implementation)  than  the  other  approaches  encountered  to  date. 


For  completeness,  it  is  noted  that  another  relatively  recent  approach  (Ref.  26) 
exists  for  implementing  decentralized  estimation.  However,  this  approach  is  only  applic¬ 
able  to  subsystems  of  the  following  restrictive  forms 


(njxl)  N 

si*  xi  (t)+w*  (t)+  £  1.^  *j(t) 


(1.3-1) 


where  the 


Hi  Lij  *r#  isnt  for  1 , j«l,  2, . . .  ,N 


(1.3-2) 


and  with  all  local  measurement  structures  having  no  interconnection  effects  as  modeled  by 

zi(tk’“fii*i<V4-vitV  (1.3-3) 

with  contributions  due  to  other  subsystems  x^(t^)  ( jj4i)  absent  in  the  above.  Consequent¬ 
ly,  the  associated  augmented  system  has  the  following  forms 

(nxl)  t 

kU)«{diag(Fi,Fi....,FN)+C}x(tk)+  w(t)  (1.3-4) 

and 

j(tjt)«di»g(Rj,fii, . .  .,RlJ)x(tJ{)  +  v(t1t)  (1.3-5) 

where 

. «VvlT  i1’3-6* 

etty)^  Jv^(fc) ,  #^(t) , . . . ,  v^(t)p  (1.3-7) 

and,  as  defined  in  Refs.  26  and  27  the  composite  interconnection  matrix  its 

cfijLii  Lj  for  i,j»l,2,...,N  (1.3-8) 

It  ia  asserted  in  Theorem  4  of  Ref.  26  that  the  composite  interconnection  matrix 
being  factor liablo  as 


OPS 


(1. 3-9) 


where 

P  4  diagtPj,  Pt,...,PH)  (1.3-10) 

and  the  P^  are  the  positive  definite  solutions  of  the  algebraic  Riccati  equation 

0  •  (1.3-11) 

and  S  •  any  arbitrary  skew-symmetric  matrix  is  necessary  and  aufficisnt  for  the  global 


optimal  estimate  to  consist  of  the  aggregate  of  local  optimal  estimates  of  the  following 
form 


P 

An  unfortunate  oversight  is  that  ,  the  "appropriate  perturbation  of  C^,"  is  not  ex¬ 
plicitly  defined  in  Ref.  26.  A  similar  discussion  in  Ref.  27  for  decentralized  esti¬ 
mators  indicated  that  the  matrices  premultiplying  Xj  under  the  summation  sign  in  Eq. 

1.3-12  are  obtained  by  calculations  of  full  dimension  n  (as  in  Eq.  1.2-3).  This  would 
be  an  unacceptably  large  computer  burden  for  many  applications  and  could  only  be  per¬ 
formed  at  a  central  computing  facility  or  central  node. 

A  decentralized  (possibly  parallel-processing)  algorithm  for  implementing  the  n- 
dimensional  global  exact  Kalman  filter  in  a  hierarchical  manner  has  been  studied  in  Ref. 
28.  However,  the  processing  hierarchy  is  dictated  by  an  internal  system  structure  rather 
than  by  any  external  imposed  protocol  hierarchy  (e.g.,  as  exists  in  the  JTIDS  RelNav 
application)  and  it  is  assumed  that  every  subsystem  has  access  to  all  the  measurement 
data. 


Other  approaches  to  decentralized  filtering  (such  as  Refs.  29  and  30)  have  been  sur¬ 
veyed,  but  the  common  requirement  of  having  to  perform  a  transformation  to  achieve  "out¬ 
put  decentralization"  is  incompatible  with  many  applications  (see  Section  1.4  for  details 
and  Ref.  30  for  numerical  examples).  The  approach  of  Ref.  31  requires  a  central  process¬ 
ing  node  (that  could  be  vulnerable  in  a  tactical  environment  as  a  single  target  whose 
destruction  would  ruin  all  operations) . 

The  apyvoach  of  Ref.  32  strictly  pertains  to  decentralized  LQG  estimation  and  control 
of  a  K-noda  system  (where  the  K-nodes  refer  to  K  subsystems)  where  local  filters  share 
their  information  with  all  the  other  nodes.  [LQG  refers  to  Linear  Quadratic  Gaussian 
applications  involving  linear  systems  with  Gaussian  measurement  and- process  noiseB  with 
a  single  quadratic  performance  index  (cost  function)  for  control.)  For  many  filtering 
applications,  there  is 

•  no  strong  interest  in  the  feedback  control  aspect, 

•  no  constant  number  K  of  subsystems,  and 

•  no  sharing  of  information  between  all  the  subsystems. 

However,  just  the  filtering  portion  of  Ref.  32  can  be  extricated  as  done  in  Ref.  33  with 
some  simplifications. 

Given  several  redundant  measurement  sensors  of  the  following  form 

ij  (k)  -  H^CkJxtk)  tv^k)  for  j-1,2, . . .  ,M  (1.3-14) 

it  is  reasonably  well-known  (Ref.  79)  that  the  linear  least-mean-square  estimate  of  x(k) 
as  in  Eq.  1.2-6  has  the  form 

x(k|k)  - 


(k|k-l) 


H  _  .  1-lfH 

♦  l  H,T(k)R,’i(k)U,(k)  l 

j«l  3  3  3  i-i 


HiT(k)Ri"l(k)*i(k)  ♦  p“l(k  k-1) x(k  k-1) 

(1.3-1!) 


with  associated  covariance  of  estimation  error  provided  by 


“  .  M  .  ,  1-1 

P(k|k)  -  p“A(k|k-l)  ♦  l  Ha  <k)R  “i(k)H1(k) 

j**l  3  1  1 


(1. 3-16) 


where  P(k)k-l)  A  covariance  of  error  of  estimating  x  at  k  as  propagated  from  k-1  for  full 
state  aggregate.  Speyer's  filter  (Refa.  32  and  33)  is  equivalent  to  the  following  formt 


Uk|k)  •  x(k)k-l)  ♦  l  Ka(k)Ua(k)  -  Hatkjfitklk-l)  J 

3  -  3  J 

H(K)  m  .1  A 

-  x(k|k-l)  ♦  P(k|k-UBjT>j  lt*3  -  Hjx(k|k-U) 


(1. J-17a) 


(1.3-17b) 


(1.3-17c) 


MOO 


*  x(k|k-l)  +  P(k|k-1)  T  H.TR.-1[z.  '  H.x(k|k-1)] 

1=1  3  3  3  3 


j 

but  where  instead  several  decentralized  local  estimators  are  used  in  the  mechanization  as 

M(k) 


(k|k)  -  |p(k|k-l)  |Pj  A(k|k)Xj (k|k)J  +hj(k)| 


M(k) 


P(kjk-l)  l  Ps_1Ck|k)x.  Oc|k>  +  l'  h. 

1*1  3  3  1=1  3 


M(k) 


(k) 


where 


Pj(k|k)  =  E Qx(k)  -  Xj (k>) (x(k)  -  Xj(k))T  Zj  (k) 


(1. 3-18a) 


(1. 3-18b) 


(1.3-19) 


and  hj (k)  satisfies  a  recursive  equation  of  the  form 

hj  (k)  =  F(k)h.j(k-1)  +  (k)  (k)  -  H j  (k)  x^klk-l))  (1.3-20) 

where  F(k)  and  (k)  are  precomputable  matrices  specified  by 

F(k)  *  P(kjk-l) [4<k+l,k)P(k-l|k-2)*T(k+l,k)  +  Q(k-l) ]"1^(k+l,k)  (1.3-21) 

Gj (k)  =  F(k)P(k-l|k-2)Pj”1(k-l|k-l)*"1(k+l,k)  -  P(k|k-1) ^ (k-l|k-l)$T  +  Q(k-l)J-1 

(1.3-22) 


Several  ways  for  ordering  the  computations  of  Eqs.  1.3-17a  to  1.3-22  for  greater  effici¬ 
ency  become  evident  but  all  approaches  represent  a  fairly  large  computational  burden. 
Ref.  33  only  stresses  the  reduction  in  computational  burden  without  tallying  the  burden 
encountered  for  implementation.  Useful  interpretations  of  Eqs.  1.3-18  and  1.3-20  are 
provided  by  Gobbini  on  p.  35  of  Ref.  80.  Generalizations  of  the  Speyer  filter  are  found 
in  Ref.  81  and  discussed  in  Ref.  80. 


An  overview  of  the  results  of  this  section  are  sumioorized  in  Table  1.3-1.  Other 
approaches  to  decentralised  filtering  publicized  too  late  to  be  considered  here  are  Refs. 
44  and  45. 


1.4  Salient  Features  of  a  Canonical  Transformation  to  Achieve  Output  Decentralization 

Several  approaches  to  decentralised  filtering,  such  as  Refs,  26,  29,  and  30,  require 
that  the  original  system  be  of  the  "output  decentralized"  form  as  a  condition  for  applic¬ 
ability.  An  approach  is  available  that  can  be  applied  to  some  large-aoale  systems  having 
a  general  measurement  structure  of  the  formi 

.  N 

«i(k)=Bixi(k)*ai  t  L^XjtkJ+VjOO  (for  i«l,...#H)  (1.4-1) 

so  that  so-called  "output  decentralisation"  of  the  fora 


Sj (k)  ■  MjSj(k)  ♦  Vj (k)  (1.4-2) 

(for  j-l,...,p)  is  achieved,  where  each  subsystem  has  access  to  (and  responsibility  for) 
measurements  only  for  that  subsystem.  In  output  decentralisation,  the  parameter  p  is  not 
constrained  to  be  identical  to  H  and  the  subsystem  state  and  noise  groupings  are  differ¬ 
ent,  in  general,  from  those  in  Eq.  1.4-1  as  denoted,  respectively,  by  and 

"Output  decentralisation,"  if  achievable,  is  attained  via  a  single  transformation 
that  must  be  applied  to  the  entire  system  aggregate.  Only  after  the  output  decoupling 
transformation  has  been  applied  aro  distinct  individual  subsystem  revealed.  Conditions 
for  applicability  of  the  output  decentralisation  transformation,  its  mechanisation,  and 
its  general  inconvenience  to  apply  are  only  briefly  touched  upon  hero. 

The  requisite  decentralising  transformation  was  originally  presented  in  Ref.  42  and 
discussed  in  sore  detail  (as  Chapter  7)  in  Ref.  43  but  only  for  the  case  of  "control 
input  decoupling"  or  "control  decentralisation"  without  a  consideration  of  measurement 
or  process  noise.  "Output  decoupling"  results  were  left  only  as  an  implicit  afterthought 
Tviafche  concept  of  duality  between  control  end  weeuroment  structures) .  For  ease  in 
accessibility  and  use  of  the  seam  consistent  notation  throughout,  the  specific  transfor- 
maticn  for  achieving  "output  measurement  decoupling"  is  explicitly  presented  by  Kerr  in 
Appendix  *  of  Ref.  IS  following  the  approach  of  Raf.  30  but  going  further  to  show  the 
effect  of  the  transformation  on  the  measurement  end  process  noises  that  are  naturally 


TABLE  1.3-1 


SUMMARY  STATUS  OF  ALTERNATIVE  DECENTRALIZED  FILTERING  APPROACHES 


Primary 

Investigators 

Exhaustive  Open 
Literature 
Descriptions1 

Overview  Comments 

J.  D.  Pearson 

Ref,  21  (1970) 

♦  Historically  significant  treatment  of  gen¬ 
eral  structural  framework  and  indication  of 
problem  areas  or  barriers  to  be  overcome  in 
the  future. 

M.  Shah 

Ref.  14*  (1971) 

Ref.  13  (1978) 

Ref.  60 

•  Accommodates  time-varying  models. 

•  Allows  analytic  proof  of  stability  (Section 
4)  for  the  linear  case. 

•  Intuitive. 

•  "Biased"  (but  indications  are  that  this 
aspect  can  be  adequately  handled) . 

•  Anticipated  computer  burden  quantified 
(Section  3.2) . 

•  Accommodates  EKF  approach  to  nonlinear 
filtering  (p.  245  of  Ref.  9) . 

C.  Sanders 

E.  C.  Tacker 

T.  D.  Linton 

R.  Y.-S  Ling 

Ref.  8 1  (1973) 

Ref.  9  (1973) 

Ref.  10  (1976) 

Ref.  11  (1978) 

Ref.  12  (1979) 

•  Accommodates  time-varying  models. 

•  Allows  analytical  proof  of  stability  (Sec¬ 
tion  4)  for  the  linear  case. 

•  Exploits  measurement  structure  (when  possi¬ 
ble)  for  computational  savings. 

•  Unmodified  SLU  requires  a  special  SVD 
matrix  factorization  (discussed  in  Section 
2.2)  . 

•  Unmodified  SLU  is  "unbiased." 

•  Anticipated  computer  burden  quantified 
(Section  2.4). 

•  Somewhat  intuitive. 

M.  K.  Sundareshan 

Ref.  26  (1977) 

*  No  subsystem  interconnections  5  allowed  in 
£Ke  measurement  models. 

♦  Crucial  C^P  not  explicitly  defined  (see 
Section  1.3). 

M.  F.  Hassan 

G.  Salut 

M.  G.  Singh 

A.  Titli 

Ref.  29  (1976) 

Ref.  28  (1978) 

Ref.  13  (1978) 

•  No  subsystem  interconnections  §  allowed  in 
tfte  measurement  model. 

•  All  measurement  data  available  to  each  sub¬ 
system. 

D.  D.  siljak 

M.  B.  Vukcevic 

Ref.  30  (1978) 

•  No  subsystem  interconnections  §  allowed  in 
€Ke  measurement  models. 

E.  Verriast 

B.  Fried larder 

M.  Morf 

Ref.  31  (1979) 

•  Assume*  a  central  computing  node  that 
assembles  local  estimates  into  global 
estimates. 

J.  L.  Speyer 

T.  8.  Chang 

Ref.  32  (1979) 

Ref.  33  (1980) 

•  Framework  is  originally  for  both  estimation 
and  LQG  feedback  regulation  control  (where 
control  in.the  LQG  sense  and  not  in  the 
sense  of  C*)  but  exclusive  filter  framework 
can  be  extricated  (Ref.  33). 

‘no  known  concrete  application*  or  defense  related  prior  consideration*  of  these 
recent  theoretical  developments. 


’Designated  as  the  Sequentially  Partitioned  Algorithm  (SPA)  in  Ref.  13. 

'Designated  as  the  Surely  Locally  Unbiased  (SLt))  filter  (with  10  modified  variations 
of  the  original  formulation)  including « 

ROM  filter— Reduced  Order  Model 
ESLt)  filter— Expanded  Surely  Locally  Unbiased 
ESLU  filter— Sams  as  above,  but  without  explicit¬ 
ly  modeling  communications  noise 
(assumed  negligible) . 

*A  transformation  (detailed  in  Appendix  8.2  of  Ref.  IS  and  Appendix  A  of  Ref.  35  and 
suawarited  in  Section  1.4)  exists  for  converting  general  output  measurement  coupled  sub¬ 
systems  into  'output  decoupled”  subsystems,  but  technique  only  applicable  to  time-invari¬ 
ant  structure  and  must  be  applied  to  aggregate  system  (with  an  associated  harsh  computer 
burden) . 


encountered  in  filtering  applications.  The  essential  observations  ot  Kerr  in  nets,  xj 
and  35  are  now  summarized  here.  Note  that  while  the  output  is  now  represented  in  a  com¬ 
pletely  decentralized  manner  in  Eq.  1.4-2,  the  p  scalar  measurement  noises  of  the  differ¬ 
ent  subsystems  are  correlated  (a  type  of  coupling),  in  general,  unless  the  original 
system  of  Eq.  1.4-1  as  an  aggregate  also  has  uncorrelated  measurement  noise  components, 
as  indicated  by  a  block  diagonal  measurement  noise  covariance  matrix  of  the  following 
form 


R  = 


(1.4-3) 


(which  is  trivially  achievable  if  the  original  systems  aggregate  block  R  is  strictly  di¬ 
agonal,  but  not  very  likely  otherwise).  Restrictions  relating  to  the  presence  of  meas¬ 
urement  and  process  noise  as  encountered  here  while  considering  the  salient  features  of 
output  decentralizing  transformations  were  not  considered  in  Refs.  30,  42,  and  43. 


Drawbacks  Limiting  Applicability  of  Output  Decentralization 

While  "output  decentralization"  could  be  acceptable  for  many  applications,  its  use¬ 
fulness  for  many  potential  applications  appears  to  be  limited  since  requirements  for  its 
use  appear  to  exceed  certain  reasonable  computational  limits  as  now  summarized.  Apparent 
restrictions  to  applying  "output  decentralization* t 


•  Decentralized  transformation  only  strictly  applicable  to  time-invariant  linear  sys¬ 
tems  (where  the  matrices  P,  and  of  Eqs.  1.4-1  and  1.4-2,  respectively, 

are  constant) j 

•  Original  grouping  of  N  subsystems  must  consist  entirely  of  “observable  pairs" 
(H^Fj)  otherwise  computational  problems  with  corresponding  transformation  matrices 

Tj  are  encountered; 

•  Test  of  "observability"  requirements  being  met  as  a  requisite  stop  is  a  difficult 
computational  burden  for  real-time  verification  (alternative  tests  are  provided  in 
Section  S  of  Ref.  46); 

•  Formation  of  transformation  Tj  (Eq,  A. 2-10  of  Ref.  35)  appears  to  be  a  formidable 
computational  burden  for  each  j*l,...,N; 

•  While  transformations  using  T ^  can  bo  applied  at  the  subsystem  level  in  a  decen¬ 
tralized  fashion,  reshuffling  of  the  aggregate  of  states  using  tho  permutation 
matrix  P  (Eq.  A. 2-14  of  Ref.  35)  constitutes  a  computational  burden  that  is  appar¬ 
ently  only  compatible  with  a  central  computing  facility,  a  severe  restriction  for 
some  applications; 

•  Reshuffled  "output  decentralized"  representtftion  of  Eqs.  A. 2-18  and  A. 2-19  of  Ref. 
35  will  not  necessarily  correspond  to  the  state  identities  of  the  original  sub¬ 
systems  (a  consequence  that  can  be  Inconvenient  when  a  desirable  physical  associa¬ 
tion  frequently  provides  insight  for  error  overrides) . 


The  restriction  of  item  1  above  in  requiring  a  time-invariant  system  is  a  hardship 
for  nonlinear  applications  where  repeated  relinearications  are  time-varying  in  general. 
An  application  could  still  be  accommodated,  in  principle,  by  repeated  application  of  the 
T  and  P  transformations  discuased  in  tha  above  items  4  to  6  after  every  measurement  up¬ 
date  anywhere  within  the  total  system.  However,  the  drawback  to  doing  this  is  that  it 
reprasanta  a  multifold  Increase  In  a  fairly  substantial  computational  burden  that  may 
already  be  impracticable  in  many  applications  to  be  required  to  perform  these  tedious 
transformations  even  once. 


1.5  Hultlrate/Hultldlmsnslonal  Two  (or  Three)  Filter  Coordination 

In  the  restrictive  situation  (prevalent  in  inertial  navigation  system  applications) 
that  the  observation  matrix  has  the  following  special  structure 

man  mxs  mxq 
*k  •  (Ck  I  0  1 

where  the  dimensions  are  such  that 

a  <  q  and  n  •  s  ♦  q 
this  feature  can  be  exploited  to  e  computational  advantage  by  making  use  of  the  following 


(1.5-1) 

(1.5-2) 


matrix  ana  vector  partitioning: 


nxl 

A 

xk|k-l 


ixn 

pkl  k-1 


33pklk-l  I  ^Pklk-l 


PqpT  |  qqp 

pklk-i  |  k|k-l 


l3Xk|k-l 


qxk!k-l 


(1.5-3) 


where  leading  superscripts  denote  the  dimensions  of  the  submatrices  and  vectors  for  clar¬ 
ity.  Partitioning,  as  done  above,  can  be  used  to  define  the  two  separate  but  compatible 
filters  depicted  in  Figure  1.5-1  that  can  perform  their  processing  at  different  rates  in 
order  to: 


1.  make  use  of  the  measurements  at  a  fast  rate  (as  they  become  available)  and  sum¬ 
marize  the  associated  plethora  of  information  (as  a  kind  of  rigorous  data  compression)  in 

,  and 
k 


2.  hand-over  the  distilled  summary  data  sY  to  be  combined  within  a  full-scale 

*k|k 

filter  that  processes  at  a  slow  rate  (as  required  for  the  higher  dimensional  filter) ,  but 
with  consequently  greater  accuracy  associated  with  a  more  detailed  accounting  of  error 
contributors  to  be  found  in  this  higher  fidelity  model. 

Standard  Kalman  filtering  theory  prescribes  use  of  only  a  single  unified  filter.  The 
remainder  of  this  section  describes  the  rationale  that  permits  a  theoretically  rigorous 
implementation.  The  ideal  structural  and  cycle-rate  compatibility /conformability  (match 
between  what  needs  are  anticipated  for  future  integrated  navigation  system  applications 
and  what  is  offered  by  the  multirate  two  filter  configuration)  follow  as  a  consequence  of 
structural  degeneracies  of  Eq.  1.5-3  (due  to  the  beneficial  sparseness  of  the  observation 
matrix  in  Eq.  1.5-1  as,  respectively, 

"kin  •  ‘•klit-i  -  (1 . 5-4a) 


(1.5-4b) 
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figure  1.5-1.  KULTIRATE  TWO  FILTER  APPROACH s  Reduced-order  filter  processes  measurements 
•t  feet  rate  (AT)  and  hands  over  estimates  to  More  accurately  detailed 
full-order  filter  for  processing  at  the  slower  rate  (R«dl> . 


and 


'*klk  **klk-l  .  “»klk-lck(ck',*etlk-lcic-Atl"1  .  _  .* 

- -  .  — -  ♦  . . . t*k  -  Ck*»klk-l) 

l**!*  ^U-l  *,**|k-l<:klc*"Nlk-lCk*At)*1 


From  Eq.  1.5-5a  above,  the  following  two  component  equations  are  extracted: 

•*klk  •  **klk-l<***klk-l  t4lCk**»k|k-lCk***>*l<n-Ck**klk-l)) 

»  T 

^Xklk  *  ,>klk-l*t':,n«lk-l  !*«■•  44  “ttkla  *b»«  bttckku  ! 


(1. 5-5a) 


(1.5-5b) 


(1.5-5o) 


The  common  bracketed  residual  or  "innovations"  information  provided  by  the  measurements, 
occurring  in  both  Eqs.  1.5-5a  and  1.5-5c,  can  be  solved  for  from  Eq.  1.5-5b  as: 

^(CicWpjclk-lCk+RkJ^tsk-Ck^klk-l)  *  ss*k[k-l  t sXjc | ! x-x )  (1.5-6) 

The  explicit  occurrence  of  this  measurement  residual  in  Eq.  1.5-5c  is  then  eliminated  to 
result  in  the  following  update  form  for  the  additional  states  of  the  full-state  filter  as 


A  A  ^  A  A 

q«klk  •  qXkik-l  ♦  ^Pklk-l  ”Pktk-l  (*Xk|k-3Xklk-ll 


(1.5-7) 


This  equation  is  to  be  used  as  the  basis  of  the  update  step  of  the  full-atate  slow  rate 
filter  as  further  elaborated  upon  below. 

Remark  1.5-1:  Please  notice  that  this  derivation  of  the  multirate  filter  avoids  the 
severe  (but  unnecessary)  restriction  encountered  in  the  original  derivation  offered  in  a 
slide  presentation  prior  to  Refs.  61,  62  that  c*  in  Eg.  1.5-1  must  be  invertible  (or  the 
equivalent  restriction  in  Ref.  61  that  Cj<  be  equivalent  to  an  identity  matrix) .  A  re¬ 
moval  the  invertibility  restriction  was  accomplished  independently  for  the  derivation 
presented  here  and  by  the  later  work  of  Ref.  61. 

From  Eq.  1.5-4b,  the  following  three  recursive  relationships  are  obtained  as  sub¬ 
matrices  within  the  partitioning: 


•*Pkik  «  ( I-*3pk I k-lck ( ck****k  1  k-lck+Rk J “lck J  33*klk-l 
8qpklk  "  (Same  as  within  above  brackets  )  *3Pkik-l 
V  -1 

qqpklk  •  qqpklk-l-*q^klk-lMPk|k-lU-‘tSa«e  as  within  above 
brackets))  •^Pklk-I 


(1.5-0) 


(1.5-9) 


a. s- io) 


Eq.  1.5-8  can  then  be  indirectly  solved  computationally  for  the  common  expression  within 
brackets  (aa  reoccuru  throughout  Eqs.  l.S-S  to  1.5-10)  by  only  a  single  low  dimensional 
(sxa)  matrix  Inversion  and  matrix  multiply  as 


Ct-MPklk-lcI(CK“Pk|k-lCk*»k)‘^k'.  •  *»Pk|k  *3*klk-l 


(1.5-11) 


Further  aomputational  simplifications  accrue  in  evaluating  Eq.  1.5-9  and  Eq.  1.5-10 
by  utilising  a  fundamental  assumption  that  the  following  information  ia  a  good  approxima¬ 
tion  at  the  instant  of  a  f ast/a'lov  fi .e . ,  reduced-order  f ilter/full-xtate  filter!  trans¬ 
fer  (every  n<at  units  of  time  where  AT  denotes  the  step-site  of  the  reduced-order  filter, 
while  N*6T  denotes  the  atep-aito  of  the  associated  full-state  filter)  as  a  consequence  of 
the  revueed-order  filter's  timely  utilisation  of  measurements  s^r 


•*k|k  "  3*k|k 

•••km  "  “*klk 


(1.5-12) 


(1.5-13) 


(whsrs  a  superscript  asterisk  denotes  calculations  performed- ty  the  reduced- order  filter). 
Exploitation  of  this  rensonahlc  assumption  yields  a  v-w^utatlonal  approximation  for  the 
bracketed  term  of  Eq.  1.5-U  as 


«k* 


(1.5-14) 


from  which  a  simplification  of  tha  update  procedure  for  the  full-atata  filter  ensues  from 
Eqs.  1.5-14,  1.5-8,  1.5-9,  and  1.5-10  as:  . 


SSpkl 


(1.5-15) 


*  S3pklk 


3qpklk  *  S)c  ^Pjcl Jt-X  (1.5-16) 

qqpklk  -  <NPk|k-l-Sqpk|k-l  33 Pk Ik-1  (I-Sk)aqPklk-l  (1.5-17) 


Similarly  utilizing  the  assumption  of  Eq.  1.5-12,  the  update  procedure  for  the  remainder 
of  the  full-state  filter  provided  by  Eq.  1.5-7  simplifies  to 


q*klk 


qxk|k-i  +  ^Pk  I  k-lS3pk Ik-1  (35tk|k-xk|k-l] 


(1.5-18) 


*xk I lk  *  3Xkik 


(1.5-19) 


Observe  that  in  both  Eqs.  1.5-17  and  1.5-18,  an  additional  computational  consideration  is 
the  requirement  for  calculating  the  inverse  of  ss  .  This  is  easily  accommodated 

Pk|k-1 

since  ss  is  symmetric  positive  definite  and  of  low  dimension  (sxs)  and  is  to  be 

Pk|k-1 

calculated  within  the  digital  processor  hosting  the  full  state  filter  which,  in  turn, 
is  allotted  more  time  (N*AT)  to  complete  its  requisite  calculations  (notice  the  bene¬ 
ficial  parallel  processing  associated  with  hosting  the  two  filter  algorithms  on  different 
machines) .  For  both  estimator  and  covariance  calculations  performed  in  the  full-state 
filter,  an  exact  implementation  of  the  propagate  step  is  recontaenoed  (and  likewise  easily 
accommodates  due  to  the  longer  time  of  N*  7T  allotted  for  computations  associated  with  the 
full-state  filter) . 

Coordination  of  the  two  reduced-order/full-state  filters  is  achieved  by  patting  the 
following  covariance  information  back  from  the  full-state  to  the  reduced-state  filter: 

*3pklk-l  *  Mpktk-1  (1.5-20) 


at  the  N*  AT  instants  of  data  transfer.  In  this  manner,  the  covariances  of  the  reduced- 
order  filter  are  not  permitted  to  diverge  away  from  the  better  overview  assessments  (due 
to  a  moro  detailed  accounting  of  cross-corrolations  and  considerations  of  additional  sig¬ 
nificant  error  contributors)  of  the  full-state  filter.  Thu*  the  salient  features  and 
theoretical  rationale  underlying  the  operation  of  multirate  reduced-order/full-state 
filtors,  as  depicted  in  Figure  1.5-1,  have  been  simply  described. 

A  technique  for  performing  a  reduced-order  suboptimal  filter  analysis  for  this  multi- 
rate  filtering  approach  to  ascertain  its  aporopr lateness  for  a  particular  application  is 
described  in  Ref.  61.  Generalisations  are  immediate  for  extending  this  approach  to  three 
(or  more)  concatenated  filters  or  one  series/two  parallel  filters,  if  need  be,  for  either 
the  previously  mentioned  multiratet  of  fast/slow  or  for  three  processing  rates—fast/ 
mcdium/alow.  However,  it  is  important  to  note  the  ordering  hierarchy  where  the  reduced- 
order  filter  taodel  is  a  proper  subset  (perhaps  by  a  similarity  transformation)  of  the 
larger  filter  model. 


2  THE  5 U RELY  LOCALLY  UNBIASED  (SLU)  APPROACH  FOR  DECENTRALI BED  FILTERING 
3.1  Layout  of  Section  2 

Justification  for  a  detailed  discrete-time  treatment  of  the  SLU  filtering  formulation 
it  provided  in  Section  2.2.1.  The  special  structure  that  must  be  exhibited  by  system  ob¬ 
servation  matrices  to  enable  use  of  SLU  filtering  to  a  omputational  advantage  la  de¬ 
scribed  in  Section  2.2.2  and  matrix  tram format ions  to  cable  verification  of  this  desir¬ 
able  structure  are  provided.  The  mechanisation  equations  for  implementing  the  SLU  algo¬ 
rithm  are  provided  In  Section  2.3  and  summarised  a*,  nine  (9)  entries  in  Table  2.3-1.  A 
table  of  SLU  operation  counts  to  quantify  the  computer  burden  upon  mechanisation  is  pro¬ 
vided  in  Section  2,4. 


2.2  Derivation  of  the  Discrete-Time  SLU  Filter 

2.2.1  Motivation  for  Proceeding  with  a  Discrete-Time  Derivation 

An  application  of  the  Matrix  Minimum  Principle  (Refs  1  and  2),  a  well-known  too)  for 
the  derivation  of  the  continuous-time  Kalman  filter  (Ref.  3) ,  ie  presented  here  for  an 
exclusively  discrete-time  derivation  of  the  dlecretm-time  SLU  decentralised  filtering 
equations.  This  demonstration  is  provided  here  for  the  following  reasons: 

•  The  SLU  approach  of  Refs.  1-14  to  decentralised  filtering  i*  via  utilisation  of  the 
continuous- time  version  of  the  Matrix  Minimum  Principle,  where  the  decentral i  ation 
requirement  ie  essentially  included  in  thw  routine  analysis  as  an  additional  struc¬ 
tural  constraint,  sad  the  filter  gains  are  then  optimised  to  provide  the  minimum 


variance  solutions  for  the  prescribed  filter  structures. 

•  The  SLU  results  of  Refs.  8-14  for  decentralized  filtering  are  stated  and  derived 
only  within  a  continuous-time  framework  (discrepancies  that  can  arise  between  dis¬ 
cretization  of  continuous-time  results  and  results  derived  exclusively  via  dis¬ 
crete-time  techniques  are  noted  and  explained  on  pp.  136-9  of  Ref,  4,  on  p.  297 

of  Ref.  5,  and  other  difficulties  discussed  in  Ref.  6) j  however,  practical  applica¬ 
tions  implemented  with  a  digital  computer  appear  to  be  better  suited  to  the  dis¬ 
crete-time  formulation  provided  below.  Indeed,  realistic  operations  counts  as  pro¬ 
vided  in  Section  2.4  to  analytically  quantify  the  required  computer  burden  can  be 
performed  only  for  the  discrete-time  formulation  (p.  93  of  Ref.  12) . 

•  The  milestone  application  of  the  Matrix  Minimum  Principle  to  filtering  problems 
(Ref.  3)  unfortunately  provides  no  clarification  for  why  the  particular  cost  func¬ 
tion  of  just  a  terminal  accuracy  constraint  was  chosen.  Use  of  the  same  terminal 
accuracy  constraint  persisted  without  clarification  throughout  Rofs.  3,  8-12,  and 
7,  but  is  questioned  and  appropriately  modified  both  in  similar  circumstances  in 
Refs.  47,  48,  4q,  and  here. 

First  the  special  structure  required  for  SLU  filtering  is  identified  in  Section  2.2.2 
and  the  descriptive  transformation  is  described.  The  SLU  filter  gains  are  specified  in 
Section  2.3  and  the  form  of  the  local  Riccati  equation  that  must  be  recursively  solved  by 
each  subsystem  is  also  specified. 


2.2.2  Special  Structure  Required  for  SLU  Filtering 

The  discrete-time  subsystem  model  that  appears  to  be  most  appropriate  for  Surely 
Locally  Unbiased  (SLU)  filtering  is  presented  in  Eqs.  1.2-40  and  1,2-41.  The  approach 
of  Refs.  8-12  is  to  assume  that  each  local  estimator  is  of  the  following  forms 

jJi(k+l|k)-ri(k+l,k)  x^ (k | k-1)  +Kt (kfcsjfk)  (2.2-1) 

and  the  corresponding  error  in  subsystem  estimation  is 

ei(k+l]k)-  xi(k+l)-Si(k+l|fc)  (2.2-2) 

The  subsystem  error  of  estimation  evolves  in  time  according  toi 

ei(k+lik)»*ij  (k+l,k)xi(k,+Lit(k)ui(k)+wi(k)-ri(k+l,k)^i  (k|h-l) 
-^(k+l.kJx^k)  +ri(k+l,k)xi(k) 

-Kj (k) |sj (Id Jdo* (j,) |  (2.2-3a) 

tkik-n 

*|«uoiu.w»(  tkn.nti*'  tJslk> 

♦  |4l(k)-i(l(ki*l(k)|  (2.2- 3b) 

The  desirable  property  of  the  subsystem  estimator  providing  conditionally  unbiased  esti¬ 
mates  as 


s|ft1(k«>l|k)|s1<k)]>  s[Kt(k>l)i|1(k>)  (2.3-4) 

imposes  the  following  structural  constraint  (justified  as  on  p.  291  of  Ref.  7)  on  the 
middle  coefficient  matrix  of  Eq.  2.2-3b  as 

9  -[♦ll(k+l,k)Pjf  L  LJLi(k)I,1(k1-Ki(k)Bt(k)  ^(k+l.klP^^]  (2.2-5) 

By  exploiting  the  structural  decomposition  of  the  observation  matrix  H^k)  as  provided  in 

Eq.  1.2-2d,  the  single  condition  of  Eq.  2.2-5  (being  satisfied  for  conditionally  unbiased 
estimates)  can  be  naturally  decomposed  for  convenience  into  the  following  two  conditions i 

[  ♦iitk+l,k)-*t(k)Si(k)-ri <*♦!.*)[  P*  i  -  0  (2.2-6) 


and 


(iii(k)-Ki(k)&i(k)jl,l(k)  (2.2-7) 

Via  the  definition  of  L^k)  in  Eq.  1.2-4,  the  single  condition  of  Eq.  2.2-7  can  be  furth¬ 
er  decomposed  into  an  aggregate  of  several  conditions  as 


for  j  -  l,2,...,N/{i) 


(2.2-8) 


[Lli(k)-Kiflt)HiOt)j  (k)  -  0 

while  the  condition  of  Eq.  2.2-6  (as  it  applies  to  each  of  the  component  subsystems 
i*l,2,...,N)  is  equivalent  to  a  familiar  structural  constraint  of 

ri!k+l,k)«  ♦u(k+l,k)-Kl(k)Hi(k)  (2.2-9) 

to  yield  decentralized  estimators  of  the  form 

ii(k+lik)-[*ii(k+-l,k)-ri(k)Hi(k)|  x^klk-U+K^k)  zj  (k)  (2.2-10) 

Before  looking  deeper  into  what  is  structurally  involved  in  satisfying  the  condition 
of  .  *».  2.2-8,  observe  that  the  weaker  requirement  to  provide  unconditionally  unbiased 
esf:  rtes  as  explicitly  represented  by 

E  [xt  (k+1  Ik)]  »  E[xi(k+l)j  (2.2-11) 

relies  ~nly  upon  use  of  'the  appropriate  filter  initial  conditions! 

^i(ft|-l)-E[xl(0)]  -0  (2.2-12) 

as  can  be  seen  by  taking  total  expectations  throughout  Eq.  2.2-3b  and  noting  that  only 
the  first  term  makes  a  nonzero  contribution  unless  E[e. (0|-1))  -  0  (equivalent  to  Eq. 
2.2-12).  1 

Returning  to  parallel  the  continuous-time  structural  investigation  of  Ref.  8,  the  de¬ 
centralized  gain  matrix  K^k)  (appearing  in  Eq.  2.2-10)  is  defined  to  be  of  the  Surely 
Locally  Unbiased  class  denoted  by 

(^Xq ,) 

(SLOl(k)^iKi(k)  such  that  Lti  (kJ-Kj  (k)fit(k)-0)  (2.2-13) 


if  and  only  if 


Rt(k)  Is  «  member  of  ISLUj (k) ]  (2.2-10 

Notice  that  a  n^  x  q^  gain  matrix  being  an  element  of  (SLU^tk)}  is  a  sufficient  condition 
for  satisfying  the  remaining  conditionally  unbiased  constraint  of  Eqe.  2.2-7  or  2.2-8. 

It  is  convenient  to  consider  two  other  subsets  of  matrices  corresponding  to  the  1th 
subsystem) 

.  fw  t 

oilk)  !  Jotoo  #uch  0,41  tVklLut,°  *  0  tor  J*1,2 . a/nii 

and 

ft^(k)  A  |K^(k)  such  that  thorc  exists  an  element  0^(k)  a  member 

of  0£(k>  such  that  Lu<k)  -  Kl(k)«l(k)  «•  0i'(k)  -  oj  (2.2-16) 

An  interesting  observation  (as  in  Ref.  8)  is  that  the  condition  of  Eq.  2.2-8  la  a 
subset  of  the  class  represented  by  Q.(k)  in  Eq.  2.2-16.  This  can  be  demonstrated  by 
first  noting  that  for 

R* j (k)  is  a  mtmber  of  0t(k)  (2.2-17) 

implies  that  there  exists  an  0^'(k)  with 

«  0  for  j-l,2,....N/(l)  (2.2-16) 


and 


Hi 00  "  *i°°  “l°°  **:i{k)  *  0  (2.2-19) 

Mow  poataultiplying  Eq.  2.2-19  by  L^ik)  yields 

[^(ki-K^klt^tk^  Llj(k)+^i(k)  lj.,(k)  *2*0-3  (2.2-20) 

which  simplifies  via  Eq.  2.2-18  to  condition  Eq.  2.2-6.  It  is  immediate  that  the  null 


matrix 


0  is  an  element  of  e^(k)  so 

(niXq.) 

{SLUi(k;>  C  nt(k)  C  {^(k)  such  thatEq.  2.2-8  is  satisfied}  (2.2-21) 

A  safe  suboptimal  decentralized  estimation  policy  is  to  optimize  the  filter  gains  only 
over  the  class  {SLU^ (k) }  of  SLU  filters  to  guarantee  subsystem  filters  that  are  condi¬ 
tionally  and  unconditionally  unbiased. 


Remark  2. 2. 2-1;  In  Ref.  8,  it  is  stated  that  the  condition  of  Eq.  2.2-8  is  equivalent  to 
the  set  0^(k)  as  defined  in  Eq.  2.2-16  (but  a  confirming  proof  eluded  this  author);  how¬ 
ever,  the  result  of  equivalence,  while  more  satisfactory,  was  not  utilized  as  being 
necessary  here. 


The  only  initial  worry  at  the  other  extreme  is  that  the  SLU  class  may  be  empty;  how¬ 
ever,  structural  conditions  that  guarantee  a  realistic  nonvacuous  optimization  will  now 
be  elucidated . 

A  nonvacuous  optimization  over  the  SLU  class  is  guaranteed  if  the  following  condition 
holds . 

Condition  2.2-1  (Ref.  8)  : 


rank  (^  (k) )  ■  <  q^^ 


(2.2-22) 


where  is  the  dimension  of  the  i^  local  measurement,  or,  equivalently,  satisfying  a 
canonical  structural  assumption  of 

Condition  2.2-11  (Ref.  8)  i 

A 


Adherence  to  the  canonical  structural  assumption  of  Condition  2.2-1'  can  be  forced  for 
any  arbitrary  H^k)  satisfying  Condition  2.2-1  of  Bq.  2.2-22  via  the  following  result  on 
p.  47  of  Ref.  SO. 

Theorem  2.2-1 t 

A 

For  H^(k),  a  (q^  x  p^) -matrix  of  rank  there  exists  a  nonsingular  (pi  x  p^)-matrix 
E^tk)  and  nonsingular  (q^  x  q^) -matrix  Ej^(k)  such  that 


(k) 


0 


VPi 


(2.2-23) 


(k)  fijtk)  *utk) 


(2.2-24a) 


Proof  that  such  a  factorization  is  possible  is  deferred  in  Ref.  8  to  Ref.  SO  where  only  a 
standard  existence  proof  ia  provided,  A  theoretical  loose-end,  previously  unsolved,  is 
how  to  computationally  execute  the  factorization  called  for  in  Eq.  2.2-24a  that  ia  guar¬ 
anteed  to  be  theoreticelly  possible  from  Theorem  2.2-i,  but  not  yet  specified  in  the 
literature  on  this  subject.  During  the  course  of  this  investigation  of  Ref.  IS,  it  was 
recognized  that  the  so-called  Singular  Value  Decomposition  (SVD)  algorithm  (Ref.  38)  will 
suffice  to  solve  the  factorization  problem  of  feg.  STa-H  an3rXhat" existing  coded,  stand¬ 
ardized,  and  validated  MkTkM  software  Implementations  exist  (e.g.,  EI5PACK  (pp.  265-9 
of  Ref.  39)). 

In  order  to  demonstrate  this  pleasant  resolution,  first  consider  the  following  gener¬ 
al  form  of  the  answer  that  la  returned  from  an  SVD  algorithmic  operation  as 


Sj_  | 

Hi  -  °i  0 


(PjXPi.) 


(2.2-24b) 


for  >  p^,  where  additionally  and  are  orthogonal: 

-1  T  -1  T 

ui  =  °i  *  vi  -  vi 

Further,  notice  that  upon  rearranging  Eg.  2.2-24b  the  following  results 


°il  8iwi»’i  h  °]'  rip, 


•  T-  f“- 

*>i  * 


(2.2-24C) 


Now  Eq.  2.2-24c  is  recognized  to  be  exactly  of  the  form  of  Eq.  2.2-24a  that  is  sought  (by 
making  the  following  associations  or  assignments)  with 

z  _1  4  u  T 

"il  ui 


vi  IK  1 


*2  0 


Caveat;  In  Ref.  9,  a  comment  occurs  prior  to  Eq.  7  that  the  rank  condition  (of  Theorem 
2,2-1)  being  satisfied  "roughly  speaking"  means  "that  at  least  one  element  of  the 
state  vector  of  each  user  Is  1 measured" and  that  each  of  the  interactions  to  each  ueer 
has  a  direct  effect  on  tho  local  measurement  obtained  by  the  user."  similarly  on  p.  17 
of  Ref.  8,  tbs  above  loose  interpretation  is  reinforced  in  stating  that  the  rank  condi¬ 
tion  "means  intuitively  that  the  user  can  observe  all  tho  interactions  to  his  unit." 

Both  tho  above  two  Intuitive  structural  requiremants  are  adhered  to  in  the  JTIDS  RelNav 
application.  While  the  equivalence  of  Condition  1  and  1*  (of  Ref.  3  that  correspond 
exactly  to  Condition  2.2-1  and  Thcorom  2.2-1  above)  have  been  validated  in  Ref.  15,  the 
verification  of  the  "loose  interpretations"  implying  compliance  with  Conditions  1  and  1* 
of  Ref,  B  cannot  be  verified  and  are  not  satisfied  by  all  applications,  as  demonstrated 
to  be  violated  for  0TID5  RelNav  as  shown  in  Eq.  4-13  of  Ref.  35. 

Earlier  examples  in  Refs.  8-12  of  how  to  apply  the  SLU  filter  to  power  system  appli¬ 
cations  had  degenerate  "special*  internal  system  structure  that  avoids  the  necessity  of 
performing  the  explicit  factorization  that  is  usually  required  (pp.  23-4  of  Ref.  8). 

By  the  above  defined  transformation  involving  2,.{k)  and  E , 2 tk) #  tho  following  trans¬ 
formation  rule 


lu— ♦Lii(k)»u(k)Al'u(k) 

tl  — **  (kl^tk) 

v  A  tk) — »«^(k)vi(k)fiv[(k)  x 

fl^k)— •*‘J{k)Si(k)sUUt).  5.1 

►  ^  ■ 

fii(k)— *^lk)Bi(k)A8lt,t;  x  * 

*i(kl— **^(k)*i(k)^[  (k) 

Rt  (k ) — 42 1 J  (k)  Ri  (kit  [  j  (k)^  (k) 


(2.2-25) 


(2.2-26) 


(2.2-27) 


(2.2-28) 


(2.2-29) 


(2.2-30e) 


(2. 2- 30b) 


(2.2-31) 


xi(k*l]k)-*li(ltvl,10*1(Ic|k-l)*L'u(k)[*'11(k)-3^1(k)x1(kilc>u]  ♦  K^jlk)  [* [ 2 (k) -St  j(k)xt  (k |k-l)|  (2.2-41) 

While  it  is  not  strictly  necessary  to  transform  tha  original  system  to  the  canonical 
form  of  Eqs.  2.2-32  and  2.2-33  to  utilize  the  optimal  SLU  decentralized  filtering  results, 

it  is  necessary  to  find  the  appropriate  Z^Oc),  zi2 (k) ,  and  calculate  both  Z^-1^)  and 
Zi2  (since  Z^  and  Z^2  are  defined  in  terms  of  orthogonal  and  diagonal  matrices  fol¬ 

lowing  for  Eq.  2.2-24c,  no  matrix  inversion  is  needed  here)  for  each  measurement  Z^tk). 

The  mechanization  equations  of  SLU  filtering  are  summarized  in  9  simple  steps  in  Table 
2.3-1.  The  optimization  to  specify  the  remaining  gains  K.,*  (i=l,2, . . . ,N)  is  performed 
next  in  Section  2.3  using  the  Matrix  Minimum  Principle.  * 


2.3  Specification  of  Optimal  Gain  and  Ricatti  Equations  for  the  Decentralized  SLU  Class 

In  preparation  for  specifying  the  equation  for  the  global  estimation  error,  notice 
that  Eq.  2.2-3b  simplifies  for  the  SLU  class  as 


a^(k+l|k)  ■  r^(k+l,k)e^(k|k-l)  +  ^w^(k)-K£(k)v^(k)J 


which,  when  augmented  for  all  N  subsystems,  yields 


where 


e(k+l|k)-r(k+l,k)e(k|k-l)+w(k) 


e(k+l|k)Ai  p*  ,  e.  (k+l|k) 

1-1  x,i  l 

rtk+i,k)5£  r. (k*i.k)pw  , 

a*1  k  X,1 

(k)Al^  L  [wi  (k)  -Ka  (k!  Vi  Ck)] 


(2.3-1) 


(2.3-2) 


(2.3-3) 


(2.3-4) 


(2.3-S) 


From  w^(k)  and  v^fk),  the  composite  process  N(k)  inherits  its  properties  of  also  being 
zero  mean,  white,  Gaussian,  and  independent  of  the  initial  condition. 

Step  1.  Obtaining  the  difference  equation  for  tho  time  evolution  of  the  covariance 
of  the  estimation  error  in  a  manner  analogous  to  what  is  done  in  Refs.  3  and  7  for  the 
"centralized"  case,  postaultiplying  Eq.  2.3-2  by  its  own  transpose  and  taking  expecta¬ 
tions  throughout  yields 

C,,k»l|k)«hf[e(k-l!k)eT(k*Uk)j  <2.3-6a) 

•£(kel,k)  Itk|k-1‘)  f* Ckel,kje  (2.3-6b) 


A  further  association  for  the  SLU  case  of  unbiased  filters  being 


£Ckel,k)At^4l  [•ll(kel.k)-Kltk)iltk)] 


(2.3-7) 


allows  the  observation  of  Ref.  8  that  the  solution  to  Eq.  2.3-6c  can  be  shown  by  direct 
substitution  to  be 


£{k*l|k)**S  Pi  .  p.  (k*l [k)  P„  s 
1-1  1 


(2.3-S) 


where  etch  P, (k+l|k)  evolves  in  time  according  to  the  local  discrete-time  Riccati  equa¬ 
tions  of  the* for* 


»llk»l|kl»(*u(k.l.k)-«1lkl»l(klJ  »ttk|k“Vt  [*u<k»l.kl«Vl  tktSjlkl] T*  VM*VfcUl 

•  [*u*k.l.k>-i'4Jlkl8*lai-«'u  iktSj.ttlj  >4tklk-ll  (*u  [*♦*.*>- t'ljtkJliimi-SjjlMljjtklj  * 


(2.3-»a) 


* (su  no •tu tk! t'kl !ki* ^ no]  .•Jj(kj^[,ikitJitkHt'um<«'l,f*«'Z[jT(kj.z'|jk)«'4J(ki*^1tki  (2-3-9b) 


with  natural  initial  condition 


Pi(0|-1)  = 


(2.3-10) 


Step  2.  Specifying  the  appropriate  scalar  performance  measure  or  cost  function  to  be 
the  weighted-mean-squared  error  in  estimation  (to  then  be  globally  minimized  in  specify¬ 
ing  the  optimal  gains  (i*l,...,N)  for  the  decentralized  SLU  filters]  as 


m  m 

?»  b{£  eT  (k|k-l)8(k)eT  (k|k-l)  > 
k-0  ~ 


-  I  Z  E  eT(k|k-l)M,  (k)e,  (k  k-l)l 
Ni-1  11  11  J 

N  .  ,  N.  N.  N.  -i 


where 


S(k)-diag{H1(k) ,M2 (k) , — .M^k) ] 
the  M^(k)  are  positive  definite,  and 

iN» 

J.  (•)*£  tr(P.  (k|k-l)M,  (k)l 
1  k»Q  1  1 


(2.3-lla) 


(2.3-llb) 


(2.3-llc) 


(2.3-12) 


(2.3-13) 


A  simplification  now  occurs  in  noting  that  minimizing  2.3-lla  over  the  aggregate  of 
SLU  classes  la  identical  to  minimizing  every  local  non-negative  cost  function  of  Eq . 

2.3-13  over  its  1th  SLU  class,  so  that 


Ji  [^Ki2  Jk-0,  ^Hlk)  'k-oli  *k-0,  *Mi  \J)] 


(2.3-14) 


While  global  optimality  is  sought,  it  has  just  been  shown  to  degenerate  into  N  local  op¬ 
timization  problems,  each  of  the  same  basic  structure.  Of  course,  the  measurement  data 
base  over  which  optimization  is  performed  is  more  restricted  as  an  approximation.  The 

Hamiltonian  for  the  1th  local  optimisation  is  of  the  form 
«  Ik)  .P^tkjk-l)  ,A^(k*l)  ,k) 

“tr(  ( [#ii,aq "*12^12]  Pi  [^i*jU<fKi2®lj]^3i4'Ki2Bi3t*,ii*I'URi3KiI4,Ri2Bi2Ki2}All>t*l)] 


*  tr (M(k)P(k|k-l) ) 


(2. 3-lSa) 


♦  tr (H(k)P(k|k-l)  I 


(2. 3-15b) 


where 


A^U+l)  is  the  n^  x  n^  costate  matrix 


0^3 1  (k )  *t\  i  (k )  1  (k )  L*  *  (k) 

*U.*q*Ulk*l‘k)'Ki{k)*ilik) 


(2.3-16) 


(2.3-17) 


Unlike  the  case  for  deriving  the  standard  centralized  Kalman  filter  using  the  Matrix 
Minimum  Principle,  there  ia  no  analog  to  tht  last  three  terms  present  in  Eq.  2.3-lSb, 
so  final  results  will  differ  correspondingly  due  to  the  added  complication. 

Differentiating  the  Hamiltonian  of  Eq.  2.3-lSb  with  reepect  to  the  appropriate  vari¬ 
able  yields  the  so-called  "coupling  equation*  as 


0  •  ^(K,la,PitklK-l)^l(k+i),k)|* 


(2.3-lta) 


<2. 3-18b) 


* 


(k+l)  [: 


LiiRi3-*ii,eq 


:»tata+*42  (fil2*IB12+a£l2J] 


+A^k+i>  [fi'li*;3-4ll  pJrJ+KiJ  (rL2?Xl^i2)] 


The  question  for  the  "backwards"  or  reverse  time  evolution  of  the  costate  (upon  using 
the  simplification  allowed  by  Ref.  2)  is: 

*  3Wi 

Ai(k)-+-55i(Ki(k>,Pt,At(k+l),k) 

*Ki(k+1'k)~^iiSirKi25l2]TAi(k+1)  [♦ii^->-1>lt)-L'<iiH^1-K^2H^2]-t-|M1(k)|  (2.3-19) 

with  transversality  (i.e.,  boundary)  condition 


A*(N0)=|^-tr[Mi(N9)Pi]  j^=  M. (N0)=M^(N0)>0  (2.3-20) 

The  following  arguments  proceed  as  in  Refs.  3  and  12  (but  with  more  justification  as 
will  be  indicated  parenthetically  now  that  the  boxed  terns  are  included  in  Eqs.  2.3-15 
and  2.3-19  according  to  the  convention  first  utilized  in  Refs.  47,  48,  49,  15,  and  35, 
and  explained  in  more  detail  in  Section  3  of  Ref.  46) .  Note  from  the  assumption  of  posi¬ 
tive  definiteness  of  M^(k)  and  the  form  of  Eqs.  2.5-19  and  2.5-20  that  the  A(k)  that 

evolves  backward  in  time  from  k=Ng  to  k«0  is  symmetric  and  positive  definite,  without 

the  boxed  terms  present  in  this  analysis,  it  is  more  difficult  (if  not  impossible)  to 
rigorously  establish  that  A(k)  is  positive  definite  since  it  is  not  known  for  sure  that 
the  matrix  quantity  in  brackets  that  pre-  and  postmultiplies  the  first  term  on  the  right 
side  of  Eq.  2.3-19  is  guaranteed  to  be  of  full  rank.  Only  because  of  the  presence  of  the 
boxed  term  in  Eq.  2.3-19  can  it  be  concluded  that "the  A(k)  evolving  from  Eq.  2.3-19  is 
indeed  symmetric  and  positive  definite  (independent  of  the  bracketed  terms  of  Eq.  2.3-19 
being  of  full  rank) .  Thus 


“A*i"1(k)  exists  for  0  s  k  i  Nq  and  is  symmetric." 
Thus  Eq.  2. 3-18b  can  be  premultiplied  throughout  by  A^”2(k)  to  result  in 


0-2  L'uRlj-Hu.sq^U  *  2  *'1X2^2  * 


(2.3-21S) 


or 


The  above  equation  specifies  the  discrete-time  optimum  gain  for  minimum  variance  estima¬ 
tion  within  the  class  of  SLU  decentralised  filters  (having  unbiased  estimates) .  Despite 
a  few  similarities  to  what  was  obtsined  for  continuous-time  in  Ref.  8,  the  result  re¬ 
ported  in  Eq.  2.3-21b  (as  obtained  in  tha  original  research  of  Refs.  IS  and  35)  is  a  new 
contribution.  (Consistent  with  standard  practice,  superscript  asterisks  will  be  sup¬ 
pressed  in  what  follows  for  notational  convenience.) 

Stop  3.  To  obtain  the  appropriate  Riccati  equation  to  be  solved  in  Implementing  the 
local ^teiima tor ,  differentials  the  Hamiltonian  of  Eq.  2.3-15  with  respect  to  A^(k+1)  to 

return  Eq.  2.3-9,  the  equation  of  evolution  that  P^tklk-l)  must  satisfy  as  s  dynamical 

constraint.  Substituting  tha  optimal  gain  of  Eq.  2.3-21b  back  into  Eq.  2,3-9  yields: 
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(2.3-228) 


(2.3-22b) 
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TABLE  2.3-1 

SUMMARY  OF  DISCRETE-TIME  MECHANIZATION  EQUATIONS 
OF  SURELY  LOCALLY  UNBIASED  (SLU)  FILTERING 


Order  of 
Calculations 


Step  2 


Step  3 


1-ff.FBI  WTlTta 


Step  4 


Step  5 


Step  8 


Step  9 


Mechanization  Equations  for  the  General  Case 


SVD  algorithm  performs  the 
following  factorization! 


Upon  making  the  following 
assignments; 


(v^Hw  n 

•  »t  ,2  o 
«  **•». 


‘u5*!  pr-.  i 
r* 


Hence 


h\  (k)®i (k)  *ii  <kj  * 


for  the  condition  p^  <  (as  confirmed/denied  from  the  factorization). 


Li — U^tkJL^k^LjOi) 


v  t  (k)— *t  l j  tk)v  i  (k)^v'  (k) 


fii(k)— ‘t’JtklB^k^S'fk)  H^k)— ♦ 


*t(k>— *l][j(k>*^k)^'  (k) 


R*  tk>— 48 1  j  <k)  Rt  (k  )*  ” j  (k  >  ^  (k ) 

Positioning  and  assigning  as;  ;  ; 

Hr— i 


»';»»•  si.  tail  1 
«L  lk)>  — U~ 4-J U-~l_A_ 

*£»>;  Xu^lvSi 


ItV-  *\“4“ 

l,y..  %  C*» 


*i ,k*1  W  *{*u'*u*i»)  *i  i*u‘*u*u) T  ‘(4i’^ii*u^ul 


*jJ  *»•{*„  *,**u*uk*»»  •>•)«*(,  i»t»J  »iwi  i  ** 


•This  discrete-ties  formulation  (derived  in  Ref.  IS  and  previously  unavailable  as  ac¬ 
knowledged  on  p.  92  of  Ref.  12)  was  necessary  before  coeputational  burden  of  Table  2.4-1 
could  be  quantified.  ••This  factorisation  was  theoretically  guaranteed  to  be  possible  in 
Refa.  8-12,  but  accoaplishaant  of  this  task  in  the  general  case  by  SVD  first  identified 
here  and  in  Oaf.  15.  Validated  SVD  software  is  available  for  this  task  (Refa.  38  and  39). 


For  insight,  it  is  mentioned  in  passing  that  six  distinct  steps  constituting  re¬ 
arrangements  (available  upon  request  and  provided  in  Ref.  15)  were  required  in  going  from 
Eq.  2.3-22a  to  the  concise  expression  of  Eq.  2.3-22b.  Bridging  this  gap  was  one  of  the 
most  gruelling  mathematical  challenges  encountered  in  this  investigation  even  if  it  con¬ 
sisted  of  only  algebraic  manipulations  of  long  unwieldy  expressions.  This  may  perhaps 
explain  why  from  1973  (Ref.  8)  to  1979  (Ref.  12),  the  discrete-time  case — useful  for 
digital  computer  implementation — did  not  emerge  even  though  the  discrete-time  case  was 
acknowledged  (p.  93  of  Ref.  12)  as  necessary  before  operation  counts  can  be  compiled  (as 
provided  in  Section  2.4).  The  9  simple  steps  for  mechanizing  an  SLU  filter  algorithm  are 
summarized  in  Table  2.3-1. 


2.4  Considerations  of  Computational  Burden  of  SLU  Filtering 


I 


m 


The  SLU  computer  memory  allotment  required  may  be  obtained  in  two  steps.  By  first 
reading  from  Table  V,  p.  750  of  Ref.  36,  the  memory  required  for  the  standard  Kalman 
filter  is 


Snj2  +  3nt  +  2niqi  +  Sj2  +  <li  +  1 


(2.4-1) 


Second,  accounting  for  the  additional  occurrence  of  such  characteristic  SLU  terms  as  L, ., 

-1  -1  11 
2n,  zi2»  Zii  •  an<*  *  the  corresponding  appropriate  dimehsions  are  then  added  to 

Eq.  2.4-1  to  yield 


5ni2+3n1+2niqi*3qt2*ql+nipt+3p12+l 


(2.4-2) 


The  number  of  adds,  multiplies,  and  logic  time  requirements  (as  compiled  in  Ref.  15  using 
the  techniques  of  Refs.  36  and  37)  are  summarized  in  Table  2.4-1  for  an  SLU  filter.  This 
information  can  be  used  to  establish  the  filter  cycle  time  required  for  processing  a 
measurement  once  the  add,  multiply,  and  logic  times  are  provided  for  the  intended  target 
machine. 

TABLE  2.4-1 

SUMMARY  OF  DISCRETE-TIHE  MECHANIZATION  EQUATIONS 
OF  SURELY  LOCALLY  UNBIASED  (SLU)  FILTERING 
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As  observed  in  Hem.  _ _  ... _ 

filter  treats  the  interaction  input  to  each  local  subsystem  as  xi  iv  _ _ _ 

mean,  Gaussian,  white  noise  disturbance,  but  of  the  appropriately  tailored  covariance 
level .  Therefore,  the  unmodified  SLU  filter  treats  all  interactions,  deterministic  or 
otherwise,  as  a  purely  stochastic  uncorrelated  interference  effect.  This  apparent  de¬ 
emphasis  of  the  significant  subsystem  interactions  may  be  initially  unsettling  for  some 
anticipated  applications  since  all  information  transfer  between  subsystems  can  only  be 
via  these  interactions  within  the  strict  SLU  filter  structure.  However,  there  are  exist¬ 
ing  refinements  of  the  SLU  filtering  formulation  (Refs.  11  and  12)  that  appear  to  take 
better  advantage  of  the  correlated  information  intrinsically  contained  in  the  subsystem 
interactions  by 

•  exchange  of  inter-subsystem  interactions  via  a  communication  channel, 

•  exchange  of  subsystem  state  estimates  via  a  communication  channel, 

•  modeling  possible  noise  in  the  communication  channel  if  necessary. 

An  overview  of  the  variety  of  identified  SLU  variations  already  developed  in  Refs.  11 
and  12  are  summarized  here  in  Tables  2.4-2  and  2.4-3.  In  particular,  use  of  a  form  of 


TABLE  2.4-fc 


ALTERNATIVE  VARIATIONS  IN  SURELY  LOCALLY  UNBIASED  (SLU)  FILTER  MECHANIZATIONS:  REF.  12 


3LU 

Filter 

Variations 

Information 

Exchange 

Order 

of 

Local 

Filter 

Communications 

Channel 

Noise 

Modeled 

in 

Design 

Decan tr ali zed 
or 

Centralized 

Design 

PCL 

None 

ni 

— 

Decentralized 

PSLU 

None 

ni 

— 

Decentralized 

PQD 

None 

N 

l  "i 
i-1  1 

Decentralized 

PLGD 

None 

N 

I 

i-1  1 

— 

Decentralised 

PB8LU* 

Interactions 

ni(>nl) 

Yea 

Decentralised 

»0  « 

ESLtl 

Interactions 

No 

Decentralised 

V 

RGO 

State  Estimates 

ni 

Yes 

Centralised 

PPOSLO 

State  Estimates 

ni 

Yea 

Centralised 

P° 

r  POSLU 

Stats  Estimate* 

ni 

No 

Centralised 

P  1 

None 

fii  *  nRON,i 

Decentralised 

*?rlaM»  SLU  candidates  for  the  JTIOS  RelHav  application  in  the  investigation  of  Ref.  IS. 


approximate  "aggregation*  in  filter  design  la  auggasted  in  chapter  S  of  Ref.  12  as  a  com¬ 
promise  to  obtain  a  so-called  Reduced-Order  Model  (ROM)  filter  that  has  better  perform¬ 
ance  then  the  SLU  while  also  offering  more  realistically  dimensioned  computations  than 
coaatonly  associated  with  a  centralised  Kalman  filtar.  Unlike  what  the  specialised  tech¬ 
nical  term  ‘aggregation*  may  at  first  glance  auggaat,  the  "method  of  aggregation*  is  a 
wail  developed  technique  (Refs.  S5,  56,  and  5?)  for  reallstlcal ly  representing  the 
effects  of  interactions  amongst  the  constituents  Of  a  large-scale  ay a ten  by  using  Just 
a  reduced-order  smaller  scale  system.  A  pedagogically  lucid  and  self-consistent  deriva¬ 
tion  and  explanation  of  the  ROM  filter  ia  provided  on  pp.  5-2  to  5-13  of  Ref.  15  end  its 
potential  for  the  JTIOS  RelHav  application  ia  summarized  in  Ref.  35. 


3  THS  SEQUENTIALLY  PARTITIONED  ALGORITHM  (SPA)  FOR  DECENTRALISED  FILTERING 

3.1  Derivation  of  the  Decentralised  SPA  Approach  to  Decentralised  Filtering 

An  interesting  approach  to  decentralised  filtering  first  described  in  a  1971  British 
Ph.D.  thesis  (Ref.  14)  by  N.  Shah  has  bean  ralatlvely  recently  reexamined  in  Ref.  13  for 


TABLE  2.4-3 


TERMINOLOGY  OF  SLU  ALTERNATIVE  VARIATIONS 


Symbolic  Designation 

Filter  Name  Designations  (from  Ref.  12) 

F 

CL 

Completely  Localized  Mechanization  (of  otherwise  Centralized)* 

F 

SLU 

Surely  Locally  Unbiased 

F 

GD 

Global  Dynamics 

F 

LGD 

Localized  Global  Dynamics 

PESLU 

* 

Expanded  Surely  Locally  Unbiased 

F° 

C  ESLU 

Same  as  above,  without  modeling  communications  noise. 

F 

rRGD 

Reduced-Order  Global  Dynamics 

ppdslu 

Partially  Decentralized  Surely  Locally  Unbiased 

F° 

r  PDSLO 

Same  as  above,  without  modeling  communications  noise. 

F 

ROM 

Reduced  Order  Model 

*Treats  local  Model  as  if  it  were  Global  Model. 


the  discrete-time  case  of  only  two  subsystems. 


Remark  3.1-1 t  A  few  typos  occur  in  the  derivation  of  the  SPA  filter  appearing  in  Ref.  13 
(pp.  534-7} .  Notably,  the  measurement  equation  appearing  in  Eq.  11.3.1  involving  y^(k+l) 

should  have  a  time  index  of  k+1  rather  than  k  for  x^.  Similarly,  the  time  index  of  the 
measurement  noise  v^  appearing  in  Eq.  11.3.7  should  bo  k+1  instead  of  k. 


The  mechanisation  equations  for  this  decentralized  filtering  approach  are  now  derived 
for  an  arbitrary  number  N  of  decentralised  subsystems  as  expected  to  be  encountered  in 
the  most  general  application.  Returning  to  Eq.  1.7-20c 


{k+l)«*Tti  (k+l,k)  x^  (k)«|t  ♦iJ(k+l,k)xJ(k)j 


(k) 


1*1 


(3.1-11 


and  Eq.  1.2-2,  augmented  by  Eq.  1.2-4,  as 

ti(k)-fi1(k)xl(kl*Ml(k)Lt(k)x(k)+vilkl  (3.l-2a) 

.  N 

•ai(k)xl(k)+al(k)  t  LiJ(k)x1(k)+vl(k)  (3.1-2b) 

1*1 

(for  1*1,2, . ,N) .  Consistent  with  the  definition  of  e-(k+l|k)  in  eq.  2.2-2,  the  estima¬ 
tion  error  is 

e^kjk)  A  xl(k)-i»lCklkl  (3.1-3) 

where 

x^ (kjk)  -  B(xi(k)|*(k)}  (3.1-4) 

While  information  patterns  can  be  related  to  the  associated  underlying  expanding  sub- 
sigma  algebras  (of  a  probability  space  or  triple)  with  respect  to  which  conditional  ex¬ 
pectations  ars  taken  (as  Radon-Nikodym  derivatives) ,  these  concepts  do  not  tppesr  to  be 
necessary  hers  and  so  are  avoided  to  expedite  the  presentation.  By  adding  end  subtract¬ 
ing  the  asms  tanks  simultanaoualy,  Eq.  3,1-1  for  the  1th  subsystem  may  be  rewritten  as 


■  N 

t.  (k+l)=*.  .  (IC4-1, Jc) X,  (kH  E  l.  .  (k+l,k)x,  (k) 
1  1  j*i  3 

*S*i 


while  the  measurement  equation  for  the  i ^  subsystem  is  represented  as 

*  N 

*u{Jc)-itOc)Xi(lc)+H?  (k)  r  ^(kJx^Jtlk-D+vJOc) 

ill 


+wt(k) 


(3.1-5) 


(3.1-6) 


where  in  the  above 


A  N  . 

wf(k)»v, (k)  +  E  *j  , (k+l,k)e.  (k|ki 
1  j-1  13  3 

j*i 

v*(k)^vi(k)>Z  Li.(k)  e. (k|k-l) 


j-1 


(3.1-7) 


(3.1-8) 


Under  the  following  assumptions  at  time-step  k. 

Assumption  3,1-1: 

Xj(k[k-1)  is  known  (deterministically)  for  jj<i,  j“l,2,...,N 
Assumption  3.1-2; 

«j(k|k),  e.j(k|k-l)  are  Gaussian  and  white  for  j><i,  j»l,2,...,N 


(3.1-9) 


(3.1-10) 


(just  as  residuals  should  ideally  be  white  when  filter  models  and  truth  models  are  appro¬ 
priately  matched),  therefore  (as  a  consequence  of  Assumption  3.1-2): 

Assumption  3.1-3: 


w*(k)  and  v  *{k)  ean  be  treated  as  Gaussian  white  process  and 
measurement  noises 


(3.1-11) 


and  the  standard  Kalman  filter  algorithm  can  be  applied  to  ^aeh  subsystem  (p.  536,  Ref. 
13)  with  the  following  appropriately  modified  covariances: 

& 

QJi(k)AE[w*(k)wjT(k)]  -0t(k)*  $.  tk|ki* J(k»i.k)  (3.1-12) 


t  ^ 

R1J{k)fiB[vi*(k)v*T(k)]  •AiiUl*ai(k^  l  tii  (h)Pj(k|k-l)l.^j(k)]  Jf  (k)  (3.1-13) 

j*i 

the  result  of  applying  the  standard  filtering  formulation  to  each  subsystem  and  utilisa¬ 
tion  of  the  above  assumptions  is 


\  (k‘-lik)-lii!k«l,k)xi{klk)^E^*ii(k*l,k)xj  (klk)j 


•l4l0i*l. 


.  3*i 

,  H 


(3. l-14a) 


(3. l-14b) 


L.klx,  (kik)^f  t  x,(k*ljk)| 

V.1  J 

,  ,  y 

St  ik.lhi.1, »»»  j»u»»n-*,  »»M»t  »»iJ*5»s4»»»i  :»•;!%«}  (3.1-15) 


>• 

5*i 


The  filter  gain  matrix  appearing  in  Eq.  3.1-15  can  be  obtained  from  the  following 

Rt  (k»l)«Pil  (k*l |k> (k*l)  (Si  (k*l)  Pit  (k*l |k)8* (kal)  (kai)]  (3.1-16) 

where  the  extrapolate  step  consist*  of 

Pu  (k+HkJe^fka^kJPjJ  (k|k)#  J  (kal.kl+OJj  (k) 


(3. 1-1?) 


and  the  update  step  consists  ojl  vaais  w. ^ — .  _  .  . 

PtJ  (k+lIk+lwJl-K^k+UH^k+l)]  Pii(k+l|k)  [l-ic^k+nS  (k+l)]T 

+Ri(k+l)Ri|(k+l)K^{k+l)  (3.1-18) 

It  is  noted  that  this  filter  is  suboptimal  as  perceived  from  a  centralized  point-of-view 
since  it  does  not  account  for  possible  off  diagonal  terms  in  the  covariance  matrix.  How¬ 
ever,  the  above  so-called  SPA  filter  is  computationally  VF>ry  attractive  to  implement, 
since  it  requires  a  substantially  smaller  number  of  elementary  multiplication  and  addi¬ 
tion  operations  and  less  computer  storage  than  a  globally  optimum  Kalman  filter. 

Further  simplifications  of  the  SPA  filter  were  possible  for  the  JTIDS  RelNav  applica¬ 
tion  considered  in  Refs.  15  and  35  since  there  is  no  cross-coupling  in  the  linearized 


system  error  models  as  represented  by 

*  l}(k+l,k)  -  o  for  i  i  j  (3.1-19) 

Hence,  Eqs.  3.1-5,  3.1-7,  3.1-12,  3.1-I4a,  and  3.1-15,  respectively,  degenerate  to 

xi(k*l)»iii(k+l,k)x1(k)+wi*(k)  (3.1-20) 

vj(k)£v.(k>  (3.1-21) 

Q«i(k)iE[wJ(k)wJT(k)]-  E[wi(k)wT(k)]»Qii(k)  (3.1-22) 

xi(k+ljk)-4si (k+i.k)xi(k|k)  >  (3.1-23) 

.  .  ,  .  .  * 

& 


As  seen  from  Eqs.  3.1-12  and  3.1-13,  respectively,  the  SPA  filter  accounts  for  intercon¬ 
nection  effects  by  utilising  flctiti  us  components  of  process  and  measurement  noises  with 
appropriately  enlarged  covariances  (not  unlike  aspects  of  the  SLU  approach  already  noted). 
Thus  the  entire  discrete-time  implementation  for  the  condition  of  Sq.  3.1-19  is  de¬ 
scribed  by  Eqs .  3.1-13,  3.1i2,  3.1-23,  3.1-24,  3.1-16.  3.1-17,  and  3.1-18,  as  summarised 
in  Table  3.1-1.  Kechanizatl  of  these  six  equations  constitute#  the  entire  computation¬ 
al  burden  of  SPA  and,  unlike  SwU,  involves  no  SVD  factorization  or  additional  transforma¬ 
tions.  The  computational  burden  of  SPA  filtering  is  quantified  in  Section  3.2. 

TABU)  3.1-1 

SUMMARY  OF  OXSCRSTR-TIME  MECHANISATION  SQUAT  I  QMS 
OK  SEQUENTIALLY  PARTITIONED  ALGORITHM  (SPA) 
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In  general,  for  the  SPA  filtering  approach 

E  [xx  (k+1)  |  Zi  (k)]  ^  E[xi(k+1)  |z(k)j  (3.1-25) 

so  that  taking  total  expectations  throughout  Eq.  3.1-25  yields: 

E(xi(k+l|k)}=E{E[xilk+l)  |3i(k)j  }*E{e[x  .  (fc+1)  |z(k)]  }=E{x,  (k+1)  }  (3.1-26) 

The  above  indicates  that  the  SPA  decentralized  filter,  along  with  most  other  practical 
filters,  is  a  biased  estimator  (Ref.  34)  (e.g.,  even  in  linear  Gaussian  applications,  the 
optimum  centralized  Kalman  filter  has  practicality  constraints  that  frequently  limit  the 
order  of  the  filter  implementation  that  -an  be  computationally  accommodated  to  be  much 
lower  than  the  “truth"  model,  thus  yielding  a  biased  filter  when  implemented).  As  long 
as  the  bias  is  either  evaluated,  compensated,  or  shown  to  be  tolerably  small,  its  pres¬ 
ence  should  not  interfere  with  the  intended  purpose  of  any  filter.  Indeed,  it  is  the  SPA 
filter  that  appears  to  be  most  useful  for  the  JTIDS  RelNav  application  as  discussed  in 
Refs.  15  and  35.  Results  of  subsequent  simulations  are  reported  in  Ref.  60. 


3.2  Computational  Burden  of  SPA  Filtering 


The  SPA  computer  memory  allotment  required  may  also  be  obtained  by  the  same  two  step 
procedure  discussed  in  Section  2.4.  The  SPA  filter  uses  additional  intermediate  scratch 
calculations  of  dimension  (ru  x  n^) ,  (q^  x  q^) ,  and  (q^  x  n^)  beyond  those  encountered  in 

a  conventional  Kalman  filter  so  the  total  memory  requirement  is 


+  3n.  +  3n.q, 


+  q.  +  1 


(3.2-1) 


The  number  of  adds,  multiplies,  and  logic  time  requirements  are  summarized  in  Table  3.2-1 
for  an  SPA  filter.  These  results  for  SPA  filtering  were  obtained  by  applying  the  stand¬ 
ard  methodology  for  operations  counts  from  Refs.  36  and  37  to  the  algorithms  summarized 
in  Table  3.1-1  in  a  manner  analogous  to  what  was  done  in  Section  2.4.  Both  the  SPA  (with 
the  special  structure  of  Eq.  3.1-19)  and  the  general  SLU  filter  implementations  are  anal¬ 
ytically  demonstrated  to  be  stable  in  Section  4. 


4.  STABILITY  OF  DISCRETE-TIME  DECENTRALIZED  FILTERS  OF  THE  SLU  AND  SPA  CLASSES 


4.1  Stability  Overview 


The  proof  of  stability  for  the  discrete-time  formulations  of  SLU  and  SPA  proceeds  in 
a  manner  analogous  to  the  continuous-time  proof  provided  in  Rer.  10  by  also  utilizing  the 
stability  framework  of  Ref.  16.  The  crux  of  the  proof  involves  demonstrating  that 


V [x(k| k-1)  ,k]  ■  xT(k|k-l)  P_1  (k|  k)  x(kl  k-1 )  (4.1-1) 

is  a  valid  Lyapunov  function  by  demonstrating  that  it  is  positive  definite  and  that  tho 
first  variation  along  the  trajectory  of 

x(k+l 11)  -  ( 4(k+l,k) -  ♦(k+l,k)K(k)H(k)l*(k|k-l)  (4.1-2) 

is  negative  definite.  The  inner  product  matrix  in  Eq.  4.1-1  being  the  inverse  of  the 
matrix  that  evolves  from  the  matrix  Riccati  equations  (encountered  in  both  SPA  and  SLU 
filtering)  which  can  bo  demonstrated  to  be  uniformly  bounded  above  and  below  under  the 
condition  of  uniform  complete  observability  of  the  subsystem  (without  also  requiring 
uniform  complete  cont -ollability  or  oven  controllability  due  to  weakened  hypothesis 
provided  by  Ref.  16). 

However,  recent  results  (Ref.  17,  Appendix  C  of  Raf.  18,  and  p.  244  of  Ref.  19)  indi¬ 
cated  a  minor  error  in  the  upper  and  lower  bounds  appearing  in  Refs.  40  and  41  (as  util¬ 
ized  in  Refs.  16  and  10).  This  error  is  corrected  in  Appendix  A  and  shown  to  not  adverse¬ 
ly  affect  the  stability  conclusions  for  SPA  and  SLU  as  long  as  all  observation  matrices 

Hi(ki)  are  of  full  rank  (a  new  restriction)  as  well  as  R^tk)  being  bounded  and  P^(0)>0. 

Bv  this  approach,  only  asymptotic  stability  is  established  for  SLU  and  SPA  in  contradis¬ 
tinction  to  exponential  asymptotic  stability. 


4.2  Precedent  of  an  Analytic  Framework  Enabling  a  Demonstration  of 
Stability  for  Several  Decentralized  Filtering  Mechanisations 


A  discrete-time  filter  of  the  following  form 


x(k+l|k}*4(k+l,k)x(k|k)»#(k+l,k)jl(k|k-l)+#tk+l,k)K(k)[ i(k)-H(k)x(k|k-$]  (4.2-la) 

•  K(V)N(k^  i!k|k-l)*«(k*l,k>K(kii <k 


(4. 2- ISO 


TABLE  3.2-1 


SUMMARY  OF  MECHANIZATION  EQUATIONS  AND  QUANTIFICATION  OF  COMPUTATIONAL  BURDEN* 
OF  SEQUENTIALLY  PARTITIONED  ALGORITHM  (SPA)  FILTERING 


Order  of 
Calculations 

Required 

SPA  Equation 

Being  Implemented 

Total  Number 
of  Adds 

Total 
Number  of 
Multiplies 

Logic  Time  (Estimated) 

Step  1 

Eq.  3.1-23  (same 
as  standard 

Kalman  filter) 

niJ  -  ni 

10*6ni'!*37ni 

Step  2 

Eq.  3.1-12 

(2n13-n1J)M 

(20+12njL3+42ni2*i6ni)N 
♦27*51^  ♦MUL 

Eq.  2.1-22  (same 
as  standard  KF 
for  JTIDS  RelNav 
Application) 

non* 

non* 

non* 

Step  3 

Eq.  3.1-17  (same 
as  standard  KF) 

4n1J-ln1J 

V 

24n1 3+|»Dl  J+*4i>1+6  7+MOt, 

Step  4 

Eq.  3.1-13 

)« 

(20+«nlJql+«njql3*21n1qi*21qiJ 
♦Wn.+ltqj  lH+17-Kqj  3*21qj  5+21qi 

♦MUL 

Step  5 

Eq.  3.1-16  (same 
as  standard  KF) 

t7+tn,Jq1+«2n1q1+32ni+12n;qlJ* 

7.5q1'+«lq13*t«5n1,+I0»q1*(q1+ 

2q1')DIV*(0.5qi3*2.5ql*l»*M, 

Step  6 

Eq.  3.1-24  (for 

JTIDS  RelNav, 

N  *  1,  Refs,  35 
and  15) 

» 

•  4*12q1,«12n1ql*«2nl*U*q1*d  • 
mm*  (20Hnl  J*21n1q1  *S  ini )  > 

Step  7 

Eq.  3.1-18  (same 
as  standard  KF) 

- 

w 

V 

«7*l2nJ,3+47n1,*J2n1*MUL 

* 

Compiled  based  on  precedent,  convention,  and  guidelines  Bet  forth  in  Refs.  36,  37,  and 
15  for  such  calculations,  where  in  the  above  SPA  specialization: 


n^  dimension  of  the  subsystem's  local  SPA  filter  (defined  in  Eq.  1.2-1) 


q^  dimension  of  subsystem's  local  measurements  (defined  in  Eq.  1.2-2a) 


MUL  unit  logic  time  required  for  a  multiply  by  an  element  retrieved  from  core  memory 
(by  indirect  addressing)  rather  than  by  an  element  from  tho  Arithmetic  Logic  Unit 
(ALU) 


DIV  unit  logic  time  required  for  a  division  by  an  element  retrieved  from  core  memory 
(by  indirect  addressing)  rather  than  by  an  element  from  the  ALU. 


N  number  of  individual  subsystems  contributing  covariance  information. 


with 


K(k) 

•  P(k|k-l)HT(k) (a(k)P(k|k-l)HT(k)+R(k))‘1 

(4.2-2) 

P(k|  k-l) 

■  ♦<k,k-l)P(k-l|k-l)Vr(k,k-l)+Q(k) 

(4.2-3) 

P(k|k) 

•  (l-K(k)H(k))P(k|k-l) 

(4.2-4a) 

*  (I-K(k)H  (k)  ]  p(k|  k-1)  (I-K  (k)H  (k)  ]T+K  (k)R  (k)KT(k) 

(4.2-4b) 

POtgi 

•  90 

(4.2-5) 

(for  convenience,  Eqs.  4.2-2,  4.2-3,  and  4.2-4b  may  be  rewritten  as: 


P  (k+l|k)»4P  (k|k-l)  4T-$P  Oc  |k-l)HT[HP(k|k-l)HT+R]  “'LHP  (k|k-l)  4T+Q  (4.2-6a) 

-*[HTR"1H+P‘1(k|k-1)]  -1«T+Q  (4.2-6b) 

as  will  later  be  utilized)  is  demonstrated  to  be  exponentially  asymptotically  stable  (p. 
222  and  pp.  228-9  of  Ref.  58)  if  the  following  three  conditions  are  each  satisfied: 

1.  $(k+l,k),  H(k) ,  Q(k) ,  R(k) ,  and  R~^(k)  are  bounded;  (4.2-7) 

2.  (4 (k+l,k) ,H(k) )  is  uniformly  completely  observable  (pp,  313-4  of  Ref.  18),  that 

is,  there  exist  two  positive  scalars  and  a2  such  that 

a, I  <  I1  4’r(j,k)H'r(j)R"1(j)H(j)4(j,k)<  a,I  (4.2-8) 

1  ~  j-t-r, 


for  some 


*  i  0  (4.2-9) 

HS2-1  (4.2-10) 

3.  [4(k+l,k) ,D(k) ]  is  uniformly  completely  controllable  for  any  D(k)  such  that 

D(k) DT(k)  =  Q(k)  (4.2-li) 

that  is,  there  exist  two  positive  scalars  a3  and  such  that 

t-1  - 

o,I<  E  <Mk,  j+l)Q  (j)  41  (k,  j+1)  <a,I  (4.2-12) 

J  “  j-t-n  ~  4 

for  some  l  and  n  as  in  Eqs.  4,2-8  and  4.2-9,  respectively. 

As  mentioned  on  p.  222  of  Ref.  58,  sufficient  conditions  for  asymptotic  stability,  but 
not  necessarily  exponential  stability  are  known  to  be  1.  and  2.  above,  and  3.  relaxed  to 
Eq.  A. 1-1  of  Appendix  A. 

For  a  discrete-time  system  of  the  form 


y (k+1)  -  f  (y(k)  ,k)  (4.2-13) 

where  y(k)  is  the  state  at  time-step  k,  if  there  exists  a  so-called  scalar  Lyapunov  func¬ 
tion  V(y(k),k)  such  that 

a.  V(0,k)  H  0  for  all  k  (4.2-14) 

b.  0<yx  ( !  !y(k)  1 1)  <v[y(k),kj  <Y2  ( |  |y  (k)  1 1 )  for  all  k>k'  (4.2-15) 

and 

yl(0)  -  Y2(0)  •  0  (4.2-16) 

with 

v  ||y||  >  -►•  M  llyll  ->•  (4.2-17) 

c.  v[y,k]  is  continuous  in  y  (4.2-18) 

d.  V(y,k]  «  M  y  +  «  (4.2-19) 

e.  AV(y,k)  -  rate  of  increase  of  VI...]  along  the  motion  of  a  trajectory  or  path  of 

Eq.  1.2-13,  starting  from  y  at  time-step  k 

A{v[y(k+1)  ,k+l]  -  v[y(k)  ,k]  )■  (.f.pTfzeT-  <0  (4.2-20) 

or,  equivalently, 

V(y(k+1) ,k+l]-V(y(k) ,k)  <  -  y3(I I y(k) | ( )  <o  (4.2-21) 


then  the  system  described  by  the  time  evolution  Eq.  4.2-13  is  asymptotically  stable  (in 
the  large)  (p,  396,  Corolary  1.2*  of  Ref.  59,  p.  240  of  Ref.  51).  The  proor  of!  asymp- 
totic  stability  (Appendix  C  of  Ref.  18)  (pp.  240-3  of  Ref.  51)  for  the  filter  of  Eq. 
4.2-1  consists  of  utilizing  a  discrete-time  Lyapunov  function  of  the  form  of  Eq.  4.1-1 


(Eq.  32  of  Ref.  16)  as  a  standard  way  to  demonstrate  stability  of  the  autonomous  time- 
varying  system  of  the  form  of  Eq.  4.1-2  (Eq.  31  of  Ref.  16)  (where  Eq.  4.1-2  is  observed 
to  be  the  homogeneous  "undriven"  portion  of  the  filter  of  Eq.  4.2-ld.  While  adherence  of 
Eq.  4.1-1  to  conditions  a.,  c.,  and  d.  is  immediate,  it  requires  more  work  to  establish 
that  conditions  b.  and  e.  are  satisfied  as  a  consequence  of  satisfying  the  conditions  1, 
2,  and  3'  of  the  hypotheses  as  Eqs.  4.2-7,  4.2-8,  and  A. 1-1.  The  following  bounds  (ob¬ 
tained  by  an  extremely  tedious  and  careful  analysis  of  McGarty  as  Eqs|  C.88  and  C.56  of 
Ref.  18)  are  correctly  validated  • 


where 


0  <  A  <  P(k|k)  <  B 


(4.2-22) 


A 


l  *(k,j+l)Q(j)*1(k,j+l) 
j*k-n 


,*S«4 

“1*3 


t  *T(j,k)HT(j)R'1(j)H(j)4(j,k) 
j*k-n 


(4.2-23) 


B 


A  ^  T  T  *1  1*"^  "  *2*' 

jj-x-n  l  3 


£  ♦  <k,)+l>Q(j)BT(k,  5*1) 
J-k-o 


(4.2-24) 


and  a^,  ct2,  a^,  a4,  and  n  ate  defined  in  Eqs.  4.2-8,  4.2-9,  and  4.2-12.  From  Eq. 

4.2-22,  inverses  can  be  taken  throughout  to  yield 


0  <  B*1  <  P-1(k|k)  <  A-1  (4.2-25) 

and  finally  pre-  and  postmultiplying  the  abcve  by  xT(k|k-l)  and  x(k|k-l),  respectively, 
yields 


0  < 


xT(k|  k-l)B~1,x(k|  k-1) 


xT  ( k  I  k-1 )  P_1  (k  |  k-1 )  x  ( k  |  k-1 ) 


<  x(k|k-l)A-li(k|k-l) 


(4.2-26) 


Thus  by  definition  of  the  appropriate  Lyapunov  function  to  be  used  in  this  situation  as 
Eq.  4.1-1,  Eq.  4.2-26  can  be  alternatively  recognized  as 


0<||x(k|k-l)||2  ,  <  v[x(k|k-l)  ,k|  <  ||  x(k|k-l) 

B’A(k)~  1  J  ~ 


2 

A_1(k) 


(4.2-27) 


which  certifies  all  three  of  Eqs.  4.2-15,  4.2-16,  and  4.2-17  comprising  condition  b. 
above. 


By  a  procedure  related  to  a  dual  optimal  control  problem  (where  the  cost  function  is 
reminiscent  of  the  exponent  in  an  associated  probability  density  function,  the  following 
condition  holds  (Eq.  C.136  of  Ref.  18) i 

v[li<k|k-l),k]-v[$(k-njk-n-l),k-«]<  -e^Bgll&Cklk-lJlJ2  (4.2-28) 

where  Bg,  and  Bg  are  positive  constants  defined  on  pp.  374-7  of  Ref.  18. 

Now  n  *  1  in  Eq.  4.2-28  demonstrates  that  the  condition  e.  (Eq.  4.2-21)  is  satisfied 
for  the  Lyapunov  function  postulated  in  Eq.  4.1-1.  All  five  conditions  (a.-e.)  being 
satisfied  by  the  Lyapunov  function  of  Eq.  4.1-1  therefore  suffices  to  demonstrate  asymp¬ 
totic  stability-in-the-large  for  the  autonomous  unforced  system  of  Eq.  4.1-2  or,  corres¬ 
pondingly,  asymptotic  stability  for  the  randomly  forced  system  of  Eq.  4.2-lb  that  iB  the 
end  objective. 


4.3  Analytic  stability  of  the  8LU  Filters 

In  analogy  to  what  is  done  on  p.  200  of  Ref.  10  for  continuous-time,  an  existing  sta¬ 
bility  theorem  that  serves  as  the  basis  of  what  is  done  in  Sections  4.3  and  4.4  is  as 
follows  (same  statement  as  Theorem  4.1  of  Ref.  16,  but  proof  is  augmented  and  corrected 
as  indicated  in  Appendix  A) s 

Theorem  4.3-1  (Anderson) : 

If  4(k,k-l),  *"1(k,k-l),  H(k) ,  Q(k) ,  and  R’1(k)  are  bounded,  H(k)  is  of  full  rank, 

(♦,HR-1^2)  is  uniformly  completely  observable,  and  condition  3'  (as  Eq.  A.l-1)  is  satis¬ 
fied,  then  a  filter  of  the  form  of  Eqs.  4.2-lb  to  4.2-5  is  asymptotically  stable  (in  the 
large) . 

For  the  local  SLU  filters,  as  long  as  the  pair  H^kjR^^fk)  (of  Eq.  1.2-2a)  and 
*ii(k+l,k)  (of  Eq.  1.2-40)  are  uniformly  completely  observable,  Hi(k)  is  bounded  and  of 
full  rank,  and  condition  3'  is  satisfied  via  an  appropriate  choice  of 


then  each  local  SLU  filter  is  stable.  This  conclusion  is  reached  by  making  the  following 
associations  in  Eg.  2.2-41  and  Eg.  2.3-22b 


♦  4 - 

(*ii-L'iisil> 

(4.3-2) 

K4— 

*i2 

(4.3-3) 

Q< - 

- «i+EU*il£U 

(4.3-4) 

a<— 

~ 5i2 

(4.3-5) 

r  i — 

— 1 *12 

(4.3-6) 

Po* — 

—poi 

(4.3-7) 

to  obtain  eguations  corresponding  to  Egs.  4.2-lb  and  4.2-6a,  respectively.  The  SPA 
filter  has  a  structure  that  also  allows  stability  to  be  demonstrated  by  invoking  Theorem 
4.3-1. 


4,4  Analytic  stability  of  the  SPA  Filter 

For  the  condition  of  Eg.  3.1-19  (as  encountered  for  JTXDS  RelNav  in  Refs.  15  and  35), 
the  mechanization  eguations  for  the  SPA  filter  are  Egs.  3.1-23,  Eg.  3.1-12  (degenerating 
to  3.1-22),  Eg.  3.1-17,  Eg.  3.1-16,  Eg.  3.1-24  (degenerating  to  Eg.  4.1-2  for  condition 
of  Eg.  3.1-19),  and  Eg.  3.1-18.  Then  by  an  association  similar  to  that  used  in  Section 
4.3  and  Ref.  10  with  notable  egui valences  such  as 

R  4 -  Rh*  (4.4-1) 

H  4 -  H1  (4.4-2) 

K  4 -  Kj  (4.4-3) 

and  the  other  associations  being  even  more  obvious,  Theorem  4.3-1  can  be  invoked  to  con¬ 
clude  asymptotic  stability  of  the  SPA  filter  as  specialized  for  the  condition  of  Eq. 
3.1-19  since  it  has  the  reguisite  structure  [utilized  in  Ref.  10  as  enunciated  in  Ref. 

16  (with  proof  corrected  herein  in  Appendix  A) ) . 


5  APPLICATIONS  OF  DECENTRALIZED  FILTERING 

Throughout  Refs.  8-12,  potential  applications  of  decentralized  filtering  are  indi¬ 
cated  by  examples  in  interconnected  power  systems  for  freguency  monitoring  (as  a  prelude 
to  stabilized  maintenance)  and  power  load  estimation.  Refs.  15  and  35  investigate  appli¬ 
cation  of  decentralized  filtering  to  the  Joint  Tactical  Information  Distribution  System 
(JTIDS)  Relative  Navigation  (RelNav)  feature  currently  being  developed  by  the  U.S.  Joint 
Services  and  eventually  intended  for  NATO.  The  SPA  decentralized  filtering  formulation 
is  recommended  in  Refs.  15  and  35,  since  it  possesses  a  reasonably  mild  computational 
burden  and  an  analytic  guarantee  of  "filter  stability"  (viz.,  an  ability  to  at  least 
track  the  true  state  adeguately  whether  or  not  the  true  states  are  stable)  as  a  preregui- 
site  for  answering  any  other  more  probing  gueations  concerning  the  stability  of  the  JTXDS 
net  or  of  the  common  grid  defined  by  the  "controller"  for  relative  navigation/targeting. 
Further  discussion  and  simulation  studies  of  tho  SPA  and  other  approaches  to  RelNav  are 
provided  in  Ref.  60. 

Sensor  fusion  is  a  concept  that  pervades  several  fields  as  addreosed  for  Identifica¬ 
tion,  Friend,  Foe,  or  Neutral  (1FFN)  in  Refs.  63,  64,  and  65  and  for  Communication, 

Command,  and  Control  (C  )  on  p.  204  of  Ref.  67.  Ref.  63  indicates  the  breadth  of 
approaches  that  are  being  pursued  for  the  next  generation  of  IFF  beyond  just  the  intere- 
gator/transponder  beacon  of  the  current  L-band  Mark  12.  While  Hughes/Fullerton  has  pur¬ 
sued  IFFN  using  information  theoretic  techniques  (i.e.,  Shannon  theory,  entropy  argu¬ 
ments,  and  minimization  of  well-poeed  cost  functions  for  achieving  objectives)  for  air¬ 
craft  identification,  Hughea/Culver  City  has  used  the  Bayesian  approaoh  to  examine  criti¬ 
cal  issues  of  intersensor  correlation,  and  Ref.  63  considers  Theoretical  and  practical 
aspects  of  implementing  Bayesian -based  maximum  likelihood  decisions  and  majority  rule 
decisions  in  this  application.  Indeed,  Dynamics  Research  Corporation  (DRC)  has  reported¬ 
ly  investigated  use  of  "fuzzy  set"  teohnigues  for  reducing  the  indicated  computational 
burden  for  this  challenging  problem,  where  even  engine  harmonics  are  being  exploited  for 
aircraft  identification  (Ref.  68)  by  fighter  aircraft  in  the  same  vein  as  the  airoraft 
identification  from  radar  signatures  performed  by  the  E-3A  AWACS,  but  with  greater  com¬ 
putational  capacity  than  a  fighter  is  availed  with.  A  recently  developed  methodology  for 
potentially  reducing  the  computational  burden  in  providing  adequate  data  base  handling 
for  these  life-or-death  (friend  or  foe)  classifications  (by  a  novel  conversion  of  the 


problem  of  queries  on  an  imprecise  data  base  into  a  problem  of  statistical  inference)  is 
described  by  E.  Wong  in  Ref.  73.  Essential  aspects  of  a  Bayesian  filtering  approach  and 
consideration  of  alternative  (but  necessary)  approximations  are  treated  in  Ref.  66.  This 
IFFN  area,  where  several  possibly  correlated  measurement  sensors  are  utilized  in  making 
identification  decisions  under  dynamic  conditions,  appears  likely  for  fruitful  utiliza¬ 
tion  of  decentralized  filtering  techniques  such  as  Speyer's  (Refs.  32  and  33)  or  the 
multirate  multiple  filtering  approach  of  Section  1.5. 

3 

A  fairly  bleak  picture  of  the  U.S.  and  Allied  C  ,  as  of  1980,  as  made  evident  from 
the  unpleasant  experiences  of  the  "Nifty  Nuget*  war  gaming  exercise  simulating  an  all-out 
conventional  war  against  the  attacking  forces  of  the  Warsaw  Pact  in  Europe,  is  portrayed 

3 

in  Ref.  74  as  a  variety  of  solutions  are  sought  for  aspects  of  the  total  C  problem.  An 

3 

analytic  framework  for  C  considerations  in  a  form  compatible  with  modern  state-variable 
estimation  techniques  is  laid  out  by  the  Technical  Director  of  the  Naval  Electronic 
Systems  Command  in  Ref.  70,  with  further  vital  descriptive  elaborations  in  Refs.  75  and 

76.  Another  perspective  on  current  research  issues  persisting  as  problems  in  c3  are 
examined  in  Ref.  67  and  the  analytical  nature  of  compatible  hierarchical  decentralized 

structures  for  C3  are  considered  in  Refs.  67,  69,  70,  and  71.  On  the  other  hand,  it  is 
important  to  take  into  account  the  moderating  remarks  of  Ho  (Ref.  77)  to  the  effect  that 

3 

a  pat  solution  to  the  C  problem  is  not  readily  at  hand  from  estimation  and  control 
system  theorist,  but  must  be  carefully  tailored  and  developed  in  order  to  successfully 

resolve  the  C3  problems.  Leads  toward  this  end  are  offered  by  Ref.  78. 

Current  specifications  for  the  Phase  1  integration  of  the  JTIDS  RelNav  and  Global 
Positioning  System  (GPS)  on  the  F-16A  call  for  utilization  of  three  separate  filters, 
one  for  GPS,  one  for  RelNav,  and  one  dedicated  to  aided  inertial  navigation.  This  type 
of  situation  appears  a  likely  candidate  for  the  multirate  filtering  approach  of  Section 
1.5  (as  already  applied  to  a  navigation  example  in  Ref.  61).  The  GPS  filter  could  be 
used  to  incorporate  position  and  velocity  information  at  a  fast  rate  in  an  unjammed 
environment,  then  feed  it  to  a  slower-rate  higher  fidelity  navigation  filter  used  for 
aiding  the  inertial  navigation  system  in  an  integrated  manner. 

For  two  separate  GPS  and  JTIDS  filters  of  dimension  12  and  15,  respectively,  as  con¬ 
sidered  in  Ref.  82  (which,  unfortunately,  ignored  filter  throughput  considerations)  the 
advantage  of  two  over  one  larger  19  state  unified  filter  is  obtained  from  the  ratio  of 
the  total  number  of  required  operations  (Ref,  36)  as 


or  a  26%  reduction  in  the  total  number  of  operations  to  be  performed  during  each  filter 
cycle  even  though  the  INS  gyro  drift-rate  states  are  modeled  twice.  Unfortunately,  a 
slight  2%  increase  in  required  computer  memory  allotment  is  indicated  by 


1.02; 


however,  the  large  benefit  appears  to  be  well  worth  the  slight  penalty. 


The  case  favoring  two  separate  filters  is  even  more  pronounced  when  considering  an 
alternate  state  selection  (Ref.  83)  corresponding  to  two  filters  of  state  size  12  and  18 
versus  a  single  22  state  filter  since  calculations  of  the  above  form  indicate  savings  to 
be  achieved  in  both  the  number  of  operations  (equivalent  to  algorithm  cycle  time'  of 
processing  a  filter  measurement)  and  computer  memory  required  as,  respectively,  301 
and  31. 


If  two  separate  digital  processors  are  used,  parallel  processing  of  each  of  the  two 
filters  on  different  machines  provides  the  advantage  that  the  system  is  only  limited  by 
the  slower  speed  of  the  single  larger  filter  (of  *  15  or  18  states) .  In  comparison,  the 
smaller  filter  of  12  states  oan  proceed  through  six  (6)  Kalman  filter  measurement  proc¬ 
essing  cycles  in  the  same  time  that  a  larger  unified  22  state  filter  oould  complete  only 
one  cycle,  as  indicated  by  the  following  ratios; 
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6.16 


The  conclusion  is  that  a  unified  single  filter  will  limit  processing  throughput  and 
hinder  full  utilization  of  the  GPS  measurements  available  in  an  un jammed  environment. 

Another  likely  navigation  application  for  use  of  decentralised  filtering  is  in  the 
navigation  room  of  strategic  submarines  (as  discussed  in  detail  on  p.  326  of  Ref.  35)  to 
avoid  unnecessarily  redundant  state  modeling  in  the  three  filters  currently  used  for 
SINS/ESGN  navigation  in  Trident  SSBNs.  An  important  SSBN  application  of  another  varia¬ 
tion  of  standard  Kalman  filtering  with  optimized  scheduling  of  alternative  sensor  meas¬ 
urements  is  reported  in  Ref.  49.  The  above  enumerated  list  of  candidate  application 
areas  for  decentralized  filtering  is  representative  and  nonexhaustive . 


APPENDIX  As  CORRECTING  PAST  STABILITY  METHODS  PRIOR  TO  GENERALIZATION 
FOR  ESTABLISHING  STABILITY  OF  DECENTRALIZED  FILTERS 

A. 1  Status  Review  and  a  Counterexample 

An  interesting  aspect  of  Kalman  filtering  is  the  so-called  "robustness"  or  eventual 
correct  time  evolution  of  the  solution  of  the  Riccati  equation  despite  an  incorrectly 
specified  initial  value  P^  (as  long  as  it  is  positive  definite) .  This  aspect  is  partic¬ 
ularly  useful  in  applications  where  the  situation  of  ignorance  of  initial  uncertainty  in 
the  states  of  the  filter  model  can  (and  does)  occur,  but  without  deleterious  consequences 
as  a  result  of  precisely  this  robustness  property.  Kalman  (Ref.  53)  gave  the  first  proof 
of  this  robustness  result  under  the  strong  hypotheses  of  uniform  complete  observability 
and  uniform  complete  controllability  to  provide  the  strong  conclusion  that  the  correct 
estimates  are  homed  in  upon  at  an  exponential  rate.  B.  D.  0.  Anderson  (Ref.  16)  provided 
a  weaker  conclusion  of  only  asymptotic  stability  (rather  than  stability  at  an  exponential 
rate)  due  to  weaker  but  more  frequently  meet  hypotheses  that  remove  the  explicit  require¬ 
ment  of  uniform  complete  controllability  by  requiring  only  that  condition  3'  (cf.  Eqs. 
4.2-11,  -12): 

r  kl-1  t  i 

^PQ  +  t  $(k0,k+l)  Q(k+1)4  (ko,k+l)j  is  nonsingular  for  some  k^>kQ  (A. 1-1) 

where  Pq  is  the  covariance  of  the  Gaussian  initial  condition  xQ,  Q  is  the  process  noise 

covariance,  and  $  is  the  discrete-time  transition  matrix  of  the  linear  system  model.  It 
is  emphasized  on  p.  223  of  Ref.  58  that  the  condition  of  Eq.  A. 1-1  is  satisfied  if  PQ  is 

nonsingular  (as  can  be  selected)  "irrespective  of  $(k+l,k) ,Q(k) ,"  or  even  explicit  con¬ 
trollability  being  achievable  (an  example  being  provided  on  pp.  137-8  of  Ref.  16  of  an 
estimator  that  is  asymptotically  stable  despite  the  absence  of  process  noise) .  Ander¬ 
son's  stability  proofs  for  both  continuous -time  and  discrete-time  utilize  slightly  modi¬ 
fied  forms  of  standard  arguments  for  Lyapunov  functions  V(x,t) .  In  the  case  of  contin¬ 
uous  time,  Anderson  utilizes  the  following  inequality  (unnumbered  equation  following  Eq. 
23  of  Ref.  16) 


•  T  T  —  1 

V(x,t) <-KiH1R  aHx 


(A.l-2a) 
(A. l-2b) 


where  R  is  the  measurement  noise  covariance,  H  is  the  observation  matrix,  and  x  is  the 
state.  Jazwinski's  result  (p.  241,  unnumbered  equations  following  Eq.  7.198  of  Ref.  51) 
(as  credited  to  Deyst  and  Price  as  Ref.  40,  but  now  known  to  be  in  error  as  discussed  in 
the  preceding  paragraph)  of 

V(x(k)  ,k)-V(x(k-l)  ,k-l)<-xT(k)HT(k)R‘l(k)H(k)x(k)-UT(k)p"1(k|k-l)0(k)  (A.l-3) 


U('k)^|p(k|k)p"l(k|k-l)-lj  4(k,k-l)x(k-l) 


(A. 1-4) 


is  also  utilized  in  Ref.  16  for  the  discrete-time  case,  but  Anderson  goes  further  since 


-xT(k)HT(k)R‘1(k)H(k)x(k)-UT(k)P"1(k|k)U(k)<-xT(k-)HTR*lik)H(k)x(k)  (A.  1-5) 


to  use  only 


V(x(k),k)-V(x(k-1) ,k-l)<-xT(k)KT(k)R'Hk)H(k)x(k) 


(A. 1-6) 


under  the  conditions  that 


(A. 1-7) 


H(k)  and  R"A(k)  srs  bounded  for  all  k 


(A. 1-8) 


(where  the  boundedness  condition  on  H(k)  is  stated  on  p.  141  of  Ref.  16  to  be  a  new  re¬ 
quirement}.  Unfortunately,  while  the  inequality  of  Bq.  A. 1-6  is  true,  the  inequalities 
of  both  Eqs.  A.l-2b  and  A. 1-7  are  questionable  since  for 


R$[o  l]  H$[o  o]  X$[o  l|T 


(A. 1-9) 


it  happens  that 


xtHTr“1Hx  »  o|o 


(A. 1-10) 


thus  contradicting  both  Eqs.  A.l-2b  and  A. 1-7  that  are  utilized  in  Ref.  16.  While  an 
additional  condition  of  requiring  that  H(k)  also  be  of  full  rank  will  circumvent  these 


difficulties,  the  discrete-rime  case  suxi  _ _ 

40.  Anderson's  invalid  stability  proof  has  also  been  utilized  by  Sanders  et  al.  (Refs. 
10  and  11)  in  demonstrating  asymptotic  stability  of  decentralized  filters  of  the  Surely 
Locally  Unbiased  (SLU)  class  so  Refs.  10  and  11  unknowingly  inherited  a  flaw.  However, 
the  results  can  be  patched  up  as  done  in  Section  A. 2  by  using  the  correction  supplied  by 
McGarty  (Ref.  18)  and  Deyst  (Ref.  41).  Aasnaes  and  Kailath  (Ref.  52)  weakened  the  re¬ 
quired  assumptions  to  establish  stability  of  the  centralized  filter  even  further  and 

proved  convergence  at  a  t  rate  (for  some  k  >  1) . 

A. 2  A  Minor  Correction  to  Update  Anderson's  Generalization 


Following  McGarty* s  rigorous  derivation  of  Eq,  C.127  of  Ref.  18  (analogous  to  Eq.  57 
in  Ref,  17  and  the  unnumbered  equation  prior  to  Eq.  7.199  in  Ref.  52),  the  following  is 
obtained 

V[x(k|k-l),k]  <  xT(k-l|k-2)P~1(k-l|k-l)x(k-l|k-2) 


-  xT(k| k-1)  HT(k)  R"1  (k)  H  (k)x  (kj  k-1) 


-  uT(k) [#<k, k-1) P (k-1 |k-l) $T(k, k-1) +Q(k) )-1u(k) 


(A. 2-1) 


where  u(k)  is  appropriately  defined  in  Eq.  C.118  of  Ref.  18  (analogous  to  Eq.  7.197  of 
Ref.  51  and  Eq.  56  of  Ref.  17) 


u(k)  4  (p(k|k)P”1(kik-l)  -  i)«  (k,k-l)x(k-l|k-2) 


(A. 2-2) 


Since  the  right-hand  side  of  Eq.  A. 2-1  is  upper  bounded  all  the  more  if  fewer  nonnegative 
quantities  are  subtracted  from  it,  Eq.  A. 2-1  is  altered  (as  outlined  for  discrete-time  on 
p.  143  of  Ref.  16)  to  yield 

Vtx(klk-l)  ,k)  <  xT(k-l|k-2)P’1  (k-1  |k-l) x(k-l  |k-2) 


xT(klk-l)HT(k)R~1 (k)H(k)xtklk-l) 


(A. 2-3) 


or,  equivalently, 


v(x(kl k-l) ,k]  -  xT(k-l|k-2)P”l(k-l|k-l)x(k-l|k-2> 


«  V (x(k| k-1) ,k)  -  V[x(k-llk-2) ,k-l) 


(A.2-4a) 


<  -xT(klk-l)HT(k)R"1(k)H(k)x(k|k-l) 


(A.2-4b) 


<  8  (A.2-40) 

where  as  offered  as  one  correction  following  Eqs.  A. 1-9  and  A. 1-10 

H(k)  must  bo  of  full  rank  (A, 2-5) 

to  guarantee  the  requisite  strict  inequality  noeded  in  Eq.  A.2-4c  as  a  verification  of 
condition  (e)  as  Eq.  4.2-11  (without  utilizing  the  strong  condition  of  Eq.  4.2-11  but 
letting  only  the  weaker  more  generally  met  condition  3'  of  Eq.  A. 1-1  suffice). 

Throughout  Eqs.  A. 2-1  to  A. 2-4  a  tacit  assumption  is  made  that  p’1(k|k-l)  exists 
(i.e.,  P (k | k-1)  is  nonsingular)  otherwise  it  could  not  be  routinely  utilized  in  the 
above.  The  pertinence  of  Anderson's  weaker  condition  3'  (Eq.  A. 2-1)  first  proposed 
in  Ref.  16  now  becomes  evident  since  a  discrete-time  version  (i.e.,  paralleling)  his 
aontinuoua-time  proof  of  Lemma  3.1,  p.  139  of  Ref.  16  reveals  that  P(k|k-1)  that  evolves 
in  time  from  Eqs.  4.2-3,  4.2-4,  and  4.2-5  and  the  matrix  of  Eq.  A. 1-1  have  an  identloal 
null apace.  This  means  that  if  singularity  occurs,  it  is  simultaneous.  Hence  by  a  proper 
selection  of  PQ  to  be  nonaingular,  both  Eq.  A. 1-1  and  P(k|k-1)  are  guaranteed  to  be  non¬ 
singular,  and  therefore  invertible  as  needed  to  demonstrate  asymptotic  stability. 
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A  broad  overview  of  various  discrete-time  linear  filtering  techniques  including  the  Square-Root  and 
variations  of  Square-Root,  Factorized,  Chandrasekhar ,  Partitioning  and  Decentralized  algorithms  as  well  as 
the  basic  Covariance  and  Information  filters  are  presented.  The  purpose  of  this  chapter  is  to  examine  and 
compare  computer  burdens  of  these  well-known  filtering  algorithms  from  practical  operation  point  of  view. 


1.0  INTRODUCTION 

The  entreating  problem  of  estimation  {prediction,  filtering,  smoothing)  has  attracted  many  mathemati¬ 
cians,  scientists  and  engineers  throughout  history  -  beginning  with  the  Babylonians  who  had  applied  a  form 
of  mathematics  that  was  similar  to  the  Fourier  series  to  Interpret  observations  and  make  decisions.  Recent 
history  has  shown  that  Euler,  Lagrange,  Laplace,  Bernoulli  and  others  had  contributed  to  the  advancement  of 
estimation  theories  In  many  ways.  However,  the  solution  to  a  clast  of  estimation  problems  in  finding  the 
beat  value  of  an  unknown  parameter  corrupted  by  noise  (additive)  was  given  by  Gauss  and  Legendre,  who  had 
separately  but  concurrently  formulated  the  method  of  Least-Squares,  In  which  the  fundamental  concepts  of 
redundant  measurement  data,  observability,  dynamic  modelling,  etc.  were  introduced.  These  concepts  hsve 
provided  foundstlons  for  many  new  develops^nts  in  estimation  theories  known  today.  Readers  who  are  inter¬ 
ested  in  antiquity  should  consult  Neugebauer  (1}  and  Sorenson  (2). 

Modern  history  has  rccordtd  that  Fisher  (I]  was  one  of  the  original  investigators  in  the  formulation 
of  statistical  estimation  method*.  However,  the  development  of  the  linear,  minimum,  mean-aquare-error 
estimator,  which  ia  swat  familiar  to  engineers  today,  had  not  been  completed  until  the  I9e0‘*  when  the 
Wlener-Hopf  equation  was  first  established  by  Wiener  [4]  and  Kolmogorov  (5).  Unfortunately  the  filter 
design  using  the  Wlener-Hopf  equation  was  proved  to  be  impractical  due  to  difficulties  in  obtaining  ex¬ 
plicit  solutions.  Swerling  (6)  at  well  as  Carlton  and  FoUlo  (?)  circumvented  these  difficulties  by 
obtaining  the  Least-Square  estimates  in  a  recursive  manner.  This  new  development  is  believed  to  be  the 
starting  point  of  the  so-called  "Kalman  Filtering".  The  main  contribution  of  Kalman  and  Buey  (8,4)  waa 
the  transformation  of  the  Wlener-Hopf  integral  equation  into  an  equivalent  nonlinear  differential  equation; 
the  solution  of  which  yield*  the  covarlcnce  matrix.  shlch  contain*  ail  neceasary  Information  for  the  design 
of  the  optimal  filter.  Hence,  by  demanding  a  numerical  rather  than  an  analytical  solution  of  the  Wlcoer- 
Kopf  equation,  thay  have  successfully  developed  a  recursive  filter  that  can  be  conveniently  realised  in 
real-time  by  a  digital  computer.  In  practical  application,  the  on-line  computer  used  to  implement  the 
filter  equations  is  usually  limited  in  speed  and  memory.  Frequently,  it  is  not  possible  to  program  the 
ideal  (theoretical  filter)  equations,  tine*  the  truly  optimal  fitter  swat  model  *_lj  error  sources  of  the 
system.  The  solution  to  this  problem  (excessively  targe  dimension)  is  usually  to  design  suboptimal  filter*, 
in  which  certtin  state*  in  the  system  have  to  be  ignored  or  simplified.  Another  implementation  problem 
associated  with  ail  digital  computers  is  the  numerical  accuracy,  which  ta  seriously  effected  by  the  inherent 
finite  nature  of  the  computer.  For  example,  in  the  case  of  the  Kalmsa-Bucy  algorithm,  the  covariance 
matrix  contains  all  Information  needed  for  the  filter  synthesis  (hence,  it  la  called  the  Covariance  Witter). 
For  thia  reason,  it  is  critical  chat  the  eorrectneaa  of  this  matrix  swat  be  preserved  at  alt  Hews.  How¬ 
ever,  because  of  matrix  subtraction*  Involved  in  each  computing  cycle,  ncm-poaltlve  element*  in  the  covar¬ 
iance  matrix  could  result  from  truncation  and  rrund-off  error? .  Chin  (10)  baa  reported  that  this  computa- 
t local  Instability  could  lead  to  filter  divergence. 

In  order  to  preserve  the  correctness  of  the  covariance  matrix  (hence  preventing  filter  divergence) ,  * 
number  of  methods  catted  "Square  Root  Covariance  Algorithms"  have  been  Introduced  (11-20).  In  a  parallel 
effort  to  preserve  the  correctness  of  the  information  matrix  (inverse  of  covariance  matrix)  in  the  "Infor¬ 
mation  Filter",  the  Square-Root  Information  Fitter  have  alto  been  developed  (21-23).  Will*  these  algorithms 
generally  yield  improved  masericat  stability,  the  number  of  computational  steps  la  usually  greater  than  that 
of  the  standard  Kalman  ftltar.  For  this  reason,  engineer*  do  have  a  certain  degree  of  reluctance  to  employ 
the  Square-Root  algorithms.  An  attestpt  to  Improve  this  situation  was  made  by  tiarman  (24)  who  ha*  employed 
factorisation  techniques  (9$ ,26),  which  do  not  Involve  square  root  computations. 

In  the  early  I970's,  a  different  approach  was  explored  for  the  dealga  of  recursive  filters  that  avoids 
the  utilisation  of  the  matrix  Rlccatl  equation.  Kallath  (27)  Hia  shown  that  for  a  certain  clasa  of  special 
applications,  the  so-called  X-T  function  of  Ctundraaekhar  could  be  used  to  gain  two  magnitudes  of  computa¬ 
tional  iaprovement  over  the  usual  Riccatl-type  algorithms.  Independently,  Ltnquiat  (28.29)  and  Lalnlotia 
( 30]  have  derived  Chandrasekhar  algorithms  for  discrete-time* Invariant  and  coot inuout- time -varying  systems, 
respectively.  However,  there  it  no  guarantee  thet  these  algorithm*  wilt  provide  computational  stability. 
Other  Intrinsic  aapaeta  of  Chandrasekhar  filters  war*  discussed  by  Crammer  (31). 

Recent  advanceamnt  in  the  design  of  practical  filter  and  smoother  was  to  seek  entirely  new  algorithm* 
that  not  only  provldt  computational  stability  but  also  minimize  computer  burden*.  This  nm*  approach  has 
bean  taken  by  Lalniotla  (32*38)  who  introduced  partitioning  estimation  concept*  winch  ar*  radically  different 
from  those  of  the  Kalman  footing.  The  partitioning  approach  constitutes  essentially  an  adaptive  framework, 
which  yield*  fundamentally  new  estimation  algorithm*  ta  naturally  decoupled,  paratl* l -processing  realisation, 
and  moat  important  of  all,  computationally  attractive. 

Another  new  attempt  to  economize  computational  step*  aa  well  at  to  anaura  filter  stability  la  the 
Surely  Locally  Unbiased  (SLU)  decentralized  approach.  Th*  basic  concept  la  to  decompose  a  Urge  system 
into  small  subsystem*  In  such  *  manner  that  th*  ftltar  for  th*  over*  1 1  system  shall  be  opttmUed.  This _ 


•**4. 


efficient  then  the  computation  of  one  aggregated  high  order  system.  Also  the  decentralized  fiiter  has  a 
special  property  that  provider  asymptotic  stability. 

The  purpose  of  this  paper  Is  to  examine,  from  computational  aspects,  advantages  and  disadvantages  of 
various  discrete-time  linear  filtering  algorithms  mentioned  above.  In  order  to  facilitate  the  discussion, 
a  linear  system  model  as  well  as  notations  and  conventions  are  defined  in  Section  2.  Prior  to  the  inves¬ 
tigation  of  computer  burdens,  an  overview  of  filtering  algorithms  selected  for  this  study  is  presented  in 
Section  3.  The  main  contribution  of  this  paper  -  a  survey  of  filtering  computational  efficiencies  - 
is  given  in  Section  4,  in  which  computer  time  and  memory  requirements  for  the  Kalman  (covariance  and  infor¬ 
mation  filters),  Square-aoot,  Factorized,  Chandrasekhar,  Partitioning  and  Decentralized  algorithms  „ra 
summarized.  Finally,  conclusion*  aa  well  aa  recommendation  arc  given  in  Section  5* 


2.0  SYSTEM  MODEL,  NOTATIONS  AND  CONVENTIONS 


The  linear  filtoring  problem  treated  in  thia  paper  ia  being  stated  using  the  following  discrete-time 
model: 


x(k+l)  -  4  (k+l,k)  x(k)  +  u(k>  (1) 

z(k)  -  H(k)  x(k)  +  v(k)  (2) 


For  a  given  aec  of  measurements: 

\  •  t(i)»  t(itl) . z(k) 

it  ia  deaired  to  find  the  optimal  (in  the  sense  of  nean-squere-error)  filtered  estimate  *(k/k , I)  of  x(k) . 
It  ie  eaaumed  that  u(k)  and  v(k)  are  uncorrelated  zero-mean  Gaussian  noise  with  covariances  Q(k)  and  R(k) , 
respectively.  Other  notations  and  conventions  are  given  as  follows: 


SYMBOL 


pjgamisn 


x(k) 

«<$ 

*<*) 

3(k/k.l) 

«n(k/k.l) 

S^k/k.f) 

XjOO 

Sj(kel/k) 

*<t> 

*<t> 

v<k) 

Kknl.k) 

a(k) 

at(k) 

%<k> 

«„<h) 

hT(k) 

«(k) 

it(k) 

«,(h) 

Vk) 

r<0 

fttco.tj 

u(k) 

«j(k) 

<}0<} 

QjOO 

*00 

wt(k) 

•(h) 


system  state  vector  at  discrete  time  index  k 

initial  system  state  vector  (Gaussian  distributed) 

the  mean  value  of  x(f) 

nomlna 1  Initial  system  state  vector 

remainder  initial  system  state  vector 

optimal  filtered  estimate  of  x(k) 

nomine  1  optiMl  filtered  eetlMtc  of  x(k) 

remainder  optiMl  filtered  estimate  of  x(k) 

tth  subsystem  state  vector 

optiMl  stete  estimate  of  the  l*h  subsystem 

system  state  vector  (cunt  inuous-t  Urn?) 

optiMl  filtered  estlMte  of  x(t) 

innovation  sequence 

system  stete  transit  ion  Mtrlx 

measurement  vector  at  discrete  time  index  k 

Ith  subsystem  measurement  vector 

measurement  vectors  of  the  “Cats  equation” 

measurement  Vectors  of  the  "data  equation’* 

{th  subsystem  state  transition  Mtrlx 

Unearitad  measurement  operator  (vector) 

oeasurament  matrix 

Ith  subsystem  meaaurement  matrix 

coupling  measurement  matrix  bit,  1*1  aod  other  subsystems 

linear l s*d  susurtmot  Mtrlx 

fundamental  Mtrlx 

Uneariced  fundamental  Mtrlx 

system  noise  vector 

l1**  subsystem  nolee  vector 

system  noise  covariance  matrix 

square  root  of  Q(k) 

Ith  subsystem  noise  covariance  Mtrlx 
meaaurement  noise  vector 
ith  lubiystcm  maasuremsnt  noise  vector 
meeeurement  noise  covariance  matrix 


Updating 

*(lwl/k+l,«  -  a(k4l/k,i)  4  K(k+1,4>  [z(k4l)-H(k4l)f(k4l/k,  X)  ]  (5) 

r  t  *l 

K(k+l,i)  -  P(k4l/k,i>  H*(fc4l)  [H(k4l)P(k4l/k,0  H*(k4l)  4  R(k4l)]  (6) 

P(k4l/k4l,i)  -  [I-K(k4t,f>  H(k-tl)]  P(k4t/k.i>  (7) 

Note  that  Eq .  (7)  la  correct  only  If  the  gain  K(k+1, 0  la  op. latum. 

2.  Stabilized  Kalman  Fitter 

The  stabilized  filter  (* octet imes  called  the  Joseph  algorithm  [52])  la  leas  sensitive  to  com¬ 
puter  round-off  errors.  Another  benefit  is  that  it  yields  correct  P(k4l/k4l,f)  even  if  K(k4l,i)  Is  non- 
optimum.  The  updating  covariance  matrix  la  given  by 

P(k4  l/k4 1 1  ©  -  [I-K(k4l,f>  8(k4l)  ]  P(k4l/k,f)  [X-Kfkt!.,©  H(k4l))T  4  K(lc4l,©  R(k4l)  KT(k4l..©  (8) 

Other  updating  and  %xt?apoiatlen  equations  are  the  same  as  Bqs.  (3)  -  (6).  initial  conditions 
for  the  standard  <*s  veil  as  the  irsHlited  filters  are  defined  as  follova: 

e  f*<4> )  -  i  <f/A  (9) 

£  {(*(©  -  x(f/f)J  (*<4>  -  5a/X>]T  }  -  P(i,©  (10) 

3-  Extended  Kalman  Filter 

The  “extended”  Kaicun  filtering  la  a  popular  technique  for  treating  noolineirltlee  In  the 
design  of  minimum  variance  estimator#.  Other  method*  of  the  same  footing  (Taylor  series  expansion)  are 
Iterated  Extended  Kstman  filtering,  Gaussian  Second-Order  filtering,  and  Linearized  Kalman  filtering  (M>]. 

Since  most  physical  nonlinear  systems  een  be  represented  by  differential  equations  and  measure¬ 
ments  sre  usually  available  at  discrete  ttme,  therefore,  it  Is  proper  as  veil  aa  convenient  (for  aeriee 


expansion)  to  describe  system  end  measurement  models  as  follows: 

x(t)  *  f(x(t),t)  4  u(t>  (U) 

Id)  *  hf«e(*v)J  *  v<k)  (12) 

la  which  u(c)  and  v(k)  are  uncorr elated  sero-meso  Causa Ian  noise  with 

8(u(t)  uT(t)|  •  Q(t>  (13) 

E{v(k)  vT(k)!  •  *<*>  ;  (li) 


also  the  Initial  vector  x(fi  1«  Cause lan  with  mass  and  covariance  given  by  eq.  (9)  and  Eq.  (10). 


Helloes 


P(4(t),tJ  * 

*(t)  -  «(t) 

(15) 

!M»t«(t  )) 

V*>  * 

«(tk)  -  t(th) 

(16) 

extrapolation  equations  are  given  by: 

*  f{K(t),t)S 

(in 

-  r(S(t).t)  P(t)  e  Kt)  rT(S(t).tJ  4  Q(t) 

(U) 

Update  equations  are  given  by: 

«(k<l/kel,«  -  t(k»  1/k,  0  4  K(H1.« 

ftilNl)  -  b{l(Mt/k.©l) 

<!*) 

Hksl/kst.O  -  (I-K(kal)  U^(kel))  P(kat/k,« 

(20) 

K(kal,0  «  P(ku/k,4  H^fkel)  (Ml(k4l)  P(k*t/k,  C  «£(kel)  ♦  *(kal)J 

(21) 

Other  forma  of  variation  (Cants i so  Second -Order ,  ate.)  are  given  In  reference  (14). 

»•  Swata  Soot  Covet  lent*  FUttn 
l.  totter 

The  flrat  square  root  algorithm  was  introduced  by  Potter  (llj  for  a  restricted  application  of 
taro  system  solas  and  measurements  are  scalar  muann«.  «■»  r^.  m  »■<  ff  a..—.  ■  ■  — 


SYMBOL 


■ 


i1*1  subsystem  measurement  noise  covariance  matrix 


scaler  measurement  noise  variance 


a  priori  state  error  covariance  matrix 
a  priori  state  error  covariance  matrix 
nowlna 1  a  priori  state  error  covariance  matrix 
remainder  a  priori  state  error  covariance  matrix 


P(k/k,  6 


state  error  covariance  matrix 


?n(k/k,j0 


yk/k,/) 


Pt(k/k) 


nominal  slate  error  covariance  matrix 
remainder  state  error  covariance  matrix 
ith  subsystem  state  error  covariance  matrix 


P'l(k/k,l) 


information  matrix 


S~  l(k/k,  t) 


Rx(k/k,f> 


R  (k+l) 
.  ux 

V(k) 


square  root  of  information  matrix 
square  root  of  information  matrix 
square  root  of  covariance  matrix 

square  root  of  cross-covariance  between  variables  of  "data  equations" 
lower-triangular  square  root  of  measurement  noise  covariance  matrix 
lower-triangular  square  root  of  systems  noise  covariance  matrix 


Z  -  STH 


l}(k/k) 
T,$(k)  & 
T(k+1) 

D(k) .  D(k) 


anti-aymmetric  matrix  chosen  to  maintain  S(k,k)  in  lower  triangular  form 
tranafotmaf Ion  matlix  whose  columns  are  composed  of  eigenvectors 


of  ehs  covariance  matrix  P 


diagonal  matrix  whose  diagonal  elements  are  eigenvalues  of  the 
state  error  covariance  matrix 


U(k)  ,  U(k) 
d(k/k) 

K(k) 

yk.o 


unit  upper  triangular  matrix 

transformed  state  vector.  d(k/k)  «  ?  l(k/k,f)  x(k/'k) 
Kaltwn  filter  gain  matrix 
nominal  Kalman  filter  gain  matrix 
tth  subsystem  filter  gain  matrix 


coupling  matrix  between  i and  subsystems 


0  (k,  f) 
n 


observation  matrix 


b(k/k> 

*(k> 


v«. 


transformed  optimum  state  vector 

residual  error  of  least-square*  fit 

dimensions  of  x*data  equation  and  u-dsta  equation 


Convent  Ions’- 

•T 

a.  Superscript  T  la  used  to  denote  transpose  of  vector*  and  Matrices;  { • 1  is  used  to  denote  the 
transpose  of  the  Inverse  of  a  matrix. 

b.  Matrices  are  denoted  by  upper  ease  letters.  Vectors  ere  denoted  by  lower  case  letters  except 
for  k,  *,  m  tod  n  which  ere  Intfger*  and  t  denotes  time. 

c .  Vectors  are  aeeumed  to  be  columns  unless  otherwise  denoted  by  superscript  T. 


4.  Units*  otherwise  specified,  the  dimension  of  the  stete  vecror  is  n  and  the  dimension  of  the 
measurement  vector  la  m,  n  »  m. 


3.0  AicttMitM  descJumohs 


A,  Covet  lanes  flluta 


Tbs  "atstkdsrd"  form  or  the  Kalman  filttr  refers  to  the  estimator  first  given  by  Kalman  (8). 
from  which  the  discrete  optimal  filter  wee  derived  end  subsequently  documented  In  many  books  [k;*51]. 
The  filter  algorithm  ta  usually  given  In  two  seta  of  equations  -  one  for  extrapolation .  tha  other  one 
for  sgdatlngf 


tfkclA.f)  -  *0wl»  <(k/k.< 

Kk»l/k,$  -  Kk»  P(k/k, 0  *T<k,k)  *  Q(k) 


Updating 

4(k+l/k+l,iQ  -  «(k+l/k,^>  4  K(k+1,^)  (*(k+l)  -H(k4l)S(k4l/k,  tj  ]  (5) 

K(k+1 , t)  -  P(k4l/k,l)  HT(fc+l)  (H(k+l)P(k+t/k, Q  HT(k+l)  +  R(k+1)]  (6) 

P(k+l/k+l,4>  -  [I-K(k+1,  t)  H(k4 1) ]  P(k4l/k,X)  (7) 

Note  that  Eq.  (7)  ia  correct  only  if  the  gain  K(k4l,f)  ia  optiaxan. 

2.  Stabilised  Kalman  Filter 

The  stabilized  filter  (aoaetimes  called  the  Joaeph  algorithm  [52])  ia  less  aenaitive  to  com¬ 
puter  round-off  errors.  Another  benefit  ia  that  it  yields  correct  P<k4l/k+l,f)  even  if  K(k4l,f)  ia  non- 
optimum.  The  updating  covariance  matrix  is  given  by 

P(k4l/k4l,4)  -  [  I-K(k4 1 ,  jO  H(k4i)  ]  P(k4l/k,i)  (I-K(k4l,l)  H(k4l)]T  4  K(k4l,J)  R(k4l)  KT(k4l,f)  (8) 

Other  updating  and  extrapolation  equations  are  the  same  as  Eqs.  (3)  -  (6).  Initial  conditions 
for  the  standard  as  well  as  the  stabilized  filters  are  defined  as  follows: 


E  fx(f)}  •  i  (Ut)  (9) 

E  {[*</>  -  x(f/i>]  [*(X>  -  HUD)X  )  -  P(i,jQ  (10) 

3.  Extended  Kslnen  Filter 


The  "extended"  Kalman  filtering  is  a  popular  technique  for  treating  roa linearities  in  the 
design  of  mlnimtss  variance  estimators.  Other  methods  of  the  same  footing  (Tailor  series  expansion)  are 
Iterated  Extended  Kalman  filtering,  Gaussi  «  Seconc-Order  filtering,  and.  Unearited  Telman  filtering  [h6]. 

Since  most  physical  non' inear  systems  e«r.  be  represented  by  differential  equations  and  measure¬ 
ments  are  usually  available  at  discrete  time,  therefore,  it  is  proper  as  well  es  convenient  (for  series 
expansion)  to  describe  system  end  aeaoureme.it  models  aa  follows: 


X(t)  «  f(x(t),t]  4  U(t) 
s(k)  »  h(x(tk)]  4  v(k) 

in  which  u(t)  end  v(k)  are  uncorrelated  zero-mean  Cause j an  noias  with 
B[u(t)  uT(t>]  •  Q(t) 

E(v(k)  vT(V))  «  R(k)  ; 

alao  the  Initial  vector  x(i>  la  Gaussian  with  mean  and  eovarlance  gives  Ky  Eq.  (9)  and  Eq.  (10). 
Define: 


F(*(t).t]  - 


Sh(r(tu)) 

“t<k)  -  T^r 


x(t)  «  P(t) 


x(tk)  -  *(tk) 


Extrapolation  equations  ere  given  by: 

-  ?tt(t),t)] 

-  r[4(t),t]  P(t)  ♦  P(t)  PT[S(i),tJ  4  Q(t) 

Update  equation*  are  given  by: 

t(k4l/k4l,  f)  «*(k4l/k,f)  4  K(k4l,$  f*(k4 1)  -  h(S(k4l/k,«]) 
Kkel/k*!,#  -  [i-K(kel)  (^(kal)]  P(kel/k,< 

X(kel,4  •  Kkel/k.f)  ^(k+l)  (H^kel)  P(k+t/k,$  t£(kel)  +  g(kel)] 


ill) 

Oa> 

(U) 

OM 


05) 

(16) 

on 

08) 

09) 

(20) 

(8l> 


Other  forma  of  variation  (Caueslan  Second-Order .  etc.)  are  given  in  reference  (86]. 
tew#  toot  covariance  PUtete 
1.  Pott  or 

The  fitat  square  root  algorithm  was  introduced  by  Potter  [11]  for  a  reatrteted  application  of 
taro  ayetam  uolse  and  measurements  ere  scalar  quantitlea,  i.e.,  Eqa.  (1)  end  (8)  become 


x(k+l)  -  $(k+ I , k)  x(k)  (23) 

z(k)  -  hT(k)  x(k)  +  v(k)  (23) 

where  h(k)  la  a  vector  and  the  variance  of  the  measurement  noise  r(k)  is  a  scalar  value.  This  method,  as 
well  as  other  methods  discussed  in  the  sequel,  consists  of  defining  a  square  root  matrix  S  such  that 

P(k/k,  £)  £  S(k/k,i)  ST(k/k,f).  (24) 

The  factorisation  of  covariance  square  roots  is  generally  not  unique.  However,  this  lack  of  uniqueness 
is  not  serious  because  a  unique  square  root  factorization  can  always  be  obtained  by  using  the  Cholesky  de¬ 
composition  technique  [53]  (sometimes  referred  to  as  the  Banachiewicz  and  Dwyer  algorithm)  which  factors 
any  positive  semi-definite  symmetric  matrix  into  the  product  of  &  lower  triangular  matrix  and  its  transpose. 
Description  of  this  algorithm  can  be  found  in  reference  [60]  as  well  as  in  many  other  sources. 

Define: 

y(k+1.4>  Q  ST(k+l/k,«  h(k+l).  (25) 

Consider  the  case  *u  whicn  Q(k)  »  0,  then  the  extrapolation  equation  for  S(k+l/k,f)  is  given  by 

S(k+’/k,E)  -  *(k  l.k)  S(k/k,J0  (26) 

fhe  exirapoletion  as  well  ag  .pdating  of  the  state  vector  is  the  same  as  Eq.  (3)  and  Eq.  (5),  respec¬ 
tively.  Other  update  eouat  ons  are  given  below: 

S'k+l/Hl -  S(k-tl.'k,j£)  [1  +  <y(k+l,f)  y(k+l,jOyT(k+l,i)]  (27) 

X(k+1 ,  f)  .  -  SOtfl/k.a  ST(k-tl/k,  f)  h(k+l) _  (28) 

hT(fc+l)  S(k+l/k,  /)  ST(k+l/k,i)  h(k+l)  +  r(k+l) 


in  which  u(fcel.jQ  la  given  by 


0  '  "tf") 


In  Eq.  (29),  tine  index  for  or  ,  y  and  r  is  the  same,  hence  it  is  being  suppressed  for  clarity.  When 
there  la  no  ambiguity,  simplified  notations  such  aa  Eq.  (29)  will  be  used  in  subsequent  discussions. 


This  square  root  filter  is  an  extension  of  the  Potter  algorithm  by  considering  vector  measure- 
•ante  (elmultaneous)  with  correlated  component  errors.  This  method  requires  diagonalization  of  an  n  x  n 
matrix,  t.e.. 


TT(STHTRHS)T 


where  ij,  ig . «n  are  elganvaluae  of  the  covariance  Mtttx  and  T  it  the  transformation  matrix  consisting 

of  eigenvectors  in  its  column*.  The  extrapolation  equation*  are  the  earn*  aa  the  Potter  filter;  the  update 
equation*  were  derived  in  reference  [  12] : 

S<k+l/kel,f)  •  S(kei/k,$  fl  +  T[(l  ♦  D)‘l/8  -  I]  TT  }  (31) 

K<k+l,J)  -  S(k+l/k, 0  8(1  ♦  BTB)  l(Bl/2)  1  (32) 

whar#  _  _  l/2  -l 

I  -  ST(fcU/k,  4)  HT(kei)  .  (33) 

The  extrapolation  and  updating  of  tht  syatam  atata  vector  are  always  the  same  aa  Eq.  (3)  and  Eq.  (5). 

Aa  such,  auhaequaot  discuss Iona  will  b#  concentrated  on  the  covariance  and  tba  gain  matrices. 


nBrniwcn-iiiiiiVwi'E 


lima  Cast) 


(NOTE:  Although  this  papar  ia  mainly  -unearned  with  tha  dlscrsta-tlma  cast,  It  la  fait, 
howavtr ,  that  a  Brief  discussion  of  the  continuous- time  eaaa  should  be  given  hare  for  completeness.) 

Consider  tha  system  modal  given  by 


«(t)  -  P(t)  x(t)  ♦  0(t)  u(t) 


**-0 


s(t)  -  H(t)  x(t)  +  v(t)  (35) 

where  u(t)  and  v(t)  are  zero  mean  non-correlated  white  Gaussian  noise  with  unity  covariances.  Let  ?(t)  be 
the  covariance  of  the  error  of  the  state  estimate,  P(t)  obeys  the  following  Riccatl  equations: 

P(t)  -  F(t)  P(t)  +  P(t)  FT(t)  +  G(t)  GT(t)  -  P(t)  HT(t)  H(t)  P(t)  (36) 

with  Initial  condition: 

P(°)  “  PQ  (37) 

The  Andrews  algorithm  (13)  was  the  first  Square  Root  filter  that  considers  the  existence  of  system 
noise.  Similar  to  the  Bellantoni  and  Dodge  algorithm,  it  also  processes  vector  measurements.  Rowever,  it 
does  not  require  any  matrix  diagonalization.  The  extrapolation  of  the  square  root  covariance  is  given  as: 

S(t)  -  F(k)  S(t)  +  [W(t)  +  1/2  G(t)  Z(t)  GT(t)]  S-T(t)  ,  (38) 


in  which  Q(t)  is  the  covariance  of  the  system  noise  and  W(t)  is  a  skew  symmetric  matrix  that  maintains 
S(t)  in  the  lower  triangular  form. 

T 

It  should  be  pointed  out  that  solution  of  Eq.  (38)  requires  that  the  Inverse  of  S  (t)  be  computed  at 
'each  integration  step  which  requires  a  large  number  of  multiplications.  This  is  undesirable.  Therefore, 
Tapley  and  Choe  [71]  restructured  the  problem  and  let  the  skew  symmetric  matrix,  W(t) ,  be  chosen  in  such  a 


way  that  inversion  of  sT(t)  is  not  needed. 

[S(t)  -  F(t)  S(t)]  ST(t)  -  1/2  Q(t)  +W(t)  (39) 

Furthermore,  in  order  to  maintain  S(t)  in  a  lower  triangular  form,  the  following  definitions 

E(t)  £  F(t)  S(t)  (40) 

C(t)  £  S(t)  -  E(t)  (41) 

W(t)  £  W(t)  +  1/2  Q(t)  (42) 

will  be  used  to  rewrite  Eq.  (39)  as 

C(t)  ST(t)  «W(t)  (43) 

which  is  the  desired  result. 


However,  since  the  skew  symmetric  matrix,  W(t) ,  can  be  chosen  arbitrarily,  Morf,  et  al.  [72]  provided 
another  method  for  computing  the  lower- triangular  S(t)  matrix  that  is  simpler  than  the  method  just  described 


above.  Let  S(t)  be  nonsingular  and  define 

P(t)  £  S(t)  S(t)T  (44) 

Equation  (36)  can  be  written  as: 

P(t)  -  S(t)  ST(t)  +  S(t)  ST(t)  (45) 

P(t)  -  (F  -  1/2  SST  HTH)  SST  +  1/2  GGTS"TST  +  SST(FT  -  1/2  HT  HSST)  +  1/2  SS*1  GGT  (46) 
■1  *T 

Multiplying  Eq.  (45)  on  the  left  by  S  and  on  the  right  by  S  yields 

S'1  S  +  ST  S'T  -  L  (47) 

where  L^FF1  +  GG1  -  H^H  (48) 

F  *  s’1  F  S  (49) 

G  -  S'1  G  (50) 

H  -  H  S  (54 

Since  S  is  lower  triangular,  S  *  S  is  the  lower-triangular  part  of  L,  hence 

S  -  S  L  (52) 

vhare  L  is  the  11  lower -tralngular  part"  operator. 


Since  Eq.  (52)  does  not  Involve  explicit  skew  symmetric  matrix,  it  seems  to  be  simpler  to  compute 
than  the  other  two  square  root  methods  discussed  above. 

4.  Schmidt 

Instead  of  using  Bq.  (34)  to  extrapolate  the  square  root  covariance  matrix,  Schmidt  [14]  in¬ 
troduced  a  method  which  facilitates  digital  computations.  This  algorithm  requires  finding  an  orthogonal 
transformation  matrix  T,  dimension  (n+m)  x  (n+ra) ,  such  that  T^T  ■  I.  (n  Is  the  dimension  of  the  state 
vector  and  m  is  the  dimension  of  the  measurement  vector.) 


Consider  the  expression 


(51) 


£*(k+l,k)  S(k/k,f)  i  [Q(k)]1/2jTTT  ] 

which  can  be  written  as 

*(k+l,k)  S(k/k,f>  ST(k/k,  t)  tT(k+l,k)  +  Q(k) .  (52) 


Expression  (52)  is  the  right  side  of  Eq.  (4) .  Hence,  expression  (51)  must  be  the  left  side  of  Eq.  (4). 
Therefore,  the  following  relationship  is  established  for  the  extrapolation  of  S(k/k,  f) . 


n  f 
®  t 


S^k+l/k^ 

0 


S^k 


(Ql72(k+1)  )T 


)  n 
)  ® 


(53) 


T  T  1/2 

In  order  to  uniquely  express  S  (k+l/k,/)  in  terms  of  S  (k/k,f),  $(k+l,k)  and  Q  (k+1) ,  matrix  T  must 

be  constructed  such  that  Eq.  (53)  will  be  in  triangular  form.  This  can  be  done  by  using  the  Gram-Schmidt 
process  or  the  Householder  transformation. 


Reference  [16]  provides  descriptions  of  the  Gram-Schmidt  and  Householder  transformations.  A  more 
extensive  treatment  of  this  subject  is  found  in  chapter  5  of  reference  (60). 


5.  Carlson 


The  essence  of  Carlson's  technique  is  to  preserve  the  square  root  covariance  matrix  in  tri¬ 
angular  form  during  the  extrapolation  interval  as  well  as  the  update  time.  In  addition,  Carlson  recognised 
that  the  transition  matrix  is  often  block-triangular ,  the  fact  which  can  be  exploited  to  further  reduce 
computation  steps.  To  preserve  S (k+l/k, f)  in  triangular  form  during  extrapolation,  two  methods  are  sug¬ 
gested.  One  is  basically  the  same  as  Eq.  (53),  the  other  is  called  the  Root  Sum  Square  (RSS) ,  which 
computes  the  covariance  matrix  using  Eq.  (4),  then  P(k+l/k,  t)  is  factored  (Cholesky  decomposition)  into 
triangular  square  root  matrices  S(fckl/k,j©  ST(k+l/k,  t) .  In  order  to  make  certain  that  S(k+i/k+i,jQ  is  in 
triangular  form  during  update,  the  Potter  algorithm  is  modified  by  demanding  that 


A 


JUL 


T 

y  y  +  r 


1/2 


be  upper  triangular,  l.e.,  for  scalar  measurements 

P(k+i/k+i,/)  -  P(k+l/k,J&  -  K(kfl)  hT(k+l)  P(k+l/k,l) 


(54) 


(55) 


which  c*n  be  written  4s 


T  T 

P(k+l/k+l,j&  -  S (k+l/k, t)  ST(k+l/k.i)  -  S" 

y  y  +  r 


and  factored  into 


P(k+l/k+l,f)  -  S 


I  - 


...yx 

T 

y  y  +  r 


sT  . 


(56) 


(5?) 


Hence, 


S(k+l/k+l,i)  -  S(kfl/k, t)  A(k+1)  . 


(5®) 


in  which  A(k+1)  must  be  chosen  such  that  S(k+l/k+l ,  f)  is  also  upper  triangular.  A  method  that  can  be  used 
to  select  and  compute  the  A(k+1)  matrix  is  given  in  reference  (20). 


C.  Information  Filters 


The  Covariance  filter  discussed  in  Section  3A  la  the  Kalman-Bucy  filter  in  its  original  form  (the 
filter  equations  are  derived  from  the  covariance  matrix).  The  Information  Filter  discussed  in  this  section 
is  basically  of  the  same  tooting.  However,  the  filter  equations  are  derived  from  the  Inverse  of  the  covar¬ 
iance  matrix  which  is  closely  related  to  the  information  matrix  (reference  (47),  p.  241).  The  motivation 
for  taking  this  approach  is  to  avoid  computation  difficulties  in  the  case  where  the  initial  state  error 
covariance  F(i,4)  Is  unknown  and  assumed  to  be  infinity. 


The  development  of  Information  filter  equations  la  straightforward, 
matrix  Inversion  lesana 

(r+  if  a)  -  f 1  -  r‘l  if(i  +  t  r*1  nT>  t  fl 

to  the  covariance  matrix 

P(k+l/k,$  -  4(k+l,k)  P(k/k,J Q  »T(k+l,k)  x  Q(k) 


Thin  la  done  by  applying  the 

(59) 

(60) 


by  identifying 


T  ■  •  »tT  »  if  -  Q  and  £  ■  I  . 


(61) 
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The  propagation  of  the  Information  matrix  was  shown  [54]  to  be: 

p'^k+l/k.i)  -  F(k)  -  F(k)(F(k)  +  Q-1<k>l  1  F(k)  (62) 

where 

F (k)  4  [*T(k+l,k)f 1  p'l(k/k,f)  f'l(k*l,k).  (63) 

The  update  of  the  Inverse  covariance  matrix  from  P  ''(k/k-l.f)  to  P  1  (k/k ,  Is  given  by 

P'^k/k.f)  -  p'l(k/k-l,0  +  HT(k)  R'L(k)  H(k)  .  (64) 

By  defining  the  state  of  the  information  filter  as 

d(k/k)  4  P'^k/k.X)  x(k/k)  (65) 

d(k+l/k)  £  P'^k+l/k.X)  $(k+l/k).  (66) 

It  can  be  easily  shown  that  the  propagation  of  d(k+l/k)  la 

d(k+l/k)  -  [I  -  p'^k+l/k.U  Q(k)]  »”T(k+l/k)  d(k/k)  (67) 

and  the  update  of  d(k/k)  la 

d(k/k)  -  d(k/k-t)  +  HT(k)  R*l(k)  r(k)  (68) 


It  vlU  be  shown  In  the  next  section  that  the  information  filter  la  more  efficient  than  the  covariance 
filter  aa  far  as  update  la  concerned.  However.  In  propagation,  the  covariance  filter  requires  fewer 
computations. 

Due  to  computational  error,  the  use  of  Eqs.  (62)  and  (64)  nay  lead  to  nonnegative  definiteness  of 
P  (k+l/k , X) .  Once  again,  this  difficulty  can  be  avoided  by  applying  the  square  root  concept,  which  will 
be  discussed  next. 


An  efficient  square  root  solution  to  the  least  square  problem  using  the  Householder  algorithm 
was  demonstrated  by  Golub  [55] ,  Butlnger  and  Golub  [56],  and  Jordan  (57].  Hanson  and  Lawson  (50]  extended 
the  theory  to  include  rank  deficient  systems,  and  adaptad  the  Householder  algorithm  to  solve  sequential 
least  squares  problems.  Dyer  and  McReynolda  developed  the  equere  root  Information  filter  based  on  House* 
holder's  matrix  triangulariaatioo  pvocadure  and  Cox's  (59)  sequential  estimation  algorithm  (dynamic 
programing  formulation). 

Recall  In  the  Square  Root  Covariance  Filter.  It  waa  defined  (Eq.  (24)  ): 

F(k/k,l)  4S(k/k.^  ST(k/k,0. 

For  the  development  of  the  Square  Root  Information  Filter  (SR1F) .  it  is  consistent  to  define: 

F*l(k/k,f)  A  s'V/k.f)  s'l(k/k ,f)  (69) 

b(k)  4  S*l(k/k.f)  8<k/k)  (70) 

The  update  of  the  inverse  covarlsnce  square  root  is  given  by 


a 


where  t  is  the  orthogonal  transformation  matrix  defined  previously.  The  update  of  b(k/k)  is  glvao  by 


[« MSfll  . 

,  f 

(72) 

[•(k)  J 

tV*‘(k)«<k), 

where  e(k)  la  the  residual  error  after  processing  the  measurement.  The  propagation  of  the  Inverse  covert* 
coca  la  given  by  _  » 


given  by  r 

-  *  « 

.  T  _ _ _ . . 9... 

0  :  S*l(kal/k.«  s*‘(k/k.4  *‘Wk);  S*l(k/k.$ 


C(k*l/k)  4  t*<k*0  ♦  tT(kel/k)  l(keWk)]l/®  . 
The  propagation  of  b(k+l/k)  la  given  by 

1  .tLoJ 

_  n{lb(ksl/k)J  I  b(k/k)  I 


where 


(76) 

(77) 


a_1(k)  -  CT(k>  C(k)  +  <fl(k) 

C(k)  -  S‘l(k/k,i)  **1(k/k)  . 

A  different  fora  of  propagating  S  ^(k+l/k.f)  and  b(k+l/k)  la  also  available: 

S'^k+l/k.f)  -  (X  -  [1  +  {a(k)  Q'^k))172  }'X  C(k)  a(k)  CT(k)  )  s’^k/k.i)  |_1(k/k) 

b(k+l/k)  -  (X-a(k)  [1  +  {a(k)  Q'^k)  }1/2  f 1  C(k)  CT(k)  )  b(k/k) 


It  should  be  apparent  from  the  previous  section  that  while  the  Dyer-McReynolds  SRIF  is  attrac¬ 
tive,  it  relies  heavily  upon  the  Householder  transformation  as  well  as  relying  on  the  concept  of  dynamic 
programming,  which  seems  to  be  a  little  too  abstract  and  difficult  to  understand.  For  this  reason,  Bierman 
(22]  introduced  the  recursive  least-square  approach  intended  to  simplify  the  basic  structure  of  SRIF.  In 
essence,  Bierman's  square  root  data  processing  method  utilized  the  so-called  "data  equation"  and  the  sum-of- 
squares  performance  functional  to  develop  equations  that  propagate  the  state  estimate  and  its  error  covar¬ 
iance.  Eqs.  (1)  and  (2)  arc  considered  as  "measurement  equations"  and  Eqs.  (80)  and  (81)  below  are 
considered  to  be  a  priori  "data  equations"  associated  with  Eqs.  (1)  and  (2),  respectively: 

«u(k)  -  Ru(k)  u(k)  +  wu(k)  (80) 

*x(k)  -  Rx(k)  X(k)  +  wx(k)  (81) 

where  wu  and  wx  are  assumed  to  be  zero  mean,  independent  random  processes  with  unity  covariances. 

Define: 

Q(k)  4  Ry(k)  Ru(k) 


*„(•>  £  C<°> 


By  selecting  the  performance  functional  to  be 


J(ktl)  •  ||  Rx(o)  x(o)  -  *x(o) 


(  H  H(i)  x(i)  -  z(l)  |f  +  ||  Ru(l)  u(i>  -  »u(i)  if  )  , 


the  problem  la  to  minimise  J(ktl)  with  respect  to  x(l)  and  u(l)  for  1  •  0,  1,  2,  ...,  k  ,  such  that  the 
solution  yields  the  optimal  estimate  of  x(k), 

Bierman  (60]  has  shown  that  the  following  "Information  Arrays"  contain  all  necessary  Information  needed 
for  state  and  covariance  update  as  well  as  propagation,  the  actual  data  processing  requires  a  transforma¬ 
tion  and  update  (mapping)  given  by  Eq.  (83)  and  Kq.  (8*0  ,  respectively: 


R„(k)  !  s  (k) 

t(k)  .5— 

H(k)  I  s(k) 


0  I  e(k)  }  8x 
H .  1 


R  (k)  :  0  ;  .  (k)  )  H 

T(k*l)  -»-* - ;r - (-1-- — - 

*R,(k)  *  ‘(k+1):  R^k)  «  1(kel) !  y k)  ] 


Ru(ke-1);  Rux(k+1)  I  tu(kel) 
0  !  Rx(fefl)  :  *x(k*l) 


in  which  Nx  and  Nu  are  dimensions  of  the  x(k)  and  u(k) ,  respectively;  e(k)  is  the  error  in  the  least  square 

fit,  and  T(k)  and  T(k+l)  are  products  of  H  elementary  Householder  transformations.  Definitions  of  other 
symbols  are  given  In  Section  2.  u 

the  update  estimate  and  covariance  are 

t(kel)  •  R*l(k+D  sfc(kel>  (85) 

Px(kel)  -  R^(k+1)  R*T(k+l)  (86) 


the  propagation  of  etatt  vector  requirea  solution  of  a(k)  and  8(k+l)  (i.a.,  u(k)  and  x(kel)  form  an  aug¬ 
mented  data  aquation  (  “j|j*  ^  ]  ) . 

^(kfl)  u(k)  ♦  R^Ocel)  x(kel)  -  ejkel)  -  wu(k)  (87) 

Rx(kel)  x(kel)  -  s^kel)  ♦  wx(k)  (88) 
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which  can  be  solved  using  Che  Gaussian  elimination  method. 

E.  Factorised  Filters 

During  the  past  decade,  a  number  of  authors  [25.  26,  60-66]  have  contributed  to  improving  the  Kalman 
filtering  computation  efficiency  by  suggesting  square-root-free  triangular  factorizations.  Essentially, 
this  approach  is  based  on  the  rank  one  modification  of  the  Cholesky  method.  For  example,  Agee  and  Turner 
[61]  have  proved  that  for  a  positive  definite  covariance  matrix  P  such  that  P  *  UDU^,  in  which  U  is  a  unit 
upper  triangular  matrix  and  D  is  a  diagonal  matrix  with  elements  d. ,  d„,  ....  d  .  Where  n  is  the  dimension 
of  P,  there  exists  an  update  ?(k)  matrix  such  that  n 

P(k)  -  U(k)  D(k)  U1  (k)  -  U(k)  D(k)  UT(k)  +  cv(k)  vT(k)  ,  (89) 

wnere  c  is  a  scalar.  v(k)  is  a  vector  of  n-dimension.  If  P(k)  is  positive  definite,  then  U(k)  and  D(k) 
can  be  computed  as  follows: 

For  j  »  n,  n-1,  ....  2,  recursively  compute  d^(k)  and  u^k),  which  are  elements  of  D(k)  and  U(k): 

d^k+l)  -  d^k)  +  Cj  v®(k)  (90) 


vt(k)  ♦-  vt(k)  -  Vj(k)  u  (k) 

utj(k+l)  »  u^(k)  4  Cj  Vj(k)  vt(k)/dj(k+l) 


1,  — -  j-l 


1  -  1.  — -  j-l 


Cj.1  “  Cj  dj(k)  /  dj(k+l)  (93) 

The  notation  «-  is  used  for  "replacement"  in  the  FORTRAN  implementation.  Detailed  proof  Is  given  in 
reference  [60]  (p.  45).  This  method  of  calculating  D(k)  and  t)(k)  is  generally  valid  for  both  covariance 
filters  as  well  as  Information  filter  updates.  To  illustrate  the  usefulness  of  this  factorized  approach, 
consider  the  covariance  update  (Kalman  fitter): 

P(k+l/k+l,f>  -  P(k4l  /k,£)  -  P(k-H/k.|)  HT(k)  H(k)  P(k4l/k,.Q  HT(k)  H(k)  P(k+l/k,Z).  (94) 

By  assuming  scalar  measurement,  (94)  can  be  factored  into: 

U(kfl/k+l)  D(k+l/k+l)  UT(k4l/k4l)  -  U(k+l/k)[D(k+l/k)  -  (  ^  )  V(k+i/k)  VT(k+l/k>]  UT(k4l/k)  (95) 

where  _ 

V(k4l/k)  •  D<k4l/k)  U  (k+l/k)  HT<k4l)  .  (96) 

and  the  scalar  "a"  ta  given  by; 

a  «  H(k4l)  P(k+l/k)  «T(k+l)  +  R(kal)  .  (9?) 

Using  Eq.  (89).  let 

U(k4l/k)  D(k4l/k)  ^(kel/k)  *  (DCk+l/k)  ♦  £  V(k*l/k)  VT(k+l/k)].  (98) 

Then  -■*<>  be  *«ritt«n  as 

U(kf l/ks i)  D(k4 l/kal)  ««T(k4l/k4i)  -  [U(k4l/k)  U(k4l/k)]  0(k4l/k)[U(k+l/k)  U(k4l/k)]T  .  (99) 

Since  t)(k4l/k)  and  U(k+l/k)  are  unit  upper  triangular,  Eq.  (99)  yields: 

U(k4l/k4l)  -  tf(k4l/k)  U(k*l/k)  (100) 

0(fc4  l/k+l)  •  D(k+l/k)  .  (101) 

The  above  results  show  that  the  problem  of  factoring  the  filter  update  covariance  has  bean  reduced  to 
the  taak  of  factoring  a  symmetric  matrix  (D(k4l/k)  j  V(kel/k)  VT(kel/k) ]  into  U{k4 l/k)  and  D(k4l/k). 

Mow.  coma  id  tr  the  covariance  propagation 

P(k4l/k)  •  tfkel.k)  P(k/k>  *T(k4l.k)  ♦  Q(k).  (102) 

It  la  required  to  find  [U(k4l/k)  D(k4l/k)  U*(k4l/'n)]  inch  that  it  it  equal  to  the  right-hand  aide  of  gq. 
(102) .  Without  loee  of  generality,  let  Q(k)  ha  a  diagonal  Merle  and  let 

vP(k*l/k)  $  t Khel.k)  0(k/k);  t  ]  (103) 


B(k4l/k)  4 


toil;.-? _ 

l  0  !  Q(k) 


Thao  it  can  ha  shown  that 


0 (he l/k)  tftkel/k)  CT(kal/k)  -  Kkel.k)  U(k/k)  0fk/h)  t)T(k/h)  9T(h*l,k)  a  Q(k)  ,  (10$) 

which  la  the  desired  result.  Sq.  (10))  can  ha  efficiently  computed  using  the  modi find  Cram-tchmldt  method, 
the  householder  trsnaforMtlon,  or  the  Cleans  transformstioa. 
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F.  Chandrasekhar  Filter* 

An  approach  to  minimize  the  computer  burden  wet  Introduced  by  Kallath  [27]  who  considered  a  special 
case  of  continuous-time  stationary  process  and  showed  that  differential  equations  of  the  Chandrasekhar  type 
Instead  of  the  matrix  Rlccati  differential  equation  can  be  used  to  compute  the  filter  gain.  This  develop¬ 
ment  was  Immediately  followed  by  Llnqulst  [28]  who  treated  the  same  problem  by  meana  of  the  backward 
Innovation  process.  Morf  et  al.  (67,  69]  and  llnqulst  [29]  have  solved  the  discrete-time  stationary  process 
problem;  Frledlander  et  al.  [68]  have  treated  the  discrete-time  nonstationary  proceaa  problem  by  Introducing 
a  way  of  classifying  stochastic  processes  In  terms  of  an  "Index  of  nonstattonaric;".  It  was  shown  that 
Chandrasekhar  equations  can  be  derived  from  the  extended  Lev-nsoo-Vhlttle-Wlgglns-Roblnson  algorithms 
for  stationary  time-series;  Lalnlotis  [30]  has  provided  generalized  algorithms  for  the  continuous-time 
nonstaclonary  as  well  as  stationary  processes. 

Approaching  from  the  square-root  algorithm  viewpoint  (l.e.,  propagation  of  the  square  root  of  P(t) 
Instead  of  the  square  root  of  P(t)),  Morf  et  al.  [70]  derived  the  continuous-time  Chandrasekhar  c 

filter  equations  which  are  Identical  to  results  given  In  reference  [30],  In  which  the  "Partitioning  Poimulas” 
of  Lalnlotis  were  used. 

For  the  purpose  of  comparing  digital  computational  efficiency,  only  the  discrete-time  Chandrasekhar 
algorithm  la  described  In  the  paper,  since  It  was  pointed  out  in  reference  [70]  that  the  number  of  computing 
operations  la  approximately  equal  for  various  versions  of  Chandrasekhar  filters. 

By  considering  constant  matrices  t,  H,  F  ,  Q  and  R  associated  with  a  system  model  given  In  (1)  and  (2). 
reference  [29]  presented  the  following  results,  from  which  the  optimal  filter  gain  matrix  K(k)  can  be 


determined  In  the  following  manner: 

K(k)  -  g  A(k)(tiA(k)  e  R]'1  (106) 

where 

A(k)  •  A(k-l)  .  A'(k-l)  C*l(k-l)  A'*l(k-1)  UT  (107) 

A'(k)  -  ♦A'(k-l)  ♦  A«k-t)[RA(k-l)  ♦  R]'1  HA*(k«l)  (108) 

T  T  'l 

C(k)  -  C(k-l)  -  A'*(k-1)  H* (HA(k-l)  ♦  R)  HA  '(k«l)  ,  (109) 

with  tnltul  considerations: 

A(o)  -  Po  HT  (UO) 

A'(0-9»o»T  (Hi) 

C(a)  -  11  Fo  »T  e  t  .  (118) 


Koto  that  A<k)  end  A  *(k)  are  n  x  a  matrices  and  C(k)  (a  a  symmetric  mam  matrix.  Thus.  In  order  to  solve 
foe  A(k>  only  2  o*  e  [»(*♦!)  )/2  equations  ere  needed,  ee  In  contrast  to  the  conventional  Kalman  algorithm 
In  which  h?  equations  are  required  to  compute  the  Miter  gain.  If  m  «  <  n,  which  Is  true  In  aeny  prettiest 
situations,  the  number  of  equation*  to  he  solved  tn  each  step  1*  of  order  n  versus  a®.  Since  only  the 
Invert#  of  C(k)  It  needed  In  tq.  (107),  Eq.  (109)  msy  he  replaced  by 

C“l(k>  -  C*‘(k-1)  e  C*l(k-1)  A'T(k*l)  «T(HA(k)  4  t)**  KA'(k-l)  c"l(k-l>  .  (113) 

which  can  he  obtained  by  applying  the  matrix  Inversion  learns  to  Eq,  (109). 

The  shove  results  mere  also  extended  to  the  continuous-time  case  by  Llnqulst  [23]  and  they  mere  shown 
to  be  exactly  corresponding  to  the  equations  derived  by  Us l lath  (27) . 

c. 

tn  a  radically  different  approach  to  filtering  and  ntisstim  tn  gtneral,  Lalnlotis  (12-Jf,  72-M) 
hat  developed  the  Part U toeing  Algorithms,  mhtch  are  fundamentally  new  techniques  never  explored  before. 

The  partitioning  approach  yields  mem  results  for  linear  at  melt  as  nonlinear  eat last  too  m  naturally 
decoupled,  computationally  attractive  and  fast  para  Me  l -processing  realisation*.  The  partitioned  filter 
contains  the  Kalman  filter  as  s  special  cast  and  it  constitutes  the  natural  framework  for  efficient  change 
of  initial  conditions  without  recourse  to  tmoctislna  the  data.  This  special  property  will  lead  to  effl- 
cleat  computations.  Savers 1  partitioning  elgorlthma  for  discrete-time  linear  systems  are  given  below. 

1.  Centre  t  fitt  it  toned  AlsoUthm  (CFa) 

In  the  filtering  problem  stated  tn  Section  2.  solution  provided  by  the  fnrtttloolng  approach 
consists  of  decomposing  the  initial  state  vector  x(f)  Into  the  sum  of  two  Independent  vectors: 

x(«  •  %  (lib) 

where  *  Is  an  unknown  model  parameter  vector  to  bo  adapted  (reference*  (32,  3),  7k]),  let  end  f,  be 
tta  mean  aad  covariance  of  *6  respectively  (the  choice  of  and  7(  Is  arbitrary) .  Since  xft  and 

are  aasumad  to  be  Independent ,  the  fol Lowing  initial  conditions  relationship  bald: 

«(V$  -  *.(*  ♦  tr<«  (115) 
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•  P n(t>  +  Pf(«  (U6) 

The  optimal  filtered  estimate  end  the  corresponding  error-covariance  oetrlx  P(k.f)  are  given  by 

S(k/k,f)  -  ao(k/k,X>  +  fir(k/k.f)  (U7) 

P(k,4  ■  P„(k.f)  +  P,(k,i)  fork*!  (118) 

u  X 

where  $Q(k/k>i)  end  Pn(k,f)  are  computed  uelng  the  standard  Kalman  filter  equations  with  initial  conditions 

“  *n  end  P(X,f)  »  P(f).  The  remainder  estimate  Sr(k/k,0  and  the  corresponding  error  covariance 
matrix  are  given  by 

4r(k/k.f)  -  ^(k.$  ir(!/k,X)  (119) 

Pr(k/!)  -  ^(k.f)  Pr(!/k)  ^(k.!>  (120) 

where  i  (f'k.f)  and  ?r(!/k)  are  the  smoothed  estimate  of  the  partial  initial  state  x  (!)  and  its  covariance 
matrix  Respectively.  They  are  given  by 

ir(Vk.i>  -  Pr(f/kXHn(k,f>  ♦  Pll< X)  Jr(f)]  (121) 

Pr(^/k)  -  (Pr(Q  On(k.Q  +  I]'1  P r(f>  (122) 

where 

Hn(k.f)  -Mn(k-1.£)  e  ^(k-1,0  *T(k.k-l)  ttT(k)  P*l(k/k-l,i>  (121) 

n 

f0(k.i)  On(k.O  -On(k-l.f)  f£(k-t,f)  ^(k.k-l)  «T(k)  P^(kA.f)  M(k>  ((k.k-l)  yi'U,  (l?h) 

n 

^(k.k-l)  -  (l  -  KR(k.l)  M(k)J  *(k.k-l>  (125) 

f  (k.O  •  *(k)  -  H(k)  »(k,k-l>  if*-  t/k-l.fi  (126) 

c  n 

P  (fc/k-l.f)  -  «(k)  P (k/k-1.0  «T(k)  ♦  «(k)  (12?) 

% 

«0(k.i)  •  Pn(k/k-l,f)  «T(W)  P*l(k/k-l,!)  .  (128) 

fk 

The  CPA  given  above  (Cq*.  (117-129))  constitutes  a  fen.ly  of  res  Hist  lens  of  the  optimal  linear  filter. 


one  for  eeeh  InUlel-stete-vsctor  partitioning,  for  example,  the  Kalman  filter  la  a  member  of  this  family 
for  nominal  Initial  conditions  equal  to  actual  Initial  condition*,  namely  *  0  and  P.(i)  »  (OJ.  Unlike 
tha  Kalman  filter.  CPA  la  appllcabla  to  all  initial  conditions  including  P(f. fi  •  m.  with  the  freedom  of 
choosing  nominal  Initial  conditions,  CP*  ia  closely  related  to  the  Chandraeekhar  realisation  of  the  Kalman 
filter  algorithm.  Specifically,  the  computations!  advantage#  of  the  Chandrasekhar  algorithm  depend#  on  the 
low-rank  property  of  the  actual  initial  condition#. 

The  basic  approach  of  CP*  it  to  decompose  the  Initial  state  vector  Into  the  tom  of  tw  statistically 
Independent  Gaussian  random  vector#  (£q.  (Uk)  ).  The  nstural  catena!  ,i  of  this  concept  it  to  consider  the 
decomposition  of  the  inlclst  title  into  the  sua  of  an  arbitrary  number  of  Jointly  Croatian  random  vectors 
which  may  he  statistical  tv  dependent ■  Indeed  this  concept  he*  been  developed  hy  leintotis  and  Andriaanl  tl 
(Mi  Into  tha  to-ealied  •'multipart  It  lew  tog"  algorithm,  which  can  he  used  for,  aatong  other  sppticaciona, 
efficient  parameter  identification#  and  filtered  state  eattmate  of  off-diagonal  terns  in  the  Initial-state 
covariance  matrix. 

3-  Uwfcda..Algo;Uhm 

The  lambda  algorithm  ha*  *  decoupled  structure  which  results  from  partitioning  of  the  total 
data  Interval  Into  nonover tapping  aobintervala.  Slemental  filtering  solution*  are  first  computed  In  each 
tuh  interval  with  arbitrarily  chosen  nominal  initial  condition#.  Then  the  overall  aolution  la  obtained  by 
connecting  the  elemental  piecewise  solution*  via  CPA.  Thus .  tha  desired  estimation  result#  over  the  entire 
interval  he#  been  decomposed  into  a  tat  of  coaplstely  decoupled  elemental  solution*  which  can  be  computed 
la  either  e  aerial  or  parallel-processing  mods. 

let  the  data  interval  constat  of  measurement*  ^  •  (t(o),  aft) . *(*)  )  ,  where  s(h)  $  *(t^)  and 

t0  «  t^.  Given  data  Interval  (t^.  t^)  to  be  divided  into  nonovar lapping  aub Interval a  (t^,  1^1  ,  tha 


Lambda  algorithm  la  given  by 

«J.*>  •  «0<M)  ♦  tr<li«)  cm> 

KJ.o)  -  MJ.D  ♦  T.(i.o)  <l») 

■  » 

*{J.o)  $  *(tJ/tJ.t0)  <13l) 

KJ.o)  $  KtJ.t#)  (138) 


where 


**" » 


i  I 

,  I: 

*•  !• 

i. 

% 

i  ! 

h  t 


ft 


It 


I 1 


are  the  optimal  estimate  and  Its  covariance  matrix  at  t.  with  initial  conditions  x(t  /t  )  and  P(t  ,t  )  at 
t^.  The  nooinal  quantities  ^  o  o 


yj.o  a 


Pn(M) 


W  *i> 


(133) 

(134) 


•re  the  nominal  estimate  of  the  corresponding  nominsl  covariance  it  t,,  obtained  using  the  standard  Kalman 
filtering  equations  with  initial  conditions  at  t^  given  by  ^ 


yi.t)  A  yy 

V1-*)  *  W 

The  remainder  initial  conditions  are  given  by 


*rU.o)  £  S(l.o)  -  yi,t> 


Pr(t.o)  t  P(t.o)  -  Pn(i.D  • 


(135) 

(136) 

(137) 

(138) 


fhe  remainder  smoothed  estimate  yi,o)  and  Its  covariance  matrix  Pr( ) ,o)  are  given  by  Eq*.  (119-128) 
with  the  following  proper  Identlficatlona:  t^  •>  y  t^  •  t^  yi)  »  yi,o)  and  Pf(l)  *•  P^l.o). 

To  show  the  recursive  nature  of  the  Lambda  algorithm,  equations  (129-130)  and  the  present  version  of 
equations  (121*122)  may  be  combined  to  yield: 


«().<>)  •  yj.o  ♦  yj.oiyt.o)  <yj,t>  ♦  n  tyi.o>  yj.o  ♦  *Ta.o)i 

po.o)  •  yi,o  ♦  yj.otPr(t.o)  o„o.i)  *  tj  ptu.o)  {o.o 


(139) 

(140) 


where  0  (1,1)  $  0  (t,,t.)  end  H  (l.t)  $  M  (t,,t,)  sre  obtained  from  equations  (123-125)  for  the  sublutervsl 
«  ojl  n  n  j  i 

(t.,t.).  The  recursive  operation*  are  repeated  for  each  sublntervai  until  t.  »  t 
1  )  i  n 

3.  Pelts  Algorithm 

The  Pelts  algorithm  is  developed  baaed  on  "doubling"  the  length  of  the  partitioning  Interval 
of  the  t-**d>4e  algorithm.  Thu  development  »»«  motivated  by  applications  of  the  Partitioned  filter  to  time- 
Invariant  model*.  In  tdilch  the  (umber  of  Iterations  needed  to  reech  steady  state  depend*  on  the  time 
constant*  of  the  model. 

The  "doubling"  algorithm,  known  to  be  faster  than  the  Chandrasekhar  algorithm  is  given  (79'  by  the 
following  recursive  equations: 

.  i 

t{ s**lA)  *  y^A)  ♦  ya"AHt<s"A)  <ya%)  y2°A)  ^<2%  »*t> 

where  *  (•).  i  (•),  and  0  (•)  ate  given  by  P 

OB  P  B 


-i 


ya**1*)  -  ?a(20A)  a  ^(2°A)!»0(2RA)  <y2*A)  *  Tj  y2ftA)  ^«nA) 
ys^A)  »  ^(2%{rn(a°A)  <ys0A)  ♦  U  1  y2nA) 

0#(2nA)  -  0o(2ftA)  a  0^(2%  ♦  1)  2^(8%  y8ftA> 

for  «  •  0,  1,  2.  3,  ... 


(142) 

0*3) 

O**) 


tt  can  be  seem  that  emtrix  inversions  which  are  required  to  obtain  the  Mceetl  solution  at  tht  end  of 
a  turn  interval  which  la  twice  as  tong  a*  the  Interval  in  the  previous  Iteration,  1.*.,  doubling. 

*•  ftr-SseDltNrtmuqUjt 

The  fat-Sample  partitioning  It  knotbtr  eatenalon  of  the  haalc  Lambda  algorithm,  la  which  par¬ 
titioning  la  done  at  every  resale  with  aero  nominal  Initial  conditions,  l.e.,  t(k)  «  0  and  P (k)  •  0  for 
k  •  1,  8.  . ...  W.  this  ultimate  partitioning  of  tha  data  Interval  at  every  seapttag  Instant  yields  com¬ 
pletely  decoupled  Uneat  estimation  from  •  ample  to  sample,  thus  resulting  In  «  simple  recuretve  algorithm, 
which  wwa  glean  (35.  37,751  by  the  following  recursive  equation*: 


t(kel.O)  *  t#(k4l.k)  t  ^(k4l,0(P(k,0)  0B(k4t,h)  4  if1  (Pfk.O)  Ha(h4l,k)  4  i(k,0)l 
Kk4l.O)  «  yk4l,k)  *  yk4l.h)(f<M)  0#(h4l,h)  4  t]*1  «k.0>  ^(k4l.k) 


whet* 


tyktt.k)  -  «T(k4l,k)  »T(k4l)  *(k4l)  *(k*l) 


0*5) 

(1*6) 

<»» 


T"  I  T 


On(k*l.k)  -  »T(k+l,k)  BT(kfI)  A(k+1)  H(k+l)  «(k+l,k)  (148) 

and  Che  Per-Sample  nominal  filter  equation*  are  aa  follow*: 

8n<k+l,k)  -  kn(k+l,k)  s (kel)  (149) 

Pn(k+l,k)  -  [X  -  kn(k+l.k)  H(k+1)]  Q(k)  (I5O) 

kJkeX.k)  -  Q(k)  HX(kH)  A(k+1)  (I51) 

where 

X 

A(k+1)  e  [H(k+l)  Q(k)  H*(k+1)  4  R(k+1>]  .  (1S2) 


The  Per-Sample  partitioning  filter  1«  aeaorylcss  «•  It  cap  be  seen  from  Eq*.  (147-152),  since  alt 
«es»ry  has  been  transferred  to  the  basic  partition  filter  equation  (145).  It  1*  observed  that  coeiputation 
Of  the  nominal  filter  gain  k^fk+l.k)  1*  accoeipllshed  using  Eq.  (15U  *nd  Eq.  (152).  without  repeated  use 
of  the  Rtccati  equation  as  Is  required  In  the  Kalman  filter  computation. 

Xt  1*  *l*o  observed  that  all  other  quantities,  H  ,  0  ,  *  ,  P  and  »,  are  completely  determined  non- 
requrjlyyly  by  Eq*.  (Ik7-152),  using  only  the  model  quantities  and  the  data  at  the  current  time  (tk+l). 

The  remarkable  nature  of  the  Ter-Saople  partitioned  filter  and  Its  computational  advantage*  can  be  seen 
further  by  considering  the  case  of  time- invariant  model*.  For  time- Invariant  models,  the  recursive  algo¬ 
rithm  Is  exactly  a*  given  in  Sq*.  (1^5-152) ,  except  that  now  alt  relevant  quantities  are  time-invariant. 


Specifically,  the  Per-Sample  partitioned  fitter  la  now  given  (J5,  )7,  7$)  ea  follow*: 

Per -Sample  Partitioned  Filter 

t(k4f.O)  •  *a(k+l.k)  a  ^(P(k.O)  Qo  *  t}*1  tP(k.O)  Xo(k+i.k)  *  x(k.O)]  (15)) 

where 

Ko(kel,k)  «  *T  llT  A  *(k*l)  (154) 

°n  *  {  055) 

^(ket.k)  -  ko  »(kel)  05*) 

%  *  I*  •  «l  Q  (157) 

kB  *  Q  «T  A  (S5g) 

sad  -1 

A  t  {S  (|  «  *  •  }  <  159) 


To  fully  appreciate  how  Interesting  the  shove  version  of  the  partitioned  (liter  1*.  it  must  be -noted 
thee  both  the  Kalman  filter  realisation  and  the  Chandrasekhar  realisation  of  the  optimal  linear  estimate 
result  In  filter  algorithms  that  are  time-varying  even  for  time- Invar lent  model*.  ftemety,  both  the  Kalman 
and  Chandrasekhar  realisations  are  ttae-varylng  filters  In  the  transient  evee  for  time-invariant 

mfcdeta,  yet  it  la  seen  that  the  above  Per-Sample  partitioning  realisation  of  the  optimal  fitter  for  time- 
Invariant  models  t*  a  completely  time- Invariant  one  even  from  the  first  Iteration.  This  transformation  of 
a  basical ly  time-varying  filter  (at  least  *t  the  transient  stage)  into  an  effectively  time- invariant  one  Is 
do#  to  partitioning  and  the  aero  initialisation  at  eech  sampling  lhat'ant,  which  lead  to  completely  decoupled 
and  ammcrylaa*  nominal  filter*. 

it  Is  noted  further  then,  in  view  of  the  time  invariant*  of  all  the  relevant  quantities,  namely  0  ,  k  . 
^  and  P#.  that  they  only  need  to  be  c imputed  once,  and  stored  for  subsequent  use.  *  9 


H.  Decentralised  Filter* 


taring  the  last  few  years,  a  novel  approach  used  for  state  e»tiMtioo  of  large-scale  system*  ha* 
h**«  dr<«htf*n«*ti?h.  As  s  result,  s  number  of  decent  re  Used  filters  have  bean  developed.  Two  of  which, 
addressing  the  general  problem  (inttrcoenectlon*  exist  In  ail  subsystem*  «*  well  a*  U  the  measurement), 
at#  the  tO'  SlIed  Surely  locally  Unbiased  (SIX')  ( J9-44) ,  and  Sequentially  Psrttt loced  Algorithm  (SPA) 
flltata.  The  attractive  feature*  of  these  slgorithm*  are  filter  stability  and  computation  efficiency. 
Thee*  froyntlsi  for  the  coetimioua-tlaw  SU!  formulation  have  been  demon* traced  fc?  sandera  at  at.  jbjj. 
Tha  dlaiusatan  of  the  discrete-time  decentralised  filters  it  given  by  bn  and  Chin  in  Part  l  of  thia 
volums.  fur  as  If -contained  purposes,  a  brief  smaaavy  sf  than*  StgortUmw  la  glvmm  below. 

1‘  .Clg  Jartiv.locdnT  AhibiSitsd  fitter 


OsmaUSar  tha  following  collectloa  Jg  .  1*1.2 . ft)  of  ft  Interconnected  *td>*y*t«ms: 

ft 

SjOO  ♦  tu  Z  Kjfk)  e  wt(fc)  . 

S#j 


(160) 
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With  measurement  at  S.  , 

1  N 

i  (k)  «  H  (k)  x.(k)  +  H.  (k)  E  L  x  (k)  +  v.  (k)  (161) 

J-l  )  1 


Hi(k)  1*  a  matrix  of  rank  <  <11  .  and  has  the  physical  interpretation  that  the  local  decision  Baker 

at  Si  can  obaerve  all  sybsystem  interactions.  H,(k)  is  the  local  state  observation  matrix.  The  problem  is 
to  find  N  decentralized  filter  gains  of  the  specific  SUI  class  to  minimize  a  global  cost  function.  The 
approach  is,  first,  to  decouple  into  N  local  minimization  of  the  constituent  cost  functions  for  the  N  local 
subsystems,  then  apply  the  discrete-time  Matrix  Minimum  Principle  to  solve  for  the  gain  of  each  local  sub¬ 
system.  This  procedure  yields  recursive  computations. 


The  optimal  state  estimate  is  given  by 


/  /  —  t 


*  •  _  1 


S^k+l/k)  -  *u(k+l,k)  ^(k/k-1)  +  Lu(zu  -  Hu  xt)  +  K^z^  -  tt) 


where 


pi  f 

“l  t 


‘il 

*v 

‘l2 


-1 


-  U2  *i 


Pt  <«t 


Pi  x  is  the  dimension  of  the  itk  local  measurement 
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Ri2 

^  ^  ki  ^ 

_  ,T 


4  i  4 


_,T 


K12(k)  "  t*ii^vl  k>  H1JJ(k)  -  Lu(k)  {Hu(k)  P^k/k-1)  Ul2(k)  +Ru(k)}  ] 


_,T 


-1 


fHi2(k'  Pi<k/k’1>  Mig<k>  +  *l2<k>l 


T 


P^k+l/k)  -  (»u(k+l,k)  S  tu(k)  Ki:tk)  P.(k/k  l)[»u(fcfl,k)  -  ^(k)  Hu) 

T 

-  [  -  ^(k)  It'^k)  }  ?i(k/k-l)  H12(k)  -  l't(k)  i'3(k)] 

_,T  -X 

•  fH^Ck)  P^k/k-1)  H^k)  +  Rr,(k)] 

t  —»  _»T  i  i  * 

•I  duOM-i.k)  *  hu(k)  Ru(k;.}  p^k/k-1)  H12(k)  -  Lu(k)  *13<k)] 


+  [Qt(k>  +  hu(k)  Ru(k)  1^(4)] 


T 

Vk>  4  Qt(k)  -  L^tk)  *^(k)  L*t<fc) 


(162) 

(163) 


(164) 

(165) 

(166) 

(167) 

(168) 

(169) 

(170) 

(171) 


(172) 


(173) 
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Equation  (172)  describe*  how  the  variance  In  estimation  error  evolves  in  discrete-ti-ws.  T'  «  SUJ  filter 
treats  the  interaction  input  to  each  local  subsystem  as  if  it  were  just  a  o-mean ,  Gaussian  white  noise., 
but  of  the  appropriately  adjusted  covariance. 

Although  it  is  necessary  to  find  u^(k)  and  u,,(k)  at  each  measurement,  calculation  of  their  inverses  is 
not  needed  because  they  are  imbedded  in  the  decentralised  filtering  algorithm. 


Another  formulation  of  the  decentralised  filter  is  given  by  Shah  [90].  The  so-called  SPA  filter 
can  be  summarised  below.  Consider  the  following  subsystem  and  measurement  equations: 

N 

xt(k+l)  -*lt0cfl,k)  xt(k)  +  E  l^Ocfl.k)  Xj(k)  +  Wi(k)  <174) 


st(k)  -  H1(k)  xt(k)  +  H^k)  Lt(k)  i(k)  +  vt(k) 


-  Hj(k)  x1(k)  +  H1(k)  E  --^(k)  *j<k)  +  vt(k) 


for  i  -  1.  2 . N. 

The  estimation  error  is  defined  in  tbe  usual  manner: 

et<k/k)  ^  ^(k)  .  fit(k/k)  (177) 

where 

S^k/k)  -  E{xi(k)/ Z(k)}  (178) 

in  which  Z(k)  la  the  measurement  set.  By  combining  Eqs.  (174 )  and  (177),  the  1th  subsystem  may  be  vrltten  aa: 

N  .  * 

*t(k*l)  •  #lt(k+l.k)  -  t  ltJ(k+l.k)  «  («  ■«  wt(k)  .  (179) 

J  ^ 

and  tha  measurement  equation  for  th*  Ith  subsystem  la  represented  by 

_  *  N  * 

*u(k)  ■  Ht(k)  xt(k)  +  Ht(k)  htJ(k)  tj(k/k-t)  ♦  vt(k)  (180) 


e  H 

w^k)  £  vj(k)  ♦  5  ili(ke-l.k>  e^k/k) 


v*(k)  S  v, (k)  +  t  h,  ,(k>  e.(kA-l)  .  (182) 

i  i  j.!  i)  ) 

W 

Atstau  e,,  w*  and  v*  can  be  treated  aa  Cauealen  white  noise,  then  th*  standard  Kalman  filter  algorithm  can 
be  applied  to  each  subsystem  with  the  following  appropriately  modified  covariances: 

Q*t(k)  £  «i**(k)  w*T(k) }  (185) 


Qu(k)  -  Qu(k)  ♦  t  ♦tJ(kel.k)  P^k/k)  ^(k+l.k) 
£  k(v*(k)  ^  (k>) 


k*t(k)  •  Ku(k)  ♦  Ht(k)  >  Z  tH(k)  Pj(k/k-l)  1^(10  V  «[(k)  •  <«*»> 

1  51 

th*  resule  of  applying  the  Kalataa  filtering  technique  yields  tha  U  l lowing  aubayetem  scats  propagation  and 
state  update: 

« 

t^kel/k)  •  l^Oeel.k)  S^k/k)  ♦  z  ^  (kel.k)  ^(k/k)  (187) 

Jdi  ' 


or 


a  (k*i/k)  -  ♦,.(k*i,k)  a.(k/k)  +  r  8,(k+i/k) 

i+i 


(188) 


S£(k4l/k4l)  -  a^k+l/k)  +  Ki(k4l)(*11(k+1)  -  il^ktl)  *£(k4l/k)  -  H£(k4l)  t  L£j(k4l)  x^k+l/k)] 

i«  (■”> 


in  which  the  filter  gain  K£(k4l)  Is  computed  In  the  usual  (Kalman  filtering)  manner: 
Kl(k4l)  -  P£J(k4l/k>  H[(k+l)[H1(k4l)  Pu(k4l/K)  H^(k4-l)  4  R^OM-l)] 


(190) 


and  the  covariance  propagation  and  update  aru  given  by 

Ptt(k+l/k)  -  *ti(k4l/k)  P££(k/k)  *£1(k4l/k)  4  Q*£(k) 


(191) 


Plt(k4l/k4l)  .[x  -  K£(k4l)  Ht(k4l)]  Pil(k+l/k)[l  -  K£(k4l)  H£(k4i)f  4  K£(k4l)  R1£(k4l)  Kt(k4l). 


(192) 


4.0  COhPUTBR  SDRDEHS 

Discussions  of  computer  burdens  of  various  algorithms  described  in  the  previous  section  can  be  found 
In  open  literature  (eg.,  [14-24])  in  which  considerable  data  were  provided  pertaining  to  the  computation 
efficiency  of  Covariance  and  Information  filters,  and  their  Square-Root  variations,  as  well  as  the  Chandra¬ 
sekhar  and  Facotrieed  filters.  However,  computer  burdens  of  the  recently  developed  Partitioning  and 
Decentralized  filters  have  been  documented  only  in  closed  literature  (e.g. ,  (92,93]).  The  purpose  of  chia 
section  Is  to  provide  an  assessment  of  computer  time  and  memory  requirements  of  these  relatively  new 
approaches  as  well  a a  other  conventional  algorithms. 

It  is  well  knovn  chat  a  preciae  quantitative  statement  of  computer  central  process  unit  (cpu)  time  end 
memory  storage  requirement*  are  difficult  to  obtain,  since  the  exact  number  of  count*  depends  upon  the 
manner  In  which  the  filter  equation*  are  programmed  and  the  particular  computer  used  to  process  the  date. 

For  these  reasons,  only  an  approximate  assessment  la  given  here.  For  example,  the  logic  time  (151  ha*  been 
excluded.  Also,  the  transition  matrix  and  the  aeaaurement  matrix  are  stained  to  be  given,  alnce  the  number 
of  operetlona  required  to  compute  these  matrices  1*  heavily  dependent  on  the  nature  of  the  problem.  Further¬ 
more,  In  the  process  of  assessing  operation  count*,  no  distinction  la  made  between  multiplication  and 
division.  Although  the  cpu  Urns  required  to  perform  a  division  la  longer  than  multiplication,  this  noaump- 
tlon  Impacts  the  result*  In  a  minimal  manner  because  the  number  of  division*  In  a  filtering  cycle  1*  very 
small  compared  to  the  number  of  multiplications.  Sine*  multiplication  require*  much  more  cpu  time  than 
addition  and  subtraction,  hence,  for  first  order-magnitude  approximation.  It  1*  reasonable  to  regard  computer 
time  a»  directly  proporttonal  to  the  number  of  multiplications  (Including  divisions  and  extracting  square 
roots)  needed  to  complete  the  filtering  cycle.  In  general,  computer  time  and  m*SK>ry  requlremen  a  are  given 
tn  terms  of  n  and  m,  where  n  is  th*  dimension  of  the  state  vector  end  m  t*  the  dimension  of  the  measurement 
vector.  In  the  cast  of  decentralised  filtering,  u.  end  q.  are  used  to  represent  dimension*  of  the  state 
vector  and  measurement  vector,  respectively.  In  tu#  case  of  sequential  processing  of  vector  meaaureawnta 
or  acaler  measurements,  other  symbols  will  bs  used.  Fot  example,  Btsrman's  equation*  for  SqXF  and  the 
factorised  filters  belong  to  thl*  category,  naturally,  cautions  nuat  bt  taken  when  comparison  is  made 
between  sequential-  cr  scalar-processing  technique  and  vector-processing  technique. 

All  matrix  inversion*  sre  assumed  to  be  performed  via  th*  Cholaaky  factorisation  routine,  which  require* 

only  (  g  n  4  *  n  4  n  q]  operations  (q  1*  the  number  of  multiplication*  required  to  extract  the  aquar*  root 

of  a  scalar)  and  (  |  n2  4  |  n]  memory  location*.  The  numbev  of  operation*  required  for  th*  calculation  of 

eigenvalue*  and  eigenvector*  are  difficult  to  ******  because  of  the  iterative  process  Involved.  Thus  a  var¬ 
iations  1  parameter  is  allcvad  In  th*  operation  count*. 

Following  th*  simplified  approach  together  with  the  above  assumptions,  th*  number  of  predominant 
opamtona  (multiplication)  and  mamorle*  required  for  variout  algorithms  sre  assessed,  and  results  are 
presented  in  table*  1  -  12.  in  addition,  Table  13  is  provided  to  show  recent  trends  in  computer  operation 
speed*,  so  that  cpu  time  for  dlfferant  machines  can  be  derived  for  eech  algorithm. 

A  separate  table  for  toe  Extended  Kalman  filter  i*  not  being  made  because  the  standard  Kalman  filter 
includes  the  Extended  Kalman  filter,  In  which  a  tat  of  nonlinear  differential  aquation*  swat  b*  Integrated 
tn  order  to  propxgate  states  between  measurement r.  For  thl*  rector.,  up  to  90fr  oC  cpu  tlae  required  par 
filter  cycle  t a  spent  tn  integrating  differential  equation*.  Th*  temalnlng  10£  of  cpu  time  would  b*  spent 
on  performing  the  computation  sequence  of  th*  ttandard  Kalman  filter.  The  square  root  covariance  filter  of 
Andr«et,  Taplay  and  Chce,  Morf,  levy  and  Kallath  are  close  enough  (as  far  a*  computer  operation*  ate  con- 
canted)  to  be  considered  at  on*  clan;  therefore,  only  one  table  Is  provided  under  the  heading  of  "Andrew# 
Square  Root  Filter". 

Computation  details  of  th#  three  derivative*  of  th#  General  Partitioned  M*otithm  -  Lambda,  Delta  and 
Fvr-Sample  Partitioning  -  at  well  a*  their  square  root  formuletlont  a.*  documented  in  (92j.  In  general, 
computer  burdens  of  thaxj  derivative*  are  considerable  leas  than  that  required  by  ;h*  general  formulation. 
Particularly  attractive  la  th*  for-S*mpl*  Partitioning  algorithm,  thick  Is  teamorylasa  and  without  repeated 
uar  of  the  Rlc.-atl  equation. 
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The  entire  clui  of  zero-order  systems  with  tcalar  sequential  measurements  has  been  treated  in  great 
detail  by  Blerman  [60],  which  includes  tables  auamarlzlng  operation  counts  of  SRIF  and  factorized  filters 
as  well  as  counts  of  the  Householder  transformation.  Hence,  these  tables  are  not  duplicated  here. 


TABLE  1 

Computational  Requirements  of  the  Standard  Kalman  Filter 


Step 

Computation  Sequence 

Operations 

Storage 

1 

4<k/k, A 

tt 

2 

P(k/k  ,A 

n2 

3 

♦(k+l,k) 

n2 

b 

♦(k+l,k)  P(k/k,i) 

3 

n 

store  in  2 

5 

t<!ttl,k)  P(k/k,jf>  (k+l,k) 

n3 

store  in  b 

6 

Q(k) 

n2 

7 

8 

9 

P(k+l/k,f>  -  4(ksl,k)  P(k/k, 0  *T(k+l,k)  +  Q(k) 

4(k+l/k,j0  -  *(k+l,k)  «(k/k,i) 

H(k+l) 

2 

n 

store  in  5 

store  in  1 

m  n 

10 

P(ktl/k,  1)  HT(kfl) 

2 

m  n 

store  in  7 

11 

R(k+1) 

»2 

12 

H(fcft)  P(k+l/k,  ft  HT(k+l)  +  R(k-rl) 

2 

m  n 

m2 

13 

lb 

(a(fe+l)  P(k+l/k, fi  Hf(ktl)  +R(k+1)}*1 

K(k+l,i)  -  Kk+l/k.O  HT(k>l)(H'kfl)  P(k+l/k,  fi  RT<*1)  +  R(kU)]'1 

|(m3e3m2)eoq 

-2n 

m  n 

15 

*(k+l) 

» 

16 

i(k+l/k*l.fi  -  *Ot+l/k,/)  +  r(k+l,*)[s(k+t)  -  H(k+1)  >“(fc+l/k,  U) 

a  n 

store  in  8 

17 

P(k4 t/k+l , 0  -  F(k+l/k,l)  -  K(V+l,^(r(kH,k,0  HT(ktl)]T 

mn2 

store  in  10 

2n3-M2+2*ui2 

3n2-Ht+2mn 

Tota  l 

O 

♦  2a n  +  m  n 

* 

e  a*2  +  » 

TABLE  2 

Computational  Requirements  of  the  Stabilized  UU.o  Filter 

Step 

Computation  Sequence 

Operations 

Storage 

1 

KkA.fi 

0 

8 

KkA.fi 

n8 

3 

|(k+l,k) 

u8 

k 

♦Ortt.k)  nv*.Q 

n3 

store  in  2 

5 

«k*l.k)  KkA.fi  «T(kel,k) 

n3 

store  Co 

6 

Q(k) 

o8 

7 

KkelA.fi  •  Kkel.k)  KkA.fi  *t'kM,k)  +  q(k) 

store  in  5 

e 

KkelA.fi  -  Kkel.k)  KkA.fi 

n2 

store  in  1 

9 

U(k*l) 

tt  0 

10 

KkelA.fi  HT(kel) 

mn8 

store  la  7 

(Continued) 


TABLE  2  (Continued) 


Step 

Computation  Sequence 

Operations 

Storage 

il 

R(ktl) 

m2 

12 

H(fcfl)  P(fc+l/k,0  HT(k+l)  +  R(k+1) 

2 

m  n 

m2 

13 

[H(kt-l)  P(k+l/k, f)  HT(k+l)  +  R(lM-l)]’1 

1,  3 2*  . 

— (m  +3m  )-Hnq 

store  in  12 

lk 

K(kf  1 ,  i)  -  P(k+l/k,jf)  HT(k+l)[H(kfl)  P(k+l/k,  t)  HT(k+l)  +  ROt+l)]"1 

2 

m  a 

in  n 

15 

*<k+l) 

m 

16 

*(kfl/kfl,i)  -  «(fcfl/k,i)  +  K(k+l,0[i(kfl)  -  H(k+1)  *(k+l/k,jQ] 

m  n 

store  in  8 

17 

1  -  K(kf  1 ,  C)  H(k+l) 

n2  m 

store  in  10 

18 

[X  -  K(k+l,i)  H(k+1)]  P(k+l/k,jO 

n3 

store  in  3 

19 

[I  -  K(kfl,fl  H(kt-l) I  P(fcfl/k,  jQ[I  -  K(kfl,  C)  H(fc+1)]T 

n3 

store  in  17 

20 

K(kfl,  f)  R(ktl) 

2 

m  n 

store  in  18 

21 

P(kfl/kfl,i)  -  [X  -  K(k+1,£  H(k+1)]  P(k+l/k,«[I  -  K(kH,jQH(kH)]T 

+  K(kfl,i)  R(kt-l)  KT(k+l,jQ 

2 

n  tn 

s tore  in  19 

3  P  p 

kn  +n  +3mn 

p 

3n  +n+2an 

Total 

+  3m  n  +  mn 
+  |m3+|m2+mq 

+  2 m**  +  tn 

TABLE  3 

Computation*!.  Requirement*  of  the  Potter  Square  Root  Filter 


Step 

Computation  Sequence 

Operatlona 

Storage 

1 

*(k/k  ,t) 

n 

2 

8(k/k.© 

n2 

3 

*(k+l.k) 

n2 

k 

*(kfl/k.E>  •  «(kH.k)  8(k/k,0 

n2 

store  in  1 

5 

S(kfl/k,  t)  -  ♦(k+l,k)  S(k/k,|) 

n3 

store  In  2 

6 

h(k+l> 

Q 

7 

y(lrt-t,8>  -  S^(k+l/k,  S)  h(kfl) 

n2 

A 

8 

8(k+l/k,  0  y(k+l,  X) 

n2 

n 

9 

r(kfl) 

i 

10 

yT(ktl,fl  y(k+ti  t) 

A 

i 

11 

R(kH,4>  -  8(k+l/k,«  y(ktl,D/  tyT(k+l.$  y(k*l,X>  +  r(k+l)] 

n 

store  in  8 

12 

*(k+l) 

1 

13 

hT(WU)  8(k*l/k.fi 

n 

1 

lk 

*(k+l/kU ,i)  -  *(k+l/k,0  +k(k+l,  X)(a(kt-l)  -  hT(kU)  8<kM/k,  X>] 

n 

store  In  k 

15 

l  -  yT(k*l,0  y(k+l.«  /  {yT(k+l.«  y(k+l,«  ♦  r(k+l>] 

i 

store  In  12 

16 

•li[l-  yT(k+l,A  y(k+l,)Q  /  tyT(k+l,«  y(kel,0  +  r(W-l)]  ]l/2 

9 

store  in  15 

17 

a  ■  (step  16]  /  [atep  10] 

1 

store  in  13 

18 

y(k+l,«  y*<kH,« 

n 

store  in  6 

19 

«y<kU,$  y*(k+l,® 

n 

store  in  18 

20 

$(lwl/k+l,jQ  ■  SOe+l/k, J>(I  -  ay(k*l,«  yT<k+l,$] 

n2 

store  in  5 

n3  ♦  kn2 

2nS  +  ko  +  k 

Total 

♦  6o  +q+2 

TABLg  4 

Computational  Requirement  of  the  Bellantonl  and  Dodge  Square  Root  Filter 


Step 

Computation  Sequence 

Operations 

Storage 

1 

«<k/k,  £) 

n 

2 

S(k/fc,,© 

2 

n 

3 

f(k+l,k) 

2 

n 

4 

X(k+l/k,£>  -  t(k+l,k)  «(k/k,D 

n2 

store  in  1 

5 

S(k+l/k,  £)  -  t(k+l,k)  S(k/k,  £) 

n3 

2 

n 

6 

R'^(k+1) 

0 

m~ 

7 

H(k+1) 

m  n 

8 

R**(k+1)  H(ktl) 

m2  n 

m  n 

9 

i(ktl) 

m 

10 

*(k+l)  -  H(k+1)  S(k/k,j© 

m  n 

store  in  9 

11 

BT(k+l)  -  R-*<k+l)  H(ktl)  S(k+l/k,  X) 

2 

n  m 

store  in  7 

12 

I  +  BI(k+l)  B(k+1) 

2 

m  n 

store  in  3 

13 

[I  +  BT(k*l)  B(kt-l)]*1 

13  2 

“(n  +3m  )+  m  q 

store  in  12 

14 

R*^(k+1)  [step  10] 

.* 

store  in  4 

15 

[step  13]  [step  14] 

m2 

store  in  10 

16 

[B(k+1)]  [step  15] 

m  n 

store  in  8 

17 

*(k+l/k+l,j©  -  *(k+l/k,,Q  +  S(k+l/k,  f)[step  16] 

2 

n 

store  in  4 

18 

B(lc+1)  BT(k+l) 

m  n2 

store  in  12 

19 

D,  diagonal  matrix  consisting  eigenvalues 

* 

store  In  15 

20 

T,  transformation  matrix  consisting  eigenvectors 

store  in  16 

21 

(I  +  D)**  -  I 

m  q 

store  in  20 

22 

((X  +  D)’*  -  I]  TT 

m  n 

store  in  11 

23 

Tfstep  22]  +  I 

m  n2 

store  in  18 

24 

S(k+l/tcH,  j0  -  S(k+l/k,  t)  [step  23] 

n3 

store  in  2 

*  variable  number  of  operations  depending 

Total  2n3  +  2n2  +  3n2m  +  2m2n 

2 

3n  +  n  +  2  n  m 

on  computer  being  used  a*  well  as  method 
being  used  to  compute  eigenvalues  and 

7  2  13 

+3mn+^m  +  -  m 

+  m  +  m2 

eigenvector*.  +  2m  q  +  ^ 


table  5 

Computational  Requirement*  of  the  Andrew*  Square  Root  Filter 


Step 

Computation  Sequence 

Operation 

Storage 

1 

fi(k/k,|) 

n 

2 

8(k/k,j0 

n2 

3 

|(k+l,k) 

n2 

4 

*(k4.1/k,jQ  -  *<k+l,k)  *(k/k,i) 

n2 

store  In  1 

5 

S(k+l/k,  JQ  -  t(kfl.k)  S(k/k,0 

n3 

n2 

6 

H(k+1) 

_2  "  " 

7 

*0*1) 

ac  +  a 

2  2 

8 

8^(kfl) 

m2  m 

2  2 

9 

H(fcfl)  -H(kfl)  S(k+l/k,  jQ 

n2. 

stors  In  2 

10 

wiT  •  ft  +  u*  a 

a£  +  m 

2  +  2 

stora  In  S 

It 

0.  o*1 

y  +  »2-|  +  «q 

stora  It.  10 

12 

a(lH-t) 

m 

IS 

s(k+l)  -  U(kfl)  *(kfl/k,J0 

m  n 

stora  In  12 

U 

U“l[»t«p  IS] 

(Contlnuad) 

£  + 1 

2  2 

stora  In  IS 

TABLE  5  (Continued) 


18  [step  16]  H1 


19  X(k+l/kH,jQ  -  «(fcH/k,jQ  +  (step  17]  [step  14] 

20  s(k+l/kfl,  i)  »  S(k+l/k,()[I  -  (step  15)  (step  16)  H  ] 


store  in  6 


3  2  „ 

m  .  a  ,  © 

t  +  t  +  3 


nm^  ran 
2  2 


\  o  2  3  2 

n  +  n  +  3n  m  +  ppam 

±7  1 3,-2 

+  ~:n  +  Tjm  +  <an 


store  in  7 
store  in  11 
store  in  15 

store  in  4 
store  in  9 
2 

3n  +  n  +  m  n 


p 

+  m  +  2a 


+  -  a  +  n  q 


m&A 

Computational  Requireoents  of  the  Schaidt  Square  Root  Filter 


Computation  Sequence 


1  *(k/k,jQ 

2  S(k/k,0 

3  l(kfl.k) 

4  *(kfl/k, t)  -  4(k+l,k)  *(V/k.f) 

5  qW>  , 

6  A  •  (l(W-l.k)  S(k/k.jQ  qV+D] 

7  AT  e  (e  is  an  arbitrary  n-column  vector) 

8  A  A1  e 

9  A  AT  e  eT 

10  T  -  I  -  A  AT  e  eT  /  [first  element  of  A  AT  e  vector) 

11  A  -  T  A 

12  (first  element  of  A  AT  e  vector] * 

13  First  column  of  S(kH/k,f)  ■  A  A1  «  /  (sf*P  42] 

14  To  complete  the  S(kH/k,«  matrix,  steps  7-13  will  be 
Iterated  (n-l)  times.  Since  the  dimension  of  A  and  T 
are  effectively  reduced  by  one  at  each  iteration,  the 
number  of  operations  for: 

7  a  5 


9  &  10 


Operation 


Storage 


12  A  13 

h(k*l) 

y(k*l,  f)  »  ST(k+l/k,$  h(kel) 

8(kfl/'t.  J  y(k*l ,  t) 
t(kvl) 

yT(k+',i)  y(k4‘l, 

•  a(kfl/k,Dy(kei,«/yT(k+4.Dy(^i-«'Mr(k*l> 

i(kH) 

hT(ket)  «(k+l/k .$ 
f(kfl/kel,  t)  -  *(kel/k,$ 

♦  f(kfl,^ta(kel)-hT(k+l)l(k+l/k,«] 
(Cistlsutdl 


E  (n+m)  (n+l-1) 

‘*l  n 

Z  (n-i) 

i  1*1 

n-l  j, 

E  (trt«)(n*l) 
l-1  n 

Z  (n-l) 
1-1 


store  in  1 


store  in  3 
store  in  9 
store  in  6 


store  in  8 


store  in  13 


•tore  In  17 


•tore  in  4 


•22 


TABLE  6  (Continued) 


Step 

Computation  Sequence 

Operation 

Storage 

24 

l-yT(4+l,4y(fc+l,.©  /  (yT(lctl,i>y(fc+l,0  +  r(4+l)] 

1 

atore  in  21 

25 

-1  ±  {l-yT(fctl,4)yOM-l,jO/tyT(k+l,^y(fcfl,^+r(fctl)]}% 

q 

store  in  24 

26 

a  ■  (atep  25]/[atep  19] 

i 

atore  in  22 

27 

y(k+l,«  yT(4+i,/> 

Q 

atore  in  15 

28 

or  y(k+l, ©  yT(k+l,i) 

n 

store  in  27 

29 

s(4+l/4+l,i)  -  s(4+l/4,f)[l-ef  y(4+l,X)yT(4+l,i!)] 

2 

n 

store  in  2 

Total 

+  3n^  +  6  n  +  q  +  2 

5n2  +  6n  +  5 

n  n  n  - 

+  2  E  (n-1)  +  E  (n+m)  (n+l-i)  +  I  (n+m)(n-i) 
i-1  i-1  i-1 


TABLE  7 

Computational  Requirement*  of  the  Carlson  Square  Root  Filter 


Step 

Computation  Sequence 

Operation 

Storage 

1 

2(4/4,  J0 

n 

2 

S(4/k,i) 

2 

n 

3 

♦(4+1,4) 

n2 

4 

*(4+1/4, 0  «  «k+l.k)  S(k/k,i> 

n2 

store  in  1 

5 

Q(4+l) 

2 

n 

6 

♦(4+1,4)  S(k/4.0 

n3 

store  In  2 

7 

♦(4+1,4)  S(4/4.0  ST(k/k,$  ^(k+l.k) 

n3 

n-2  n-1 

store  In  6 

8 

S(k+l/4,  t)  - 

(  ♦(4+l,k)S(4/4, ^ST(k/k, 4) ♦(k+l,k)+0(4+l)]' 

E  (n-l-i)  +  E  (n-t) 
l-l  l-l 

♦  (n-2)  ♦  (n-l)q 

9 

h(4+l) 

0 

10 

y(k+l , 0  •  ST(4+l/4,0  H(4+l) 

n2 

a 

11 

S (4+1/4, 0  y (4+1,0 

n2 

a 

12 

t(4+l) 

i 

13 

yT(k+l,0  y(k+l,« 

n 

i 

14 

4(4+1. 0-  S(4+l/k.©y<4+l,D/(yT(4+l.l)y<k+l,«+t<k+l)J 

n 

store  In  11 

15 

*(4+l) 

1 

16 

hT(k+l)  *<4+1/4,  ©  ,|((kHf) 

a 

l 

17 

•(k+l/4+1,  D  •  *(4+1/4, <^t<4+l)-hT(4+l)*<4+l/k,^) 

0 

store  in  4 

18 

y  (4+1,1)  y*(4+l ,  f) 

0 

store  in  9 

19 

l  -  (y(k+l ,  t)  y*(4+l ,!))/{ yT(4+l ,  0  y (4+1 , 0+r (4+1) ) 

n 

store  In  3 

20 

(I  -  (y(4+l,  ®y*(4+l,  0)/{y*(4+l,  $y(k+l, 0+r(4+l)) 

tarn*  aa  atep  8 

store  In  19 

21 

S(4+IA+1.*>  -  S(4+l/4,  0(atep  20] 

E  Ifn-(l-l)] 

1*1 

Store  In  7 

Total 

2a3  ♦  3«2  ♦  fin 

Jo2  ♦  4n  +  4 

n-2 

+  2  E  (n-l-l)  ♦  2(o*2) 


♦  2(n-l)q  ♦  2  E  (o*l) 

♦  E  l(o-(i*l» 
l-l 


Total 


Jo*  ♦  4n  +  4 
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TABLE  8 

Computation* 1  Requirements  o l  the  Information  Filter 


Step 

Computation  Sequence 

Operation* 

Storage 

1 

d  (k/k,  tj 

Q 

2 

p'l(k/k,« 

2 

n 

3 

rl(kfi,k) 

2 

n 

4 

Q'l(k> 

2 

n 

5 

Q(k) 

2 

n 

6 

p'^k/k.X)  »‘1(k+l,k) 

3 

n 

•tore  In 

2 

7 

p(k>  -  rTocfi,k)  p'^k/k.ij  rVku.k) 

n3 

store  In 

6 

8 

[P(k)  +q‘l(k)]'1 

^(n343n2)4nq 

store  In 

7 

9 

P(k)[P(k)  4  Q'l(k)]'1 

2  3 

n 

store  In 

8 

10 

p'Wl/k.l)  -  F(k)-F(k>[p(k)-K)‘1(k)]'1F(k) 

3 

n 

•tore  In 

9 

11 

P”l(k4l/k,  jQ  Q(k) 

n3 

•tore  in 

5 

12 

#*T(fcfltk)  d  (k/k,  t) 

2 

n 

•tore  in 

l 

13 

d(k4l/k, £)  -  [I  -  ?'l(k+l/k.0Q(k)]  *'T<k4l.k)d(k/k,f> 

n2 

•tore  In 

12 

14 

H(k4l) 

QB 

15 

*<k4l) 

B 

16 

»'’(k4l) 

2 

• 

17 

HT(k4l)  R_l(k4l) 

n  a2 

■tore  In 

14 

18 

d(kft/k+l,fi  -  d(k4l/k,f)  4  HT(k4l)  R'1(k4l)  s(k4l) 

2 

n 

•tore  In 

13 

19 

P"l(k4l/k4l,l)  -  P*l(k4l/k,i)  4  HT(k4l)  R‘l(k4l)  H(k4l) 

2 

n  m 

•tore  In 

10 

Total 

u  3  ,  7  2  2 

*jp>  4  gO  4  2m 

4n2  4  n  ■» 

►  OB 

4  a2  4  nq 

4  m  4  *2 

TABU  9 

Computet  tonal  Rtq'.'lremsnt*  of  the  CH»ndrs«ekhar  Filter 


Step 

Computation  Sequence 

Operations 

Storage 

l 

«  k/k) 

Q 

2 

«k4l,k) 

n2 

3 

<<k4l.k)  t(k/k> 

o2 

•tore  in  1 

4 

A(k-l) 

a  a 

5 

A'<k-8) 

Cl  B 

6 

C*l(k-2) 

m2 

7 

it 

B  tt 

• 

8 

•8 

9 

(8A<k-l)  4  R)‘l 

|(mJ43m2)4mq 

m2 

to 

«q.  113 

amJ4|m®tHeB® 

•tore  la  6 

a 

tq.  108 

2»2b  4  4mm® 

•tare  la  $ 

12 

Fq.  107 

m8o  4  a*,2 

•tore  la  4 

13 

tq.  106 

oan  4  m2o 

n  b 

14 

«(k4l/k4t)  •  «(fc4l/k)  4  K(k4l)(t(k4l)  -  H  t(k4l/k)J 

m  n 

•tore  la  3 

Total 

3  2 

2m  4  7  m 

2m24.2 

o 

4  o  (W8m)  4  a  a 

4  e(U4a) 

4  |<m54  3m2) 

- - 

+  *4 

W 


3 


■3 

I 

I 


TABLE  10 

Computational  Requirement a  of  the  General  Partitioning  Filter 
(In  addition  to  the  "nominal11  Kalman  filter  computation) 


Step 

Computation  Sequence 

Operation* 

Storage 

1 

fn(k-l/k-l,© 

a 

2 

*<k,k-l> 

n2 

3 

H(k) 

2 

m  a 

a  q 

4 

Eq.  126 

B 

5 

6 

?n(k/k-l,« 

t(K) 

n2 

«2 

7 

Eq.  127 

2 

n  n 

n2 

S 

P*l(k/k-l,l) 

o2 

9 

Eq.  128 

2 

a  m 

a  m 

10 

Eq.  125 

2 

n  a 

n2 

11 

On(k-l,D 

a2 

12 

Eq.  124 

3  2  2 

2n  +  3q  i  +  2 n» 

2 

n 

13 

M*) 

n2 

14 

rr(£>  on(k,  t) 

3 

a 

n2 

15 

(Pr(f)  O0(k.^  4  I)*1 

g(n3  4  3n2)  4  nq 

atore  In  14 

16 

Eq.  122 

n3 

atore  In  15 

17 

H(k-l.i) 

n  a 

is 

Eq.  123 

n3  +  n^e  4  n  m2 

atore  In  17 

19 

n2 

20 

n 

21 

Eq.  121 

n2m  +  n3 

atore  In  20 

22 

Eq.  119 

tt2 

atore  In  l 

23 

Eq.  120 

2o3 

2 

n 

Total 

8a5  4  n2{8mel)  4  4«2n 
♦  |(nJ  *  3n2)  4  nq 

I0n2  ♦  2n  4  J2*2 

4  m  *  }*  n 

I6IMLUA 

Computational  Requirement*  o(  the  SUI  Filter 


Step 

Computation  Sequence 

Operation* 

Storage 

l 

t/k/k-l) 

°i 

2 

Pt(WL-l) 

3 

♦u(krl.k) 

4 

\W 

9,  *  Pt 

5 

U*l(k)  t»t(k)  U^(k) 

***  Table  11 1 

atore  In  4 

6 

LU<4) 

9,  *  Pt 

7 

tuw  UjOO 

°l  *1 

ator*  In  6 

8 

tjW 

pt  a  m 

9 

U^OQ  l^tk) 

»!* 

store  to  8 

10 

Ht(k) 

n  *"» 

U 

UjUw  Ht(k) 

•tore  In  10 

IS 

«t(k) 

<1 

13 

«;‘w .  (w 

*  1  (Continued) 

«! 

•tore  in  12 
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TABU  UA  (Continued) 


Step 

Computation  Sequence 

Opera t tone 

Storage 

14 

V*) 

2 

<1 

15 

Ujl(k)  Rt(k)  UgT(k) 

3 

<1 

atore  in  14 

16 

Eq.  172 

2a|  +  nt  (^VPj)  +  ni<9t'pl) <a,l"V  \ 

o  *  i 

+  ntpt  +  (qt  -  Pl) 

atore  in  2 

17 

Eq.  171 

a>»f(<ii*Pi)  +  2ni<Vpi)'H,i','<Vpi)3 

“l  *  (<li  '  pi} 

18 

Eq.  162 

ni  +  *“1  ’i 

store  in  1 

Total 

Sn\  +  q3  +  2(qt  -  Pt)3  ♦  n*(t  +  5^-^]  +  Pi<3»t+«») 

*1  *  °i  +  qi  pi 

+  q®(i  +  3^)  +  q1  -  30^  pt  qt  +  2n1(2q1  -  Pt) 

+  ptn  +  2qt  nt 

Q 

4  essresete  eye ten  diacneioo  p  £  interactive  naasurament  dimension 

°i 

£  local  eyetca  diaeaeion 

qt  4  local  measurement  diaeneion 

MULia 

Step  5  of  the  SUI  Filter  Coaputetioo  Sequence  (tef .  (91]) 


Machine 

Execution  Tine  (sec.) 

IBM  370/195  (Argonnn  Hatloual  Laboratory) 

1.0 

MN  360/75  (University  of  Illinois) 

9.7 

IBM  360/65  (AXES  Laboratory) 

17.0 

IBM  370/165  (Uni varsity  of  Toronto) 

2.6 

IBM  370/168  Mod  3  (Stanford  University) 

2.3 

Burroughs  6700  (University  of  California,  Sea  Otago) 

82.0 

CSC  6600  (Klrtlend  Airforce  Beat) 

4.4 

GDC  Cyber  175  (NASA  Langley  research  Center) 

1.2 

GDC  7600  (national  Center  tor  Atoaoepheric  Be search) 

0.87 

CSC  7600  (UMVence  Live  more  Laboratory) 

1.2 

COC  6400  (Nor three tern  University) 

15-0 

CCC  6400/6500  (fctrdae  University) 

17.0 

GDC  6600/6400  (Uolwre tty  of  Tessa) 

5- a 

Money**!!  6070  (Bell  Laboratories) 

9.6 

Ualvac  1110  (Bnlveralty  of  Wisconsin) 

7.7 

tWC  KA  -  FW  -  10  (Yale  University) 

79.0 

AMDAHL  4 70V/ 6  (University  of  Michigan) 

2.1 

IABU12 

Computational  tagulrs— ate  of  the  $M  Filter 

Step  Opgutlw  Sequence  Opemlont 

l  it(k/k) 

a  »u(h/k) 

5  t^kal.B) 

4  \t(kel.k)  tjtk/k) 

5  «9-  W* 

6  i*.  m 

(Continued) 


Storese 


etsee  la  l 


■! 


•tore  la  2 
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UIU  12  (Continued) 


Stap 

Computation  Ssquenca 

Operations 

Storage 

7 

Eq.  186 

q|  +  (of  qt  +  qf)H 

4 

8 

Bq.  190 

°i  qi  +  ^i  qi  +  qi 

“i  *  qi 

9 

Eq.  189 

qi  +  ®“i  qi  +  ^“i  +  ni 

store  in  4 

10 

Eq.  198 

store  in  S 

Total 

2n|0H3)  *  n*(l  +  8qt  +  H  +  qt  N) 

3n^  + 

+  ni^qi  N  +  8qi  +  2qi^  +  qi  +  qi  +  qi 

+  q®  +  nt  qt 

H  ^  uuaber  of  subsystem*  q^  £  local  aaaturoent  dimension 

n;  £  local  system  dlaeocloc 


16SUL11 

trend*  in  Speed  of  Computer  Oparatlona 


Operation 

1968 
(p  »ec) 

im 

(u  »ec) 

1980 
(w  *ec) 

toad 

8 

1 

0.66 

Multiply 

10 

6 

886 

Divide 

15 

7 

5.61 

Add 

3 

? 

1.54 

Store 

8 

8 

0.44 

Increment 

Index 

Register 

■  8 

1 

0.33 

$-o  svetwsm 

The  question  of  how  to  attain  computation  efficiency  ha*  pit* ltd  a*ny  engineers  despite  th*  f*ct  that 
•any  attempts  ha**  b«an  and*  to  present  guideline*  a*  to  which  a  l  for  it  ha  la  th*  hast  (aoat  efficient).  The 
enawer  1*  attll  Imprecise.  tinea  it  depend*  on  factor*  toch  at  operational  computer  paraaatera  (instruction 
act,  word  length,  epu  time,  ate.).  t>rogt#**»tag  net  hod*  (single  or  double  prectaioo,  llnaar  or  eoUt- 
dlaanalon  arraya.  exploitation  of  ayaaatrlc  and  «par«e  aatrlcai,  ate.),  th*  aica  and  complexity  (cross* 
coup l in*)  of  th*  transition  matrix,  and  methods  cf  proca** lap  measurement  data  (simultaneous,  subgroup, 
sequential ,  dacaatralieed.  ate.). 

The  purpose  of  thta  paper  la  to  provide  en  order-magnitude  approximation  oo  computational  requirements 
Of  vtrlou*  filtering  algorithms  without  making  eny  specific  recoemendatlene  aa  to  which  one  la  th*  “best". 
Results  are  given  In  tabulated  fora  (Tablet  1- If) •  la  ualrg  the»e  teblaa,  caution  mut  be  exerclaed  (at* 
pectetly  when  comparisons  are  a*d*  among  algorithm*)  ainc*  they  are  not  •  and  cannot  be  •  emptied  on  a 
unifora  beat*.  For  example,  Herman's  SUIT  and  Factor  I  ted  filter*  are  designed  for  the  protesting  of 
taquentlel  aeeauraaent  data  of  a  aero  order  dynnalc  ayatea;  the  Partitioning  ftltar  la  designed  to  deal 
with  unknown  paraaatera  aa  well  ea  ttete  vttlfastlon,  hence  thie  atgomtws  te  efficient  in  the  sense  that 
e  separate  adaptive  routine  la  not  needed.  Th*  Decentralised  fllter'i*  aoat  appropriate  for  targe-acala 
but  decomposed  eubsyatteu  application;  it  1*  efficient  tn  the  aeoee  that  computer  operatloae  are  leae  for 
e  set  of  subsystem*  than  that  required  for  the  aggregate  ayatpa.  Thar* (ore,  users  of  theat  algorithm*  are 
advleed  to  perform  ccet-effectlveneta  trade-off  etudiea  according  to  given  situations  -  before  deciding 
which  algorttba  to  has  eat  acted,  ft  le  hoped  that  this  paper  dap*  preside  sufficient  laferaettoe  for  such 
trade-off  atudiea. 
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SUMMARY 


laplenental  design  of  real-time  estimators  is  viewed  as  ’.he  napping  (mechanization)  of  estlaatlon 
algorithms  into  n  software  structure  designed  to  function  in  a  real-tine  environment.  This  means 
nee  ting  real -tine  constraints  of  the  problea  and  overcoming  the  liaitationc  of  hardware  while  attaining 
the  specified  functional  performance.  Use  of  a  microprocessor  or  a  distributed  system  of  small  computers 
as  hardware  imposes  additional  constraints  on  mechanization  of  real-tire  estimators:  the  "smallness" 
of  individual  processing  elanwnts,  arranged  in  a  loose  federation,  must  be  overcome. 

Distributed  hardware  architecture  is  suitable  for  a  class  of  real-time  control  systems,  built 
around  an  estimator,  that  do  not  require  sophisticated  and  centralized  management  of  resources.  Such 
systems  are  designed  to  work  under  predetermined  maximum  loading  and  are  characterized  aa  being  boundedly 
loadable.  Small-  to  mediusi-scale  estimation  schemes  used  in  navigation,  guidance,  and  control  applications 
typically  belong  to  this  class. 

The  article  proposes  the  use  of  euitlprogrammed  processing,  supporting  execution  of  concurrent 
processes,  as  a  suitable  processing  environment  for  real-time  estimators  which  are  to  be  Implemented  on 
the  above  described  hardware.  Concurrent  processing  is  presented  as  the  basic  technique  for 
overcoming  the  hardware  limitations  and  for  meeting  the  real-time  constrslnta  of  the  estimation  problem. 

Design  of  a  decentralized  real-time  operating  syc  tea  for  controlling  multiprogrammed  execution  of 
processes  in  a  distributed  system  is  outlined  next.  This  operating  system  consists  of  autonomous, 
local  real-time  executives,  which  operate  under  fixed  allocation  of  resources.  Process  scheduling  and 
communications  are  discussed. 

The  established  structure  of  real-time  software  and  process  control  can  thun  be  utilized  to  mechanize 
estimation  algorithms  as  concurrent  processes.  Several  known  schemes  for  decomposing  a  Kalman  filter 
into  concurrent  processes  are  illustrated. 


I.  INTRODUCTION 
A.  TECHNOLOGICAL  PERSPECTIVE 


During  the  last  decade,  adoption  c f  mlcroprucenaors  and  distributed  systems  of  micros  as  hardware 
for  implementation  of  real-time  estimation  tthemes  has  proliferated  the  use  of  Kalman  filtsra  in  neu 
and,  until  recently,  undreamed  applications.  The  new  hardwire  technology  end  improved  theoretical  and 
implementations!  understanding  of  Kalman  filtsra  have  made  many  of  these  applications  technically 
feasible.  Due  to  dramatic  decrease  in  the  cost  of  hardwire,  Kalman  filters  -  even  in  applications  for 
which  a  few  years  ago  only  s  very  crude  estimator  would  have  been  considered  -  have  become  economically 
attractive.  In  many  newly  emerging  real-time  control  systems  built  on  the  methodology  of  artificial 
Intelligence,  Kalman  filters  will  continue  to  play  a  central  role.  The  new  artificial  intelligence 
techniques  will  enable  the  system  to  "see"  and  perhaps,  to  some  extent,  interpret  its  own  environment. 
However,  the  need  for  a  mechanism  which  would  estimate  the  state  of  the  system  will  continue  to  persist. 

In  fact,  such  future  systems  will  place  even  r.lgher  technical  demands  on  estimation,  partly  due  to 
Increasing  availability  of  smart  sensors  (designs  cf  which  also  exploit  the  microprocessor  and  special 
chip  technologies)  and  of  expanding  variety  of  measurements  from  such  sensors. 

Uc  complement  the  above  Introductory  philosophical  remarks  by  quoting  R.K.  Smyth  1 Smy J ,  who  states 
that  “greet  changes,  propelled  by  microprocessor  technology,  are  sweeping  avionics  and  control”  and 
that  they  will  continue  in  the  deci-.e  ahead.  He  further  Identifies  the  disciplines  and  technologies 
which,  according  to  him,  will  most  likely  Influence  the  next  generation  of  systems.  They  are  "NAVSTAR/GPS, 
ring-laser  gyroe,  alr-tvefflc-control  update,  digital  fly-by-wlre,  full-authority  digital  propulsion 
control,  flat-panel  display*  and  Integrated  date-control  centers,  modular  and  distributed 
avionics  a.chltecture,  cost  of  airborne  data  processing,  digital  busta  Including  potential  of  fiber 
optics,  modern  control  theory  Including  Kalman  filter*  and  optlmal-stat*  estimation,  direct  digital 
aynthaals,  high-level  software  languages,  VLSI/VHSI  circuits,  and  actuation  and  power-gene ration  device* 
maul*  possible  by  new  rare-earth  magnetic  materials." 

Successful  integration  of  such  diverse  factor*  requires  *  multidisciplinary  approach  if  th*  potential 
synergies  le  to  be  exploited.  This  requires  that  each  apeciallst  member  of  *  multidisciplinary  dsaign 
tea  understand  th*  other  disciplines  affecting  his  aspect  of  design.  At  an  example,  consider  a  control 
specialist  who  le  a  member  of  the  team  entruated  with  the  design  of  a  real-time  estimation  schema. 

Such  a  person  should  b*  capable  of  not  only  performing  hi*  traditional  functions  (such  a*  select  it*  an 
appropriate  system  modal  and  th*  right  estimation  algorithms)  but  also  stating  the  functional  requirements 
for  sensor*  (who**  meesutemente  th*  estimation  scheme  is  to  use)  or  specifying  tbs  mechanisation  of 
estimation  algorithms  in  th*  form  of  real-time  software.  Current  digital  technology  sake)  designing 
smart  sensors,  capable  of  aiding  th*  sstlmstlon  process,  s  practically  attainable  goal.  (Thsst  ar* 
the  sanaors  capable  of  producing  not  only  th*  basic  sstlmstlon  measurements  but  also  tome  system  identif¬ 
ication  date.  Having  such  extra  Information  continuously  furnished  to  the  estimator  may  preempt  the 


formidable  problem  of  determining  the  parameter*  of  noise  statistics  under  severe  timing  constraints.) 
In  the  same  vein,  understanding  the  possibilities  offered  by  recent  Ideas  In  operating  system  theory 
and  software  engineering  may  greatly  facilitate  the  transformation  of  the  mathematical  model  of  an 
estimation  scheme  (l.e.,  of  estimation  algorithms)  into  robustly  working  real-time  software.  This 
latter  aspect  of  real-time  estimator  development  la  the  main  topic  of  the  present  exposition. 

B.  IMPLEMENT  AL  DESIGN  OF  ESTIMATORS 


Development  of  Kalman  filters  (or,  more  generally,  of  recursive  estimators)  for  real-time  applications 
roughly  involves  three  aspects  of  design:  system  modeling,  algorithm  design,  and  lmplemental  design. 

System  modeling,  although  critically  important,  is  very  dependent  on  the  problem  at  hand  and  so  it  is 
not  discussed  In  the  sequel.  In  a  wide  sense,  algorithm  design  addresses  not  only  the  design  of  kernel 
estimation  algorithms  but  also  the  design  of  complementary  procedures,  such  as  one  whose  function  is 
to  detect  and  then  respond  to  detected  nonwhite  noise  in  measurements.  Algorithm  design  Is  extensively 
covered  in  current  literature.  The  same  cannot  be  said,  however,  about  the  lmplemental  design,  l.e,, 
about  the  process  of  mapping  the  algorlthsu  Into  a  system  of  software  procedures  which,  when  executed 
on  some  target  equipment,  will  interact  correctly  with  the  environment  and  among  themselves  and  also 
will  satisfy  the  real-time  constraints  of  the  problem.  One  possible  reason  for  this  paucity  of  attention 
in  literature  to  implements',  design  of  real-time  estimators  is  that  it  cannot  be  discussed  in  a 
mathematically  concise  way.  The  other  reason,  which  is  probably  more  fundamental,  is  that  It  has  been 
viewed  as  an  exclusive  domain  of  programmers. 

C,  SCOPE  AMD  OBJECTIVES 


The  present  article  addresses  estimation  and  control  system  specialists  who  mainly  are  experts  In 
system  modeling  and  algorithm  design  but  who  also  would  like  to  learn  more  about  transforming  their 
designs  into  working  real-time  schemes.  As  It  will  become  evident  from  the  sequel,  such  a  control 
specialist  must  depend  on  system  programmers  or  on  software  engineers  who  also  are  members  of  the 
design  team,  for  ^vice  and  contributions.  Since  the  design  team  Is  often  led  by  the  control  specialist, 
it  is  important  that  he  know  how  to  communicate  with  his  software  counterparts  and  atate  the  requirements 
for  teplamtntal  design.  Thus,  one  of  the  main  objectives  of  the  present  article  is  to  help  a  control 
specialist  acquire  the  technical  background  for  performing  these  functions. 

The  level  of  the  present  article  is  Introductory:  we  assume  that  the  control  specialist  know 
little  about  the  design  of  real-time  software,  especially  about  its  processing  environment. 

Hence,  in  the  sequel,  we  do  not  discuss  a  wide  spectrum  of  design  possibilities  In  generality  but 
rather  concentrate  on  a  few  approaches  that  have  been  experimentally  proven  to  work,  explaining  them 
sometimes  In  detail.  He  feel  that  this  trill  enhance  the  tutorial  value  of  the  exposition.  One  aspect 
of  lmplemental  design,  which  Is  stressed.  Is  the  real-time  process  control  environment  for  estimation 
algorithms.  However,  this  la  not  Intended  to  be  a  general  exposition  of  real-time  operating  systems, 
so  hut*  again  we  take  a  narrow  path  through  the  labyrinth  of  issues  concerned  with  operating  systen 
and  software  engineering  design. 

He  further  narrow  the  scope  of  discussion  to  real-time  estimator  design  for  Implementation 
on  distributed  microprocessor  systems,  although  we  use  the  term  “microprocessor"  generlcally:  It 
refers  to  almost  any  small  computer.  However,  this  restrictive  assumption  about  hardware  has  Implication* 
on  the  class  of  control  systems  considered. 

0.  BOUNDBDLY  LOADABLE  VS.  NONSATURABLE  SYSTEMS 


Currently,  distributed  systems  of  Aasll  computers  sre  used  mainly  for  small  to  madlum  alas  control 
systems.  Examples  of  such  systems  built  around  estimators  sre  navigation,  guidance,  end  flight  control 
systems,  t*  well  as  various  types  of  artificial  Intelligence  systems  in  which  tbs  dynamics  of  system 
state  can  be  modeled  stochastically  and  must  be  estimated.  We  refer  to  such  systems  as  bounded ly 
loadable j  l.e.,.  they  are  a  priori  deelgned  to  take  a  certain  maximum  loading,  perhaps  In  the  form  of 
fixed  maximal  cates  for  measurement  Inputting  end  procewslng.  We  do  this  for  two  reasons:  (1)  typically, 
such  control  ays  tree  do  not  require  sophisticated  resource  allocation  algorithms  and  ao  they  are  eaeler 
to  understand;  and  (2)  they  ace  the  systems  moat  likely  to  be  Implemented  on  the  computers  of  tha  type 
c one  ids  red  here. 

In  contrast,  we  cvuld  also  have  considered  estimator  design  for  large-scel*  control  system*  which 
ar«  smart  enough  to  select  Intelligently  any  additional  loading  past  a  nsar-saturstton  point  so  a*  not 
to  overload  themselves  and,  at  tha  same  time,  to  perform  their  mission  in  some  optimal  way.  For 
convenience,  we  shall  call  them  nonsaturable  systems*  Various  iavge -scale  air  traffic  control  and 
defense  ./stems,  dlecuseed  In  literature  during  the  sixties  sat  seventies,  are  example*  of  tht  Utter 
type  of  control  system*.  A. though  many  such  yystema  use  estimator*  as  kernsl  algorithms,  they  would 
Hpt  (oven  nowadays)  be  l*:. 'waved  on  distributed  system*  of  small  computer*.  More  likely,  multiprocessor 
architecture*,  augmented  v*ch  special  (such  as  parallel)  processor,  would  be  applied  to  tuch  a  problem. 
They  would  also  require  the  use  of  complex  process  tug  resource  allocation  algorithms  and,  thue,  of  a 
very  different  philoeophy  o'*  rex '-time  processing  control  from  the  one  presented  In  the  sequel.  The 
use  of  eophistlcaemd  resource  n  '  station  probably  would  make  central  Usd  processing  control  preferable. 
Such  processing  control  scheme*  as-'  not  only  more  difficult  to  understand  but  also  such  more  difficult 
to  validate.  Instead,  wa  shall  describe  whet  could  be  characterised  «.*  a  fixed  allocation,  decentralised 
processing  control  schema,  l.e.,  decan tr* Used  over  the  processing  elements  o'  a  distributed  system. 

The  total  state  of  a  nonsaturable  system  is  partly  defined  In  tens  of  Ch*  state*  of  N  object* 
currently  handled  (processed)  by  the  systen.  For  example,  if  each  object  represent*  *  tracked  aircraft, 
the  state  of  an  object  may  eonalst  of  the  description  of  Ur  position,  velocity ,  sod  .same  parameters. 

An  the  number  of  handled  object*  increase*  sod  a*  t  nonsaturable  system  approaches  its  saturation 
point,  the  system  must  decide  (typically  on  the  basts  of  soma  risk  faction)  which  present  object* 
could  be  deleted  *t  t  minimum  risk  in  order  to  create  proceesius  .opacity  for  newly  incoming  and 


possibly  more  critical  objects.  Often  the  estimation  procedure  for  all  N  objects  Is  nearly  Identical. 
In  such  a  tltuatlon,  the  use  of  large  scale  computers  with  parallel  or  vector  pipeline  arithmetic 
units  Is  attractive.  This  leads  to  estimators  which  differ  In  structure  and  mechanization  frost  those 
considered  In  the  present  exposition,  for  such  large-scale  coaputers  can  be  best  exploited  If  all  K 
parallel  estiaation  processes  are  centrally  controlled. 

In  contrast,  a  boundedly  loadable  system  Is  a  priori  designed  to  handle  no  aore  than  B  objects, 
where  B  Is  a  fixed,  saall  positive  integer.  Simple,  fixed  allocation,  decentralized  real-tloe  process 
control  scheaes  ere  effective  for  boundedly  loadable  estiaation  systeaa. 

E.  OUTLINE 


Typical  microprocessor  system  architectures  under  consideration  consist  of  small  generic  building 
blocks,  such  as  alcrocoaputers,  stand-alone  direct  access  aeaory  units,  and  digital  Interface  units, 
the  latter  needed  for  communications  among  alcrocoaputers  or  with  the  "outside”  world.  These  building 
blocks  are  Interconnected  by  neans  of  bus  systems  and  direct  access  global  aeaory  units.  Smyth  [Say] 
mentions  the  DAIS  ani  the  Draper  Laboratory  Fault-Tolerant  Multiprocessor  as  prototypes  of  such  new 
architectures.  We  shall  loosely  refer  to  such  architectures,  forned  from  "small”  building  blocks,  as 
distributed  microprocessor  systems.”  Consequently,  we  begin  (in  Section  II)  by  examining  the  lapllcatlons 
of  microprocessor  use  on  algorithm  design  and  Implementation.  Then  (In  Section  III),  we  proceed  to 
the  problea  of  controlling  in  real-tiae  the  coaputatlonal  processes  resulting  froa  the  software  organization 
recommended  in  Section  II.  Next  (in  Section  IV),  we  review  current  practices  of  Kalman  filter  design 
for  real-time  applications  without  paying  too  auch  attention  to  the  complications  Imposed  by  the  hardware 
under  consideration.  The  next  section  (Section  V)  addresses  the  problems  arising  due  to  hardware 
limitations  and  timing  constraints.  It  Illustrates  techniques  for  overcoming  many  of  these  problems 
by  decomposition  of  filter  algorlthas  into  concurrently  executable  procedures.  A  summary  follows  In 
Section  VII.  Three  appendices  follow  at  the  end:  Appendix  A  suaurlzea  the  standard  Kalman  filtering 
equations;  Appendix  B  reviews  the  so-called  U-D  factorization  algorlthas  for  a  Kalman  filter;  and 
Appendix  C  states  the  estiaation  problea  is  GPS  navigation,  which  Is  referred  to  In  several  examples 
throughout  the  text. 

II.  PROGRAMMING  FOR  REAL-TIME  DISTRIBUTED  SYSTEMS 

A.  HARDWARE  ARCHITECTURE 

For  the  development  of  our  oaln  theae,  we  need  a  generic  model  of  hardware  architecture.  Thus, 
we  view  distributed  systems  considered  here  as  built  froa  constituent  computers  by  Interconnecting 
these  computers  either  through  shared,  directly  accessed  global  aeaory  units  or  through  data  links 
(busea).  These  links  aay  vary  In  speed,  parallelism,  and  length.  Obviously  some  distributed  systeaa 
use  both  typos  of  Interconnections,  l.e.,  direct  access  global  aeaory  units  as  well  as  buses. 

For  our  purposes,  a  constituent  processing  aleaent  of  any  distributed  system  under  consideration 
Is  assumed  to  be  a  small  computer,  not  necessarily  a  “microcomputer"  In  the  strict  sense  of  the  tera, 
which  consists  of  a  CPU  (an  Instruction/arithmetic  processing  unit)  and  Interconnection  ports /devices. 

It  usually  has  some  local  (or  private)  memory  and  aay  also  have  apeclal  devices,  such  ss  a  floating 
point  arithmetic  unit,  connected  to  It,  In  the  sequel,  such  a  constituent  computer  of  a  distributed 
system,  regardless  of  tfiethtr  it  Is  a  microcomputer  or  a  computer  of  soma  different  type,  la  called  a 
processing  element  (PB).  As  a  special  case  considered,  the  entire  distributed  system  aay  consist  of  a 
single  PB.  If  a  PE  accesses  both  global  and  local  memories,  then  its  Instruction  address  space  must 
possess  facilities  to  address  both  types  of  meaorlts. 

Another  building  block  needed  for  the  assuasd  model  of  distributed  systems  Is  a  global  data  memory 
unit.  "Data”  refers  hers  to  the  reed-vrlte  and  randoa  access  properties  of  auch  memory,  while  "global” 
emphasises  that  auch  a  memory  unit  la  accessible  free  at  least  two  processing  elements.  In  constrsst, 
a  processing  element  may  have  two  types  of  local  memory:  local  data  memory  for  storing  the  problem 
data  which  does  not  have  to  be  common  Seated  to  other  processing  element*  aid  local  program  memory  of 
the  read-only  type  for  storing  the  Instructions  of  programs  reaidtng  in  that  processing  element.  In 
our  model  of  a  distributed  system,  executable  Instructions  of  a  program  are  always  aseuaed  to  be  stored 
In  the  local  memory  of  a  processing  element. 

As  an  aside,  many  Ideas  presented  In  the  sequel  apply  to  hardware  architectures  more  general 
than  one  juat  Introduced.  For  Instance,  wm  could  have  considered  hierarchical  distributed  systems 
In  which  soma  processing  element*  thamaelvea  are  distributed  eystamt;  or  distributed  systems  each 
processing  element  of  which  la  a  multiprocessor  system  by  ltaalf,  consisting  of  several  CPU*  inter¬ 
connected  by  common  acceaa  memory  modules.  But  the  generic  model  introduced  earlier,  which  la  simpler 
and  leas  centralised  than  the  archltecturaa  mentioned  in  the  present  paragraph,  can  be  affectively 
used  with  a  relatively  simple,  fixed  allocation  operating  system.  Furthermore,  it  1»  sufficient 
for  the  applleetlona  considered  here.  Hence,  it  will  be  assumed  In  the  sequel. 

I.  PROCESSES  EXECUTED  IN  A  DISTRIBUTED  3TSTBM 


The  notion  of  a  (computet tonal)  process  la  fundamental  In  modern  theory  of  operating  systems  and 
la  discussed  In  recant  texts  on  operating  systems  and  system  programming  (e.g.,  [Cof  73),  (Fra),  [Cra], 
and  (Han)).  It  la  also  used  extensively  In  the  sequel  to  explain  our  model*  of  real-time  software  sad 
real-time  process  control.  The  term  “process'  will  be  used  to  describe  the  behavior  or,  aay,  the 
dynamics  of  s  computer  program  that  la  stormd  (or  Is  ‘residing”)  in  a  computer. 

Freeman  ((Fra),  p.  10S)  further  elaborate*  this  concspt  by  noting  that  “a  program  specif!**  * 
single  sequence  of  actions.  When  we  admit  the  possibility  of  a  program  stopping  before  it  finishes 
execution  (to  welt  for  ■  signal  or  because  the  processor  has  been  taken  away  froa  It,  for  example),  we 
must  associate  with  each  program  In  execution  some  information  that  record#  lta  current  atate  (what 
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the  value  In  the  Instruction  counter  lst  where  the  prograa  Is  In  memory,  and  so  on).  The  concept  of  a 
program  being  "In  execution,"  but  not  necessarily  being  executed  at  a  particular  moment,  Is  what  we 
shall  now  term  a  process.  Operationally  In  a  system,  a  process  Is  a  prograa  specifying  a  sequence  of 
actions  and  the  associated  Information  that  represents  Its  current  state." 

Thus,  a  single  process  Is  just  a  generalisation,  or  an  abstraction,  of  a  processor  moving  through 
the  text  of  a  computer  program.  However,  one  additional  idea  is  Implied:  there  exists  a  controlling 
mechanism  (an  operating  system)  which  is  keeping  track  of  the  state  of  the  processing  resources  (hardware, 
data,  and  programs)  being  used  by  or  reserved  for  the  executing  programs.  Keeping  track  of  the  state 
becomes  Important  If  the  execution  of  a  program  can  be  temporarily  Interrupted  by  the  operating  system 
in  favor  of  another  waiting  program. 

Analogous  to  finite  automata  theory,  it  is  convenient  to  characterise  a  process  by  describing 
the  process  states  which  such  a  process  may  attain  and  by  defining  the  rules  that  govern  transitions 
among  these  states.  When  a  prograa  is  In  execution  or  is  considered  by  the  operating  system  to  be 
scheduled  and  executed,  we  say  that  the  process  created  by  that  prograa  is  in  sn  active  state;  otherwise, 
l.e.,  the  process  Is  said  to  be  in  an  Inactive  state.  Typically,  there  is  only  one  Inactive  state. 

The  concept  of  a  process  Is  a  powerful  tool  for  understanding,  modeling,  and  designing  a  computer 
system  In  which  several  programs  reside  in  memory  simultaneously  so  that  the  central  processor  of  the 
computer  (which  in  our  case  is  a  processing  element  of  the  distributed  system)  keeps  switching  among 
then  according  to  some  scheme  or  schedule.  In  such  a  situation,  processes  created  by  these  programs 
are  aald  to  be  executing  concurrently ,  l.e.,  Interleaved  In  time.  The  resulting  processing  environment 
Is  called  nultlprogr earned  processing.  This  Is  the  processing  model  which  we  assume  for  estimation 
schemes  executed  on  a  distributed  system,  or  even  on  a  single  processing  element. 

When  a  distributed  system  is  operating,  several  processes  may  be  executed  concurrently. 

Concurrent  execution  of  processes  can  occur  in  several  ways.  On  a  single  processing  element  of  a 
distributed  system,  for  example,  processes  may  be  executed  interleaved  In  time  by  dividing  the  time 
line  into  segments  and  alternating  the  processes  among  segments  such  that  the  process  vrtilch  starts 
execution  at  the  start  of  a  segment  is  interrupted  at  its  end,  at  which  time  the  use  of  the  processor 
13  passed  to  another  process.  Thus,  concurrent  processes  may  occur  under  multiprogramming,  even  on  a 
single  processor.  Another  form  of  concurrent  process  execution  in  a  distributed  system  arises  when 
processes  are  executed  simultaneously  and  possibly  asynchronously  in  several  processing  elements  of 
the  system. 

The  hardware  facilities  needed  for  data  communications  among  two  communicating  concurrent  processes 
depend  on  where  these  processes  are  executed.  If  they  are  executed  in  interleaved  time  fashion  within 
the  same  processing  element,  the  local  data  memory  unit  of  that  processing  element,  provided  that  such 
a  unit  exists,  may  be  used  for  communications  smong  processes;  otherwise,  t  global  data  memory  unit, 
accessible  from  that  processing  element,  ia  needed.  On  the  other  hand,  if  two  communicating  concurrent 
processes  are  executed  on  different  processing  elements,  then  two  (not  necessarily  mutually  exclusive) 
possibilities  exist:  either  s  mutually  accessible  global  memory  wit  is  available  or  data  links 
(buses)  interconnecting  the  syeten  are  needed. 

Designer  of  a  real-time  estimation  (or,  more  generally,  of  a  realtime  control)  system  ia 
responsible  for  structuring  his  algorithms  so  that  they  result  in  computational  processes  which 
properly  cooperate  smong  themselves  in  real-time;  i.e.,  they  are  appropriately  synchronised  and 
can  correctly  communicate  (exchange)  data.  Selection  of  an  appropriate  programming  language  and 
access  to  convenient  real-time  process  management  utilities  (the  latter  to  be  furnished  by  the  real-time 
operating  system)  will  make  designer's  work  easier.  Still,  in  order  to  be  assured  about  the  lmplewenta- 
blllty  of  design,  he  suet  conceptually  understand  the  logical  consequences  of  process  synchronisation 
and  coamunicatlon  requirements  and  must  be  cspsble  of  translating  these  requirements  into  his  design. 

The  purpose  of  Section  III  in  the  sequel  is  to  introduce  process  management  concepts  and  tachnlquas 
that  have  proven  themselves  in  implementation  of  real-time  estimation  schemes. 

c.  software  model 


Workload  partitioning,  algorithm  scheduling,  and  memory  siting  are  critical  tasks  in  design  of 
real-time  control  (in  our  caee,  estimation)  systems  for  lmplemantatlon  under  severe  processing  constraints. 
These  teaks  are  begun  early  in  the  development  cycle,  usually  during  preproposal  time  investigations, 
and  art  reiterated  many  times  thereafter  until  s  design  that  satisfies  requirement  specifications 
emerges.  The  capability  to  transform  algorithms  into  working  real-time  software  is  the  key  to  success 
In  this  endeavor.  This  process  of  lmplsnental  design  Is  greatly  helped  by  having  s  suitable  model  of 
software  architecture.  As  will  became  evident  in  Section  III,  such  s  model  is  also  needed  for  design 
of  s  rsal-time  operating  system. 

The  software  model  introduced  next  in  the  sequel  is  intended  for  s  bounded ly  loadable  system 
«dilch  is  to  be  implemented  on  a  distributed  system  of  small  computers  or  even  on  s  single  microprocessor . 

This  model  assumes  that  some  extended  fora  of  FORTRAN  Is  used  as  the  programming  language.  It  is  mentioned 
later  in  the  sequel  how  a  software  modal  for  sn  ALGOL- like  language,  auth  aa  ADA,  would  differ  from 
the  modal  to  be  introduced  next. 

M008U 

a.  The  entire  real-time  applications  software  implementing  sn  estimation  scheme  is  partitioned 
into  s  set  of  programs  and  date  sets. 

b.  Bach  prograi  contalua  a  main  procedure  and  may  alao  contain  aubprograma  (subordinate  or  lower- level 
procedures). 

c.  Each  program  can  create  precisely  one  real-time  procase  (which  may  remain  inactive),  ao  there 
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is  one-to-one  correspondence  between  programs  and  processes.  (This  assumption  is  not  too 
restrictive  for  boundedly  loadable  systems  implemented  on  small  computers.  Besides,  the 
resulting  implementation  can  be  more  easily  understood  and  tested  than  that  of  a  system  with 
no  restrictions  on  the  extent  of  reautrancy.  Here,  a  program  unit  is  said  to  be  reentrant 
if  it  may  be  shared,  or  executed  concurrently  by  several  processes  at  a  time.) 

d.  Subprograms  of  the  following  two  types  are  admissible: 

a  Private  subprograms  -  such  a  subprogram  belongs  to  a  single  program  and,  by  Assumption  c, 
to  a  single  process,  because  It  may  be  invoked  only  from  that  program; 

#  Shared  subprograms  -  such  a  subprogram  may  be  invoked  from  several  programs  and,  thus, 
used  by  the  processes  generated  by  these  programs.  Shared  subprograms  must  be  implemented 
as  reentrant  procedures,  but  only  as  reentrant  procedures  of  a  priori  known  maximum  concurrency. 

e.  Fixed  allocation  of  programs  to  a  processing  element  is  used:  at  design  time,  each  program 
is  assigned  to  a  single  processing  element  and  Is  never  "split"  between  several  processing 
elements.  In  the  event  that  software  redundancy  can  be  tolerated,  the  present  assumption 
does  not  forbid  assigning  replica  of  a  program  (or  of  a  subprogram)  to  several  processing 
elements,  or  even  to  a  single  processing  element. 

f.  With  Assumptions  c  and  d,  data  sets  of  applications  programs  can  be  divided  into  three  hierarchical 
globality  levels:  Interprocess  (Interprogram)  conaunlcatlon  data  sets;  Intraprocess  (Intraprogram) 
communication  data  sets;  and  local  data  sets.  For  example,  rtien  FORTRAN  (possibly  extended 

to  handle  limited  reentrancy)  is  used  as  a  programming  language.  Interprocess  and  Intraprocess 
communication  data  sets  are  implemented  as  ordinary  labeled  COMMON  blocks;  local  data  sets, 
as  locally  declared  data  with  special  provisions  made  for  handling  local  variables  in  reentrant 
procedures.  Furthermore,  variables  and  constants  are  always  placed  Into  separate 
data  sets.  In  order  to  distinguish  between  two  types  of  constants  --  (i)  physical  constants, 
such  as  the  speed  of  light  or  the  equatorial  radius  of  a  reference  ellipsoid,  and  (li)  design 
parameters,  such  as  the  length  of  a  state  vector,  which  may  change  as  the  design  progresses, 
but  become  constants  by  the  time  it  is  completed  —  they  are  put  into  separate  data  Bets. 

Although  the  use  of  an  ALGOL-like  programming  language,  such  as  ABA  (Weg),  Instead  of  FORTRAN 
would  hardly  perturb  the  model  described  above,  it  would  affect  the  format  and  the  structure  of  the 
source  program.  First,  the  nested  block  structure  of  ALGOL-like  languages  such  as  AhA  would  facilitate 
hierarchical  nesting  of  procedures  (private  to  a  program)  and  the  corresponding  nesting  of  data  sets 
according  to  the  globality  (scope)  of  data  access.  Furthermore,  ABA  provides  basic  language  constructs 
for  defining  and  Implementing  process  control  and  synchronisation  mechanisms,  such  as  the  mechanism 
for  synchronised  communications  among  concurrent  processes  discussed  la  Section  III.  (For  more  information 
on  this  subject,  refer  to  discussion  of  multitasking  In  (Weg}). 

D.  IMPLICATIONS  ON  ALGORITHM  DESIGN 


Adoption  of  distributed  microprocessor  systems  at  hardware  has  far-reaching  Implications  on  algorithm 
design.  In  order  to  meet  the  real-time  response  constraints  while  r.ot  exceeding  the  throughput  capacity 
of  individual  processing  elements,  the  entire  estimation  procedure  must  be  partitioned  into  concurrently 
executeble  and  Intaractlng  processes.  The  main  Implications  of  the  resulting  workload  partitioning 
are  that  some  algorithms  will  be  decomposed  Into  a  set  of  concurrently  executable  smaller  algorithms, 
the  functional  performance  of  tdilch  will  not  be  as  good  as  that  of  the  original  algorithm. 

In  the  sscond  part  of  the  present  exposition,  we  review  several  common  decomposition  schemes  for 
a  Kalman  filter.  It  will  eufflce  to  mention  at  this  point  that  for  a  recursive  estimator,  such  as  a 
Kalman  filter,  the  covariance  processing  and  the  computation  of  Kalman  gains  are  tlma-conaumlng  procedures 
on  a  microprocessor  and  so  nay  become  prohibitively  expensive  if  Che  microprocessor  has  no  hardware- 
implemented  floating  arithmetic  and  If,  consequently,  all  floating-point  computing  mutt  be  performed 
in  interpretive  form.  On  the  other  hand,  there  may  exist  •  requirement,  or  just  a  need,  to  process 
the  incoming  measurements  st  a  rate  which  would  exceed  the  processing  capacity  of  a  processing  element, 
i.«.,  which  would  "bust"  its  tlms  Una.  As  explained  in  the  second  part  of  the  present  exposition, 
this  problem  has  several  solutions,  each  of  *>lch  results  in  auboptlaal  performance. 

According  to  our  earlier  characterisation  of  distributed  microprocessor  systems,  it  would  seam 
that  such  a  system  could  be  Incremented  In  small  steps  by  adding  to  It,  on  the  bails  of  nsed,  processing 
elements,  global  data  memory  unite,  and  other  special  boxes.  Besides,  hardware  costa  art  relatively 
low.  Then  whet  ere  tb*  reasons  for  having  to  struggle  with  an  auatare  hardware  budget?  The  answer  to 
this  question  Is  that,  in  estimation  application?  considered  hers,  severe  constraints  ar~  typically 
Imposed  on  the  power  consumption,  vdume,  and  weight  of  equipment.  Besides,  If  the  equipment  aider 
consideration  is  to  be  manufactured  In  large  quantities,  even  small  aavlnga  in  coat  par  unit  count. 

Another  ramson  Is  the  ‘emsllneae*  of  individual  precasting  alamanta.  For  example,  a  single  element 
may  not  have  sufficient  throughput  capacity  to  aceotaaodate  a  Kalman  filter  with  a  minimally  acceptable 
rate  of  measurement  processing.  On  thm  other  hand,  wt  may  not  went  to  epllt  tht  filter  algorithm 
amnng  two  proceealng  alamanta  for  reasons  such  as  tht  complexity  or  degraded  performance  of  tho 
modified  algorithm. 

B.  partitioning  or  WORKLOAD 

During  e  design  cycle,  the  proceealng  workload  rseultlng  from  algorithms  Is  eevetal  tlmss 
rmpartltloned  into  concurrent  proceseae  until  e  satisfactory  partitioning  is  obtained.  We  eay  that  a 
workload  partitioning  is  acceptable  If  It  (1)  satisfies  the  overall  hardware  budget;  (li)  dote  not 
overload  individual  processing  slemsntat  (ill)  yields  minimally  required  or  better  execution  rates 
to  time-critical  algorithms;  (lv)  does  not  appreciably  degrade  the  functional  performance  of  algorithms 
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through  their  decomposition;  (v)  doe*  not  excessively  complicate  the  overall  structure  of  real-tine 
software;  (vi)  does  not  cause  too  auch  real-tlae  processing  overhead  through  introduced  concurrencies. 

It  is  difficult  to  satisfy  this  long  list  of  requirements  without  sone  compromises.  However,  the 
first  four  are  essential  to  the  lapleaentabllity  and  performance  of  design. 

Workload  partitioning  is  a  critical  design  issue  in  distributed  real-time  processing.  Thus, 
several  years  ago,  when  the  interest  in  such  processing  began  to  emerge,  attempts  wre  made  to  formulate 
the  principles  and  derive  the  algorithms  for  optimal  workload  partitioning.  Samples  of  work  from  that 
period  are  (Gyl  76]  and  [Jen].  Later  design  experience  showed  that  it  is  best  to  take  a  heuristic 
approach,  roughly  based  on  the  following  principles: 

(1)  Distribute  the  workload  over  the  processing  elements  (PEs)  so  as  to  minimize  the  level  of 
communications  among  the  processes  executed  on  different  FEs,  thus  minimizing  the  inter-PE 
data  traffic  and  the  use  of  global  data  memory,  while  not  overloading  individual  PEs. 

(2)  Decompose  the  workload  assigned  to  a  PE  into  processes  so  that  the  tasks  that  must  run  at 
about  the  same  rate  and  can  be  executed  at  about  the  same  time  are  assigned  to  the  same 
process  an!  thus  are  implemented  as  part  of  the  same  program. 

(3)  Guarantee  that  all  time-critical  algorithms  (in  their  original  or  in  a  decomposed  form)  will 
execute  at  acceptable  rates. 

Ideally,  hardware  requirements  should  be  stipulated  as  a  byproduct  of  completed  workload  partitioning, 
but  in  practice  this  is  seldom  the  case. 

It  should  be  evident  from  the  foregoing  that  quantitative  prediction  of  proceasor  and  memory 
loadings  must  be  obtained  for  each  candidate  partitioning  before  it  can  be  evaluated.  Thus,  the  activity 
called  ‘software  timing  and  sizing,*  is  very  critical. 

P.  PREDICTION  OP  PROCESSOR  AND  MEMORY  LOADINGS 


Real-time  system  designers  often  refer  to  the  task  concerned  with  the  prediction  of  processor  and 
memory  loadings  as  software  timing  and  sizing  (STAS).  This  task  produces  input  data  for  workload 
partitioning  and  also  evaluates  candidate  partitioning  schemes.  So  it  is  reiterated  several  time* 
during  the  development  cycle  of  a  real-time  system. 

Load  prediction  requires  auch  clerical  effort  (compilation  and  tabulation  of  input  data,  performance 
of  arithmetic,  and  generation  of  reports).  Hence,  these  tasks  must  be  computerized.  After  this  has 
been  dons,  tha  effort  la  essentially  reduced  to  derivation  of  execution  timing  and  memory  occupancy 
data  for  individual  modules  of  the  currently  tentative  software  model.  One*  the  ST&S  data  base  has 
been  established  by  entering  in  it  the  initial  timing  and  sizing  estimates  for  individual  modules, 
from  than  on  it  needs  only  to  be  updated  at  better  estimates  become  avsllabls. 

Even  the  derivation  of  execution  timing  and  meeory  occupancy  for  individual  modules  can 
be  expedited  by  means  of  automated  data  processing  techniques.  As  an  example,  tha  following  methods 
have  been  found  to  be  useful  i 

(1)  Extending  the  compiler  of  the  programming  language  to  enable  it,  as  a  byproduct  of  compilation, 
(i)  to  aagment  the  source  code  into  blocks,  each  block  snding  with  a  brand:  operation,  then 

to  time  eech  block  while  using  en  Inputted  timing  model  of  the  tsrget  machine;  and  (it)  to 
eetlmate  the  memory  requirement*  of  each  module  of  executable  code  and  of  eech  data  set 
defined  in  the  source  code. 

(2)  Using  an  Instruction-level  simulator  to  derive  a  timing  model  of  each  program,  based  on  the 
a  priori  inputted  probabilities  or  on  the  experimentally  observed  frequencies  of  vsrloue 
execution  paths. 

With  the  lnfoimetlon  provided  by  (1)  and  (il)  of  Hethod  1,  an  experienced  enalyet  can  quickly 
time  the  moat  critical  execution  path*  end  compute  the  aewory  requirements  of  hi*  software  design. 

Method  2  produces  a  probabilistic  model  of  processor  loading  ind  data  memory  occupancy,  and  so 
Its  outputs  in  s  ssnse  complemsnt  those  of  Method  i. 

After  either  (1)  or  (2)  has  been  accomplished  the  first  time,  the  obtained  processor  timing  and 
memory  siting  data  can  b*  entered  In  the  software  model  date  bass.  Thereafter,  a*  the  design  progresses 
toward  maturity,  this  modal  and  the  processor  and  memory  loading  predictions  for  it  need  only  to  he 
iteratively  refined. 

The  t echo iq mas  illustrated  aboys.gr aseppos*  the  availability  of  some  source  code.  Often,  the 
source  code  of  key  algorithms  becomes  Available  early  during  tha  davalopmtnt  cycle  and  esn  b*  used  to 
bootstrap  the  timing  and  ailing  procass  by  means  of  th*  above  described  techniques.  Typically,  these 
algorithm*  are  programmed  early  la  tha  high-level  language  of  the  ultimate  real-time  cods  for 
performance  analysis  simulations  on  a  large  computer  in  oonreal-timn  mods. 

In  the  event  that  th*  source  cod*  of  key  algorithms  is  not  available  when  the  timing  and  siting 
process  must  b*  bootstrapped,  on*  may  resort  to  mathematical  timing  and  sizing  modal*  of  kay  algorithms. 
Por  a  Kalman  filter  or  a  similar  estimator,  such  a  modal  1*  formulated  in  term*  of  the  state  vector 
end  measurement  vector  length*  t  it  predict*  th*  timing  in  term*  of  basic  arithmetic  operation  counts 
and  th*  memory  occupancy  in  terms  of  memory  needed  to  store  th*  principal  vector*  and  metric**, 
lafarancea  [May],  [Tho],  and  (tie]  coutain  such  loading  prediction  model*  for  th*  estimation  algorithms 
considered  in  tha  sequel. 


■e 
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G.  DATA  IDENTIFICATION 


Experience  In  implements!  design  of  real-time  estimation  systens  has  taught  a  bitter  lesson 
about  the  identification  of  time-  and  source-dependent  data  and  events.  Implicit  identification 
techniques,  based  on  the  order  in  which  data  arrives  (or  is  generated)  or  dependent  on  the  location 
where  it  is  placed,  are  often  favored  by  novice  designers.  But  they  are  dangerous:  some  data  may 
arrive  late  or  never;  some  sources  may  intermittently  fail  to  send  data;  the  implicit  identification 
of  data  by  position  in  an  array  may  be  perturbed  by  the  deletion  or  addition  of  sources.  Also, 
implicit  identification  techniques  lead  to  rigid  and  cumbersome  implementations. 

Explicit  identification  of  data  snd  events  and  recording  of  their  reference  times  constitute 
a  safer  approach.  It  also  leads  to  a  more  flexible  implementation  In  the  sense  that  the  meaning  of 
real-time  data  no  longer  depends  on  the  time  when  It  becomes  available  and  on  the  memory  location 
where  it  can  be  found. 

Technically,  explicit  identification  means  that  every  data  group  (or  record  of  an  event)  tAilch  is 
time-dependent  is  time-tagged  by  its  reference  time.  Similarly,  every  source-dependent  data  group  is 
tagged  with  the  identification  (ID)  of  IBs  source.  For  example,  suppose  that  a  real-time  Kalman  filter, 
used  for  target  tracking,  operates  on  range  and  range-rate  measurements  of  tracked  targets.  Then 
every  simultaneous  batch  of  range  and  range-rate  measurements  is  time-tagged,  say,  with  the  estimated 
time  at  which  the  radar  signal  is  reflected  from  the  target,  or  with  the  observed  time  at  which  the 
signal  is  returned.  Furthermore,  as  new  targets  are  detected  and  go  into  tracking,  they  are  assigned 
explicit  and  unique  target  IDs. 

To  facilitate  the  time-tagging  of  real-time  idata  and  events,  a  clock  is  needed’.  Such  a  clock, 
which  will  be  called  system  control  clock,  is  usually  hardware-implemented  and  is  driven  by  an  oscillator. 
The  time  of  this  clock  must  be  available  to  all  processing  elements  of  a  distributed  system.  In  Section 
IiI.C,  tee  outline  how  the  system  control  clock  can  be  used  to  synchronize  processes  over  the  entire 
distributed  system.  Such  a  clock  would  be  needed  even  in  a  uniprocessor  (not  necessarily  aultlprogrammed) 
real-time  system.  Computational  processes  retrieve  the  time  of  this  clock  by  calling  a  special  subprogram. 

H.  SOFTWARE  DEVELOPMENT  METHODOLOGY  AND  TOOLS 


During  the  seventies,  a  great  deal  of  progres*  was  scored  in  the  areas  of  software  engineering 
and  software  development  management.  Some  of  the  events  contributing  to  this  progress  were  the  emergence 
of  a  robust  progressing  style,  known  as  structural  programming,  of  software  development  management 
technlquet  such  as  the  chief  programmer's  teas,  snd  of  computer-aided  software  development  and  software 
management  toole  such  as  Interactive  program  development  terminals,  data  base  management  systems,  or 
programming  languages  amenable  to  structured  programming  and  concurrent  processing. 

These  techniques  and  tools  are  useful  In  development  of  real-time  software  for  the  aultlprogrammed 
processing  environment,  but  they  are  well  covered  in  the  literature  and  thus  are  not  addressed  here. 

The  only  thing  which  we  want  to  note  is  that  software  development,  especially  its  testing  and  validation, 
for  tha  applications  and  processing  environment  addressed  here  requires  a  hierarchical  sequence  of 
simulations. 

One  sterts  with  high-level  fwctional  simulations  of  key  algorithms  in  order  to  validate  their 
performance  and  to  determine  the  required  processing  rates.  These  high-level  simulations  are  usually 
put  together  end  performed  by  the  control  specialist  responsible  for  algorithm  design.  They  are 
performed  off-line,  (l.e.,  not  In  reel-time)  on  s  large-scale  computer  system.  The  insights  obtained 
from  such  simulations  facilitate  the  timing  and  siting  of  software  snd  the  partitioning  of  workload 
for  real  time. 

As  ths  design  process  progresses  and  as  the  control  specialist  begins  to  think  shout  the  Implementation 
of  algorithms  for  the  real-time  processing  environment,  he  sterts  (while  guided  by  the  feedback 
from  the  performance  analysis,  the  tentative  workload  partitioning  tchame,  and  the  results  of  aoftware 
timing  end  eiting)  to  modify  and  restructure  them.  In  this  effort,  he  continues  to  use 
the  off-line  simulator  as  a  testbed.  If  this  Iterative  process  of  design  refinement  is  continued  long 
enough,  tha  modeled  real-time  aoftware  and  algorithms  begin  to  look  more  and  mors  similar  to  tha  ultimate 
product.  At  the  same  time,  the  level  of  simulations  progressively  goes  down  as  more  details  are  modeled, 
eimulated,  and  Invest {gated. 

Ultimately,  the  off-line  slaulstton  process  eventually  reaches  a  point  of  diminishing  returns, 
primarily  becauaa  of  tha  difficulties  in  creating  sufficiently  realistic  simulation  scenarios,  needed 
for  complete  validation  of  design,  and  In  modallng  with  fldality  interactions  among  concurrent  processes. 
Also  st  this  time,  pertly  tested  real-tlms  software  for  the  target  hardware  usually  becomes  svsiiable. 

(This  availability  can  be  speeded  up  by  copying  pertinent  portions  of  off-line  Simulation  aoftware  and 
than  embedding  the  copied  software  Into  the  prepared  control  structure  of  re»l*tloe  software.) 

The  n«tt  atop  la  to  switch  to  on-line  (or  real-time)  simulations  In  which  the  actual  real-time 
hardware  and  aoftware  are  driven  by  special  teat  equipment.  In  order  to  serve  Its  purpose,  such  teat 
equipment  must 

a  la  capable  of  ganeratlt^  (or  acquiring)  at  real-time  rates  tha  measurements  and  other  system 
inputs  that  would  he  highly  similar,  If  not  Identical,  to  the  actual  measurements  (inputs) 
of  ths  tsrgst  operational  system. 

a  Foaaaaa  built-in  facilities  for  collecting  and  reporting  parfoimanca  data. 

For  comparison,  It  le  desirable  that  the  performance  analysts  reports  generated  In  simulations 
on  the  special  taat  equipment  be  deslgnmd  to  look  similar  to  form  snd  contents  to  tha  parfotmaoce  reports 
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generated  In  off-line  simulations.  Finally,  for  testing  various  failure  aoden,  the  special  teat  equipment 
oust  be  capable  of  generating  a  wide  range  of  extreme  measurements  and  Inputs.  Examples  of  such  special 
test  equipment  are  briefly  described  in  [Dam]  and  [Gyl  80]. 

III.  CONTROL  OF  REAL-TIME  PROCESSES 

A.  INTRODUCTION 


Modern  theory  of  operating  systems,  based  on  the  concepts  of  process  (task)  and  of  proceas  management, 
furnishes  powerful  tools  for  understanding  and  designing  not  only  operating  systems  but  also  the  applications 
software  executed  under  the  control  of  an  operating  system.  (In  the  sequel,  the  term  "applications 
software,"  in  contrast  to  "system  software,"  will  refer  to  that  portion  of  real-time  software  which 
implements  control  or  estimation  algorithms;  analogous  meanings  will  also  be  assigned  to  "applications 
program"  and  "applications  process.")  In  real-time  multi  programmed  computing,  the  applications  processes 
are  more  intertwined  with  process  state  control  than  in  non-real -time  environment.  Hence,  the  designer 
of  such  applications  software  needs  to  understand  certain  aspects  of  process  management  in  order  to  be 
able  to  ccme  up  with  a  working  implemental  design.  In  contrast,  a  casual  noureal-time  scientific 
programmer,  who  programs  in  a  high-level  programing  language,  rarely  needs  to  know  much  about  the 
operating  system  beyond  specifying  his  job  to  the  operating  system,  getting  it  into  the  computer,  and 
writing  input  and  output  statements.  Thus,  he  can  perform  his  functions  nearly  without  any  understanding 
of  the  actual  environment  in  which  his  program  is  executed. 

Understanding  of  the  following  aspects  of  proceas  management,  we  think,  is  essential  to  a  designer 
of  a  real-time  multlprogrmmmed  estimation  (or  control)  system:  resource  allocation,  process 
synchronization,  process  scheduling,  and  lnterprocesa  communications.  Renee,  the  purpose  of  Section 
III  is  to  review  these  as  well  as  other  related  aspects  of  process  management.  Although  it  la  not  our 
intention  to  be  digressed  by  a  lengthy  exposition  of  real-time  operating  systems,  discussion  in  the 
sequel  le  essentially  self-contained.  (We  shall  henceforth  use  the  term  "real-time  executive*  or  its 
abbreviated  form  "RT  executive"  to  refer  to  a  real-time  operating  system.  We  shall  also  use  the 
terms  "process  management"  and  "process  control"  synonlmously.) 

S.  FUNDAMENTAL  ISSUES  OF  PROCESS  MANAGEMENT 


In  the  description  of  software  model  for  e  distributed  system,  we  assumed  multiprogramming , 
me  the  proceaslng  environment  in  the  processing  elements  (PCs)  of  such  e  system.  We  also  noted  that 
concurrent  processes  executed  within  «  processing  element  or  In  several  different  proceaslng  elements 
communicate  among  themselves  vis  an  exchange  of  problem  and  control  data.  The  present  section  reviews 
fundamental  issues  of  proceas  management  In  the  mult iprogrammed  multiprocessing  environment  in  order 
to  establish  s  perspective  for  the  design  approach  described  in  the  Utter  parte  of  Section  III. 

These  fundamental  Issues,  mil  known  in  operating  system  theory,  are  mutual  exclusion, 
synchronisation,  deadlocks  end  th«ir  prevention,  and  Interproceas  communications,  (.‘rltlal  regions  and 
communication  primitives  sve  presented  ae  technique  for  implementing  mutual  exclusion  and 
a ynchront ration.  There  era  two  additional  laauas  of  proceas  management,  allocation  of  memory  resources 
end  allocation  of  processor  tlma  (or  tchadullng),  which  will  be  midreseed  in  Sections  XII. C  through 
IU.E.  Due  to  the  limitation  of  specs,  this  exposition  of  fundamental  issue*  la  more  concise  then  it 
ought  to  be.  The  issues  are  discussed  only  to  the  extent  needed  to  make  e  control  specialist  metre  of 
their  existence  and  criticality.  For  s  more  detailed  exposition,  the  reader  le  referred  to  recent 
texts  on  operating  systems  end  system  progrmamlng  such  as  (Cof  731,  (Fre),  |Gra],  (Han),  or  (Had|. 

(Fee!  and  fSre)  ere  readehla,  elementary  expositions  of  the  topic,  written  eteinly  tor  aspiring  ayetam 
programs* re;  jDanJ  and  (Had)  are  more  detailed  but  still  elementary  texts,  less  advanced  than  (Cof 
711. 

1.  Critical  keg  I  one 

Consider  *  computer  program,  say  PR0Gj,  and  the  process  Pj  crested  by  execution  of  this  program. 

A  critical  region  (Ck)  of  program  PROG,  is  an  executable  segment  of  Instructions  in  FROCj ,  the 
executions  of  vtUeh  may  produce  unpredictable  and  varying  results  if  the  values  of  eom*  variables 
referenced  from  within  this  fik  ere  changed  by  another  process,  stay  P*,  while  Pj  it  executing  the  CR. 
Harm,  K  le  aaXumtd  to  be  a  proeeee  concurrent  with  F,.  This  mty  occur  if  (1)  we  do  not  know  anything 
about  the  relative  rpeeds  of  processes  Pj  and  P|,  end  (2)  we  do  not  program  FtOGj  and  fROGfc  so  at 
to  prevent  the  unpredictable  reeulta. 

If  in  the  Illustration  of  the  preceding  paragraph  Ck,  and  C\  are  critical  regions  in  program# 
FTOCj  and  PtOCy ,  respectively,  then  two  mutually  exclualve  poaalbllltle*  exist:  either  Ctj  end  Ckg 
are  critical  with  respect  to  each  other  (due  to  accessing  of  the  seam  data  aat)  or  site  Ck,  and  Ck^ 
are  mutually  exclusive  because  each  of  them  accesses  a  different  date  set.  Hence,  in  order  to  be 
precise  about  a  critical  region,  enn  must  also  specify  the  date  set  with  respect  to  which  the  Ck 
le  critical.  In  the  preceding  paragraph,  w*  could  hove  done  It  by  writing  Ckj(D)  and  C\(D),  where 
D  would  have  referred  to  a  mutually  eccaatsd  data  aat. 

The  next  example  Illustrates  the  use  of  critical  regions  in  a  Kalman  filter. 

EXAMPLE:  Unpredictable  results  la  a  par alls ly  mechanised  Kalmta  filter. 

'Suppose  that: 

e  Process  F,  propagates  the  atete  vector  I,  predicts  the  measurement  vector  m,  stores  (perhaps 
occasionally)  In  a  buffer  the  data  needed  for  computation  of  linemrUed  atate-to-meaaurmeant 
transformations  H,  computes  residuals,  retrieves  Kalman  gains  K  computed  by  process  Pc,  and 
applies  thae  to  update  e. 
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•  Process  Fc  propagates  the  state  error  covariance  matrix  P,  retrieves  the  data  for  computation 
of  H,  computes  Kalman  gains  K,  and  updates  P. 

Two  segments  In  program  PROG,  are  critical  regions:  one  [say  CRsi(H)j  contains  the  code  for 
storing  in  a  record  (from  which  Pc  reads)  the  data  for  computation  of  H;  the  other  (say  CR82(K)] 
copies  (from  the  array  into  which  Pc  writes  )  for  its  own  use  the  Kalman  gains  K  computed  by 
Pc.  Similarly,  there  are  two  corresponding  segments  in  program  PROGc  which  also  are  critical 
regions:  one  (denoted  by  CRci(H)!  retrieves  the  data  for  computation  of  H;  the  other  (to  be 
denoted  by  CRc2(K)l  stores  the  computed  Kalman  gains.  If  the  relative  speeds  of  proceses  Ps 
ami  Pc  are  unpredictable  (even  if  they  are  executed  concurrently  on  the  same  processing  element) 
and  if  no  precautions  are  made  to  synchronize  or  otherwise  regulate  P8  and  Pc  in  accessing 
mutually  accessed  data,  then  the  results  are  unpredictable. 

Section  I1I.E  reviews  interprocess  commuiication  techniques  for  preventing  such  disasters.  The 
preceding  illustrations  lead  to  the  first  fundamental  issue  of  process  management,  mutual  exclusion  of 
communicating  processes,  which  we  discuss  next. 

2.  Mutual  Exclusion 

Mutual  exclusion  of  interdependent  processes  (of  each  process  with  respect  to  a  mutually  related 
critical  region  in  its  generating  program)  means  that  no  more  than  one  process  can  be  in  its  critical 
region  at  a  given  time.  We  say  that  a  process  is  in  a  critical  region  if  it  has  already  started  the 
execution  of  the  first  executable  instruction  of  this  region  but  has  not  yet  completed  the  execution 
of  the  last.  Actually,  the  statement  “the  time  when  process  P  is  in  a  critical  region  CR"  refers  to 
the  entire  time  interval  during  which  the  above  defined  conditions  hold,  i.e.,  to  the  time  interval 
spanned  by  the  following  two  events:  “P  has  entered  CR“  (P  has  started  the  execution  of  the  first 
instruction  of  CR)  and  “P  has  left  CR“  (P  has  completed  the  execution  of  the  last  instruction). 

We  assumed  in  the  above  the  principle  of  Indivisibility  of  Instruction  execution.  According  to 
this  principle,  execution  of  an  instruction,  such  as  storing  a  quantity  into  a  memory  location  or 
readlrg  one  from  it,  is  an  lrxllvlaible  operation  in  the  sense  that  the  action  performed  by  such  an 
instruction  cannot  be  interrupted  after  its  execution  has  been  started  and  before  it  is  completed.  By 
programming  s  short  unlnterruptable  procedure,  we  can  generalize  this  concept  to  an  “indivisible 
macrooperatlon«“  In  Section  III.E,  we  discuss  the  use  of  such  indivisible  macrooperatlons  (or 
procedures)  in  construction  of  communication  primitives.  These  will  be  wlnterruptable  segments  of 
code,  sometimes  Implemented  as  unlnterruptable  subprograms,  designed  to  protect  entries  to  and  handle 
exlta  from  critical  regions, 

1.  Synchronisation 

Synchronization  of  a  process  P*  with  some  other  process  Pj,  where  ifj ,  or  with  several 
other  processes  meant  enauriry  that  Pt  will  not  proceed  past  some  given  point  without  sn  explicit 
signal,  which  P<  itself  esnnot  generate  due  to  lack  of  information  abaur  process  Pj  (or  about  several 
other  processes),  (tones,  this  information  must  be  explicitly  or  implicitly  provided  to  Pj  from  outside, 
i.e.,  by  Pj,  by  other  processes,  or  by  the  real-time  executive.  Mote  that  strictly  sequential  processing 
on  a  single  processor  does  not  require  any  synchronisation  information. 

A  real-time  executive  paeeea  synchronization  information  Implicitly  by  scheduling  process**  for 
execution.  Explicit  exchange  of  synchronisation  information  among  concurrent  processes  generally 
requires  the  use  of  critical  regions  serviced  by  appropriate  communication  primitives. 

4.  Deadlocks 

Two  processes  are  said  to  be  desd locked  if  neither  can  continue  until  the  other  continues.  A 
system  deadlock  occurs  when  sll  processes  in  tire  system  become  deadlocked. 

Two  concurrent  processes  Pj  and  Pj,  commun  lest  log  through  the  execution  of  critical  regions  CR) 
end  CR},  respectively,  may  became  deadlocked  if  the  critical  regions  are  improperly  implemented:  for 
example,  iff)  hangs  up  after  entering  CRj  when  it  finds  out  that  P}  meanwhile  has  entered  CR}, 
and  vice  versa. 

As  pointed  out  in  literature  on  operating  system  concepts  (e.g.,  (Prs),  p,  15?),  the  occurrence 
of  a  deadlock  la  defined  by  the  simultaneous  coexistence  of  the  following  conditions* 

(1)  Processes  claim  exclusive  control  of  the  resource#  that  they  need  for  execution. 

(2)  Procesees  hold  resource#  already  allocated  to  them  while  awaiting  additionally  needed  resources. 

(1)  Resources  cannot  be  forcibly  removed  from  the  processes  holding  them  until  these  procesees 
no  longer  need  them. 

(4)  There  eelste  a  circular  chain  cf  processes,  such  that  each  process  In  the  chain  holds  sons 
resources  requested  by  the  next  process  in  the  chain. 

Although  there  is  little  probability  of  a  deadlock  in  a  typical  mult  {programed  system  designed 
even  without  any  safeguards  against  deadlocks,  it  is  imperative  that  any  real-ties  system  be  designed 
so  chat  deadlocks  In  such  a  system  cannot  occur,  I.e.,  so  the  above  four  conditions  can  never  be  satisfied 
simultaneously.  In  applications  considered  hare,  we  shall  attain  thin  objective  by  proper  design  of 
comauolcation  primitives  and  by  raqulrlng  that  no  process  by  design  be  allowed  to  stay  in  a  critical 
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region  longer  than  some  a  priori  fixed  length  of  tine. 

C.  DECENTRALIZED  REAL-TIME  EXECUTIVE  WITH  FIXED  RESOURCE  ALLOCATION 

Neat,  we  characterize  a  claea  of  RT  executives,  several  variations  of  which  were  successfully 
used  in  Phase  I  CPS  (Global  Positioning  System)  navigation  user  equipment  sets  designed  by  Texas  Instruments 
(references  {Daa],  (Gyl  80],  and  [Opa]  describe  the  navigation  filters  used  In  these  sets).  The  aaln 
features  of  this  class  of  ST  executives  aro  as  follows: 

(1)  Decentralised  control  -  each  processing  element  (PE)  has  Its  own  autonomously  functioning 
ST  executive  which  supports  aultl progressed  execution  of  concurrent  processes  In  the  PE. 

(2)  Fixed  allocation  -  there  Is  fixed  allocation  of  PEs  to  programs  and  of  aeaory  to  global 
data  sets,  with  the  assumption  that  the  software  oodel  defined  In  Section  II. C  is  used  (one 
Implication  of  which  Is  one-to-one  correspondence  between  prograas  and  processes). 

(3)  Synchronisation  of  processes  -  processes  executed  In  the  distributed  system  and  data  coaaunlcatlons 
aaong  these  processes  sre  synchronized  by  means  of  periodic,  systemwide  Interrupts. 

For  this  purpose,  the  tlae  line  Is  decomposed  Into  consecutive  Intervals  of  fixed  length  At 
an!  an  Interrupt  le  broadcast  aysteawlde  at  the  end  of  eech  Interval.  Such  Intervals  are 
often  called  fundaaentel  tlae  frames  (FTFa).  However,  the  jeh  processing  eleaent,  PEj, 
aey  be  set  up  to  respond  only  to  every  (nj)th  Interrupt  end  to  Ignore  the  others  Typically, 
nj  la  choaen  to  that  (nj)At  equals  tha  period  of  the  hlghest-rate,  periodically  executed 
process  In  Hj . 

(4)  Restricted  Resntrsrcy  -  since  (according  to  the  software  model  outlined  in  Section  II, C) 
there  is  ons-to-ons  correspondence  between  the  programs  end  the  processes  crested  by  these 
prograas,  subprograms  concurrently  shared  by  at  least  two  processes  are  lapleaented  either 
ee  uninterruptible  procedures  (If  they  execute  fast)  or  else  as  reentrant  procedurea  of 

a  priori  known  maximum  reentrancy. 

(5)  Scheduling  -  three  types  of  processes  are  edalealble  and  characterised  according  to  the  wey 
they  are  scheduled: 

e  Cyclic  (C-)  proceezee  -  such  e  process  la  executed  at  s  fixed  rate,  with  the 
stipulated  execution  rate  guaranteed  tc  he  met, 

*  Owedline  (D-)  prcceeaee  -  tech  time  when  euch  a  process  la  scheduled,  the  deedllne 
of  Its  execution  completion  le  specified;  the  RT  executive  trlee  to  meet  but 
does  not  guarantee  this  deadline, 

a  Eackground  (S-)  processes  -  such  s  procsss  Is  si located  all  the  processing  tlae 
of  era  that  remains  "after  (or  betwsen)  execution  of  foreground  end  deedllne 
processes;  at  any  time,  at  most  a  tingle  active  background  process  la  allowed 
in  a  FB. 

Scheduling  of  proceseea  le  further  discussed  In  Section  III.O. 

(6)  tnterprucsee  communications  -  depending  on  the  nature  of  Intercccloo  between  two  processes, 
vHiy  the  first  or  both  oTThs  following  Interprocess  canaamlcatlon  types  sre  permitted  without 
further  reservations  than  those  etated  below: 

e  Critical  regions  (sections),  protsctsd/controlled  by  the  WAIT  and  SIGNAL  coseunl- 
cstlon  primitives,  may  bs  used  to  Implement  data  communications  among  any  two 
concurrent  processes, 

s  bats  buffers,  whose  access  la  controlled  by  a  SET/RESET  flag,  may  be  used  to 
leplsmaot  one-way  communications  between  two  cyclic  process**,  scheduled  at  the 
sews  rate,  Whose  executions  are  usually  not  lntsrlssved  In  spilt  fashion;  l.s, 

If  A  and  1  are  two  such  processes,  than  whan  process  A  starts  an  execution 
Instance,  It  will  complete  this  arecution  before  >  can  etart  its  next  execution 
Instance. 

Interprocess  communications  are  further  discussed  in  greater  detail  In  Section  UI.S. 

D.  PROCESS  SggPtfLIWC  AMD  F ROC1S5  STATE  CONTROL 

1.  Scheduling  Philosophy  and  Requirements 

Process  scheduling  allocate*  the  processor  ties  to  protects*  sad  thus  dstermloes  whan  processes 
will  be  executed.  With  our  aodtl  of  decentralised  process  control,  ail  process**  assigned  to  e  processing 
elvmeot  will  be  scheduled  Independently  of  proceeeee  atecoted  on  ether  processing  elements. 

Since  scheduling  also  Influences  tha  structuring  of  algorithms  Into  concurrent  processes,  scheduling 
Is  an  important  lasim  of  real -time  system  design.  Scheduling  can  bs  best  explained  through  process 
states  sad  process  state  control.  This  Is  tbs  approach  which  we  take  la  the  present  section.  Rut  we 
proceed  with  dlecudelon  of  process  atote  control  only  to  tha  amtsot  needed  to  define  selected  scheduling 
etra  sgles  for  the  decuat rail ted,  fixed  allocation  schema  Introduced  in  the  preceding  section.  Coffman 
I’Cof  13)  end  (Cof  74]  dlecuat  process  scheduling  on  an  advanced  (abstract)  level;  references  ]Pre|, 

(Van I,  and  (Nad]  deal  with  It  on  a  note  aleamntary  (leas  mstheastlcol)  level.  Literature  on  scheduling 
for  real-time  pro^aeeing  is  typically  difficult  for  a  nonspeclallst  to  follow  and  le  mainly  confined 


to  journal  article*:  reference*  [Bss],  [Ber],  [Jor],  [Man],  and  [Liu]  supplement  the 
narrow  viewpoint  on  real-time  taken  In  the  present  article. 

A*  noted  above,  process  scheduling  may  be  best  Introduced  by  defining  the  states  which  a  process 
of  a  specified  type  may  attain  and  then  by  defining  the  rules  governing  state  transitions.  We  do 
this,  but  In  an  Informal  fashion.  Detailed  design  of  a  scheduler  for  a  real-tine  executive  Is  a  task 
which  la  usually  delegated  to  system  programmers;  thus,  a  control  specialist  Is  seldom  concerned  with 
such  details.  However,  to  do  his  part  of  design,  he  needs  to  understand,  In  addition  to  the  Information 
conveyed  by  the  state  transition  graphs,  the  scheduling  priorities  of  processes,  the  facilities  provided 
to  him  by  the  real-time  executive  for  changing  process  states,  and  the  attributes  by  which  he  can 
define  or  redefine  a  process  or  change  Its  state. 

Many  strategies  are  possible  for  setting  scheduling  priorities.  In  literature  (e.g.,  [Fre]), 
priority  disciplines  are  often  divided  Into  two  major  classes: 

*  Static  priorities  -  such  a  priority  Is  set  a  priori  in  the  sense  that  It  cannot  change  while 
the  process  to  which  It  applies  Is  In  an  active  state. 

*  Dynamic  priorities  -  such  a  priority  may  change  while  the  process  to  which  it  applies  Is  In 
an  active  state. 

In  contrast  to  nonreal-tlme  systems,  scheduling  of  real-time  processes  requires  some  use  of  dynamic 
priorities  or  perhaps  of  a  mixture  of  static  and  dynamic  priorities.  This  becomes  clearer  If  one 
recall*  that  In  a  general  nonreal-tlme  system,  (1)  very  little  Is  a  priori  known  about  the  lncoelr^ 

Jobs  (processes);  (2)  Incoming  jobs  are  Imprecisely  characterized  as  they  come  Into  the  system;  and 
(3)  the  optimality  criteria,  such  as  maximizing  the  throughput  without  much  regard  for  the  turnaround 
time  of  individual  jobs,  make  sense.  Besides,  the  techniques  for  Implementing  schedulers  operating  on 
fixed  priorities  are  better  understood  by  programmers.  These  observations  explain  why  static  priorities 
are  so  widely  used  In  general,  nonreal-tlme  processing. 

On  the  other  hand,  the  main  objective  of  scheduling  In  the  real-time  applications  considered  here 
Is  to  meet  the  response-time  constraints  required  for  specified  performance  while  minimizing  the  coat 
of  hardware  or  while  staying  within  the  allocated  hardware  budget.  Designing  a  real-time  scheduler 
operating  on  fixed  priorities  is  not  difficult  If  the  available  hardware  resources  are  comfortably 
adequate,  For  example,  Jordan  [Jor]  discusses  a  simple  scheme  for  doing  It  by  means  of  an  a  priori 
fixed  multiharmonic  scheduling  pattern.  We  could  proceed  similarly,  slnee  nearly  all  processing  load 
In  the  applications  considered  here  is  due  to  the  algorithms  which  must  be  periodically  reexecuted. 

Ue  called  them  cyclic  algorithms.  Usually,  It  it  not  difficult  to  (t)  Identify  a  cyclic  algorithm 
with  the  shortest  period,  e«y  ATq;  (2)  define  a  harmonic  hltrarchy  of  h  periods  ATo.ATj,  AT2,  ••■vil'h 
such  thst  ATfc  «•  2dTj,_;  for  k  »  l,...h;  and  (3)  assign  every  cyclic  algorithm  to  a  period  class. 

Jordan  then  uses  this  technique  *a  *  basis  for  constructing  a  mult  (harmonic  scheduling  pattern.  But 
such  an  approach,  based  on  the  estimates  of  algorithm  maximum  execution  times,  underutil lies  the 
available  processor  resource*. 

A  hatter  approach  te  to  cUsetfy  all  procedure#  and/or  algorithm*  Into  three  categories:  (1) 
those  with  periodic  rates  that  cannot  be  slipped  because  of  the  enormous  penalty  that  would  have  to  be 
paid  otherwise  (in  estimation  work,  these  typically  are  the  procedure#  which  logically  control  the 
estimation  scheme  hut  are  not  the  eetimatlon  algorithms  themselves);  (2)  those  having  period  boundaries 
that  represent  desired  but  not  absolutely  required  completion  deadlines  (in  estimation  work,  these  are 
the  estimation  algorithms);  end  (3)  noncyclic  procedure* /eigorir.  teas  which  have  to  be  executed  only 
occasionally  due  to  special  condition#  thst  say  arise  and  which  typically  have  no  strict  deadline  (for 
example,  e  filter  initialisation  procedure). 

2.  Process  Types,  Their  State#  and  State  Transition* 

Experience  show#  that  nearly  all  real-time  estimation  aehemea  of  the  typo  considered  here  can  be 
realised  by  amen*  of  the  three  types  of  processes  (cyclic,  deadline,  end  background)  introduced  in 
Section  til .C.  (text,  we  characterise  these  processes  in  gtestsr  detail  than  previously  end,  by  means 
of  the  state  graph*  shown  lu  figure  XXX .0—1 .  define  their  states  end  the  rule#  governing  atste 
transitions: 

(1)  Cyclic  (C-)  process**  -  #t  th*  beginning  of  each  new  cycle  (echedulltwt/executlon  period) 
of  an  active  cyclic  process,  the  XT  executive  automat iceliy  reschedules  this  process  by 
putting  it  into  the  reedy  state  *0  that  the  procese  te  executed  within  tech  cycle  and 

the  event#  representing  the  starting  time  end  the  completion  time  of  an  execution  t nuance 
ere  not  eeperated  by  the  boundary  of  a  period.  In  other  words,  these  two  event*  are 
elway*  located  within  the  time  Interval  spanning  e  single  acheduling/exacutioo  period. 

As  indicated  in  Figure  ill.D-l,  n  cyclic  process,  after  It  becomes  activated  by  another 
procees  or  by  the  XT  executive,  tsmalna  active  until  it  become#  explictly  deactivated 
(Which  it  not  shown  in  th*  state  transition  diagram)  by  a  procees  or  by  th*  ET  executive. 

(2)  Deadline  (D-)  procees  -  such  a  process  must  be  activated  by  soother  preens*  or  by  th* 
it  eaecut iv* .  With  tech  activation,  one  needs  to  specify  th*  completion  deadline.  The 
scheduler  of  tT  executive  does  its  best  to  meet  th*  specified  deadline  or  tt  least  to 
miatmlt*  slipping  th*  deadline.  After  the  completion  of  each  execution,  a  deed  1  to* 
process  automatically  returns  to  th*  inset  in*  state, 

(3)  Background  (1-)  greets**#  -  #t  moat  one  active  background  procees  at  a  time  is  allowed 
to  a  procees teg  element  of  a  distributed  system.  Such  e  proem**  is  then  given  ail 
processor  time  that  remains  after  all  currently  active  cyclic  and  deadlins  process** 

have  been  serviced.  After  each  complete  execution  past,  a  background  process  automatically 
return*  to  the  Inactive  state. 
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Figure  III.D-l.  State  Transit  Iona  of  Thrt'O  Process  Types.  In  order  not  to 
clutter  the  diagrams,  it  is  not  shown  here  (although  it  is 
assumed)  that  a  process  of  any  of  the  above  shown  three  types 
■ay  be  deactivated  fro*  any  of  its  active  states. 


It  te  further  aaauaied  that  a  process  of  any  of  the  above  three  types,  regardless  of  its  current 
state,  can  be  forcefully  deactivated  by  invoking  a  special  utility  subprogram  furnished  by  the  *T 
executive,  this  provision  fecilltlatee  sudden  reconfigurations  of  algorithms  in  real-tins,  which  say 
be  1 •portent  in  handling  of  emergency  situation*  or  of  sudden  changes  in  operational  environment. 

Complete  specification  of  scheduling  require*  definition  of  priority  rule*  for  selecting  a  process 
(from  the  processes  waiting  in  various  reedy  queues)  to  be  executed  nest  on  the  processing  element 
under  consideration. 

J.  Priority  Hula* 

Symbolically  denote  the  priority  of  process  x  by  ??(*).  For  processes  x  and  y,  write  Pr(x)> 

Pr(y)  if  x  has  a  higher  priority  than  yj  Pr(*>  *  Pr(y),  If  the  priority  of  x  1*  higher  than  or 
equal  to  that  Of  y,  ate.  The  following  priority  aches*  ease*  to  b*  reasonable  for  scheduling  on 
a  processing  element  processes  of  the  three  type*  Introduced  earlier! 

e  Suppose  thet  C,  D,  end  t  represent  active  process**  of  the  cyclic,  deadline,  and  background 
type*,  respectively!  then, 

Pr(C)  >  Pr(0)  >  Pf{»>. 

•  For  any  two  active  cyclic  process**,  C|  and  Cj ,  with  period*  of  length*  i  £t(C|)  and  At(C2), 
respectively, 

Ft(C,)i  Fr(Cj)  if  At(C5)s4t(C2) 

with 

Pr(C,)  -  Pr(Cj)  only  if  At(Cj)  »dt(Cj>. 

*  For  any  two  active  deadline  process**  Dj  and  Oj, 

MOl>e  1* 

rUtp(Oi)uUt)(Oi)l  s  ?Utf<Dj),4t0(o1)1 

with 

F.(Oj)  -  Pt(0?)  only  If 

F|Atf(D| )  Juo(t>l  )l  •  F|.ltp(Dj),itp(D})| , 

whersr  (1)  P  it  a  i  sal -valued  priority  fraction  that  is  monotonies',  ty  nondecressiog  in  each 
oi  it*  argument*  |«.g„  Fir,*)  ••nto  Where  a  tad  b  arm  non^mgstlvs  constants:  if  s  >0 
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and  b  >  0,  then  F(r,x)  ie  strictly  increasing  both  in  r  end  in  zjj  (it)  Atp(x)  i«  the  processor 
time  needed  to  complete  the  current  execution  of  proceaa  x;  (ill)  At[>(x)  ti  the  time  remaining 
until  the  current  deadline  of  procees  z. 

Since  at  aoat  one  active  B-proceaa  la  allowed  at  a  tlae  in  a  proceaatng  eleaent,  no  reaolutlon  of 
prloritlea  aaong  the  B-proceaaea  aaalgned  to  the  aaae  proceaaing  eleaent  la  needed.  Priority  tlea 
aaong  deadline  proceaoea  can  be  reaolved  randomly,  i.e.,  arbitrarily.  However,  it  la  beat  to  epe-lfy 
a  priori  how  to  reeolve  the  priority  tlea  aaong  a  aet  of  cyclic  proceaaea  Cj,  Cj,  .  .,Cfc  with  perloda 
of  identical  lengtha,  for  it  affecta  the  order  In  which  theae  proceaaea  will  be  executed. 

4.  Overall  Synchronization  of  Proceaaea 

Suppoae  that  the  distributed  system  under  conalderatlon  contains  nre  processing  eleaenta  (PKs). 

For  reasons  of  alapllclty,  we  have  asauaed  decentralized  executive  control  over  theae  npg  PCs. 

Unless  appropriate  aeasurea  are  taken,  processes  executed  not  on’v  in  different  PEa  but  even  within 
the  aaae  PE  any  remain  unaynchronlzed.  Actually,  two  (although  related)  kinds  of  synchronization  are 
implied  here:  aynchronlzatlon  of  actions  aaong  processes,  which  can  be  attained  by  means  of  interrupts 
and  interprocess  data  communications,  and  synchronization  of  proceaaing  with  the  outside  world.  The 
latter  type  of  synchronisation  la  usually  accomplished  through  the  monitoring  of  progression  of  time, 
of  incoming  measurements,  or  of  other  signals  received  from  outside. 

One  technique  for  obtaining  synchronization  it  to  introduce  systemwide  tlaer  interrupts,  driven 
by  an  oscillator,  for  dividing  the  computational  time  line  of  each  PE  into  time  intervals  of 
length  At.  In  Section  III.l),  such  intervals  were  called  fundmental  time  frames  (FTFa);  the  timer 
interrupts  separating  FTP*  -  FTP  interrupts.  The  length  of  an  FTP  depends  on  the  estimation  problem 
and  can  be  chosen  to  represent  the  fastest  working  rhythm  tn  the  system  or  in  its  measurement  acquisition 
process.  For  example,  in  the  estimation  problem  of  GPS  navigation  (outlined  in  Appendix  C),  FTP*  were 
chosen  to  20  milliseconds  long,  because  20  millisecond  cycles  represent  the  basic  transmission  rhythm 
of  GPS  satelltes;  the  leRgth  of  an  estimation  cycle  varies,  but  typically  ia  about  one  second  long. 

To  enable  the  applications  processes  in  a  processing  element  (PE)  to  read  the  ttme  (actually,  to 
count  the  FIT  interrupts),  the  local  real-time  executive  of  such  a  FT  must  furnish  a  noninterrupt  able 
function  subprogram,  each  call  to  which  return*  to  the  caller  the  current  count  of  completed  FTP 
Interrupts.  Application*  proceaaea,  especially  the  measurement  acquisition  process,  can  uae  this  time 
information  to  tls*-tag  their  outputs. 

The  synchronization  method  based  on  nr  interrupt*,  introduced  above,  can  be  extended  to  synchronize 
the  cyclic  processes  of  the  entire  distributed  system.  This  can  be  done  by  extending  the  scheduling 
scheme  credited  to  Jordan  (Jar)  in  the  next-to-lsst  paragraph  of  Subsection  l  of  the  present  section. 

With  the  notation  introduced  in  that  subsection,  this  extension  la  a*  follow*: 

U>  let  the  lengths  of  fundamental  ttme  frames  (FTFe)  be  expressible  as  At  «ATf,/2* 
tor  soem  fixed  nonnegsttve  integer  k*,  with  AT(y  now  representing  the  period 
(cycle)  lengtha  of  potentially  highest  rate  cyclic  process  in  the  entire  system. 

(2)  Synchronise  all  proc*;«e*  proceaaing  the  period*  of  iengthATo  with  PTFs  by  requiring 
that  the  cycle  boundaries  of  eueh  processes  be  always  aligned  with  FTP  boundaries. 

O)  let  the  period  of  any  cycU<-  process  in  the  system  be  expressible  as  AT^  •  AT^2* 
for  some  nunnegative  integer  k. 

(A)  if  k?Q,  then  synchronise  sny  cyclic  process  with  the  period*  of  length  A\  with  mil 
cyclic  processes  with  the  periods  of  length  AT.,  where  )ck,  such  thst  the  cycle 
boundaries  of  former  are  alwmys  aligned  with  those  of  the  latter. 

J.  Executive  Service  Soutines  srti  Pragrsm/Process  5'*lu*  Tables 

Each  time  when  the  processing  eleamnt  return*  the  execution  control  to  its  *T  executive,  s  subsystem 
of  this  executive,  called  s  schedu ter,  decides  which  process  presently  posted  on  one  of  the  ready 
queues  will  he  executed  next”  (A  ready  queue  can  be  thought  of  aa  a  list  c t  all  processes  of  the  same 
type  which  are  In  the  resiy  state.)  To  make  this  decision,  the  scheduler  follow*  the  select  ion  logic 
implied  by  the  priority  scheme  adopted  previously.  Control  specialist *s  parts  of  real-time  software 
interface  with  these  protVs*  management  facilities  via  a  set  of  subprogram*.  Which  sometimes  ate  called 
executive  service  toutlt;:.  Each  executive  service  routine  Is  an  uninterruptible  procedure,  the  execution 
of  Which  requires  a  negligible  amount  of  processor  time.  It  Is  invoetbl*  fro*  application  processes 
and  operates  on  data  stored  lo  program/ process  status  tables.  Functionally,  executive  service  routines 
tan  be  divided  into  two  type#:  those  Which  define/ redefine  a  process  for  s  program  listed  lo  the 
ptogrem/proceae  status  tables  (PPS)j  mod  those  iditch  change  the  stmt*  of  a  process. 

By  mm  earlier  ssauaptlo*  about  the  processing  environment,  each  program  Of  application#  part  of 
real-time  software  reslltes  at  mat  on*  process  sc  s  time.  Thus,  the  program/ process  status  table* 
may  be  vltuallted  e*  a  two -dimen*  -n«l  array,  each  row  of  Which  represents  a  program  and  each  column 
cf  Which  describes  an  attribute  of  iro grams  or  at  processes  defined  for  theae  programs.  The  r -tries 
in  s  row  of  PH  cablet  characterize  s  program  and  specify  the  stats  and  characteristics  of  the  process 
defined  for  this  program,  ft  Is  not  always  required  thst  a  process  he  defined  for  a  program.  If  the 
tatter  1*  true  at  soma  tins,  such  a  program  may  he  visualised  as  being  Inactive  at  thst  time,  which  is 
not  equivalent  to  a  possibly  Inactive  but  defined  process. 

The  function  of  an  aersoutlvs  service  routine  which  deflnes/redef  Inee  a  process  for  a  specified 
program  la  to  enter  the  characteriettcs  of  the  procees  to  he  defined  la  so  appropriate  row  of  the  PPS 
tablet.  Initially,  a  newly  def ioad/redef lord  process  la  always  declared  to  he  Inactive, 


Process  definition/redefinition  subprograms  enable  applications  processes  to  change  the  nature  of 
the  process  associated  with  a  program  by  changing  its  type,  execution  rate  (if  the  latter  is  applicable 
to  process  type),  prior'.- y,  or  other  attributes.  For  example,  a  program  which  at  soue  given  time 
realizes  a  cyclic  process  may  at  some  later  time  redefined  to  realize  a  deadline  process  (provided 
that  the  program  logic  allows  doing  It);  or  It  may  at  some  later  time  be  redefined  as  a  cyclic 
process  with  a  changed  period  and  a  changed  priority. 

One  could  consider  an  alternative  approach  in  which  process  types  are  defined  and  fixed  prior  to 
real-time  operations,  say,  perhaps  at  procesr  construction  or  program  load  tlt<e.  Such  an  approach 
inconveniences  a  control  system  specialist  because,  at  the  start  of  design,  he  often  is  not  sure  himself 
under  what  scheduling  ruj.es  various  algorithms  should  be  executed.  At  least,  the  capability  to  define 
the  process  at  cold  start  offers  a  control  system  specialist  a  design  convenience.  Furthermore,  the 
capability  to  redefine  in  real-time  the  process  for  a  program  enables  the  real-time  system  to  reconfigure 
its  mode  of  processing  after  a  partial  failure  of  equipment  or  after  a  drastic  change  in  operational 
environment. 

Executive  service  routines  which  change  the  state  of  a  defined  process  will  not  be  described  here 
in  detail.  It  Is  only  important  to  note  that,  for  each  process  type,  a  procedure  must  be  furnished 
for  every  state  transition  defined  in  the  process  state  transition  graph. 

E.  INTERPROCESS  COMMUNICATIONS  AMD  SYNCHRONIZATION 

1.  Introduction 

la  order  to  perform  common  tasks,  concurrent  processes  need  to  communicate  through  data.  In  the 
present  section,  we  discuss  the  interprocess  communication  problem,  as  well  as  the  related  process 
synchronization  problem,  by  taking  a  restrictive  approach  similar  to  our  earlier  handling  of  other 
cspects  of  real-time  excutive  design.  For  a  more  complete  treatment  of  the  subject,  the  interested 
reader  should  refer  to  a  recent  text  on  system  programming  or  on  operating  systems,  such  as  [Cof  73], 
(Fre],  [Gra],  IHans],  or  [Mad]. 

A  unit  of  data  exchanged  at  a  time  is  often  called  a  message.  In  the  applications  considered 
here,  a  typical  message  is  an  array  of  homogeneous  data  (such  as  a  Kalman  gain  vector/matrix),  or  a 
record  of  heterogeneous  data  vsucli  as  a  Kalman  gain  vector,  plus  an  identification  tag  of  the  measurement 
to  which  the  gain  vector  corresponds),  or  just  a  flag  indicating  occurrence  of  an  event. 

It  is  easier  to  understand  the  interprocess  communication  problem  if,  with  each  message  type,  one 
can  associate  an  area  in  global  data  memory  reserved  for  storing  a  single  or  several  instances  of  that 
message.  One  often  uses  the  terms  "buffer"  or  “communication  buffer"  when  referring  to  such  a  dedicated 
memory  area.  Typically,  a  message  contains  two  tynes  of  data:  communicated  (or  applications)  data 
and  communication  protocol  data.  The  function  of  protocol  data  is  to  control  the  accessing  of  the 
buffer  by  the  processes. 

Messages  of  a  certain  type  implement  one-way  communication  between  two  or  more  processes  if  each 
unit  message  is  entirely  produced  (written)  by  a  single  writer  process  and  may  be  consumed  (read)  by 
one  or  several  reader  process  as.  In  such  a  case,  a  buffer  for  storing  messages  may  contain  the 
communicated  data  produced  only  by  a  single  writer  process.  On  the  other  hand,  several  processes  may 
be  involved  in  generation  and  exchange  of  protocol  data. 

2.  Assumptions  and  Design  Principles 

Next,  we  Introduce  the  following  restrictive  assumptions  as  interprocess  communication  design 
principles: 

(1)  One-way  communications  -  only  one-way  communications  arc  admissible,  which 
implies  that  any  buffer  may  contain  applicationa  data  produced  only  by  a 
single  writer  process. 

(2)  Restriction  on  the  length  of  stay  in  a  critical  region  -  m  process  remains 
in  a  critical  region  longer  than  for  an  a  priori  prescribed  maximum  length 
of  time,  such  as  a  *cv  milliseconds.  (Recall  that  a  process  is  assumed 

to  be  noninterruptable  during  the  time  Interval  spanned  between  entering  and 
leaving  such  a  region.) 

(2)  Limited  waiting  for  the  reading  of  data  -  if  a  reader  process  during  its 

execution  reaches  a  point  where  it  tries  to  retrieve  interprocess  communication 
data  from  a  buffer  but  cannot  do  it,  becai.se  the  buffer  is  locked  out  by  a 
writer  process  which  presently  is  writing  into  that  buffer  or  because  the  buffer 
contains  no  new  data  that  the  reader  process  has  not  yet  read  (co.iauscd),  the 
reader  process  (perhaps  after  waiting  at  most  for  some  predetermined  length 
of  time)  proceeds  to  process  other  tasks  without  this  time  having  retrieved  the  data. 

(A)  Limited  waiting  for  the  writing  of  data  -  it  a  writer  process  during  its 

execution  reaches  a  point  where  it  wants  to  write  Interprocess  communication 

data  into  a  buffer  but  cannot  do  it  because  a  reader  process  is  presently 

accessing  that  buffer  or  because  the  buffer  is  full  and  its  contents 

are  not  supposed  to  be  overwritten,  the  writer  process  (perhaps  sftar  waiting 

at  moat  for  some  predetermined  length  of  time)  proceeds  to  process  other  tasks  without 

this  time  having  written  the  data. 


Principle  (3)  implies  that  on  occasions  (when  no  fresh  data  can  be  obtained)  old  data,  such  os 
old  Kalman  gains,  will  be  repeatedly  reused*  Similarly,  (4)  implies  that  sometimes  the  passing  of 
produced  data  will  be  skipped.  The  important  ides  here  is  that  the  processing  logic  must  be  designed 
not  to  fall  apart  if  the  situation  described  in  (3)  or  in  (4)  rises.  Only  the  overall  system  performance 
is  allowed  to  degrade  somewhat. 

Adherence  to  the  above  stated  principles  for  design  of  interprocess  communications  eliminates  the 
possibility  of  deadlocks,  for  then  the  four  necessary  and  sufficient  conditions  for  a  deadlock  (e.g., 

(Fre],  p.  157)  cannot  be  met. 

3.  Communication  Mechanisms  and  Their  Implementations 

Next  we  turn  to  specific  designs  of  communication  mechanisms,  examining  the  capability  of  each  to 
attain  mutual  exclusion  of  communication  processes.  In  the  sequel  we  examine  the  following  interprocess 
communication  techniques: 

o  Time -separated  communications  under  Che  control  of  a  single  flag. 

o  Communications  via  a  multiple  buffer  with  or  without  a  critical  region. 

o  Nonintenuptable  communications  via  a  critical  region  under  the  protection  of 
Dijkstra's  P  and  V  semaphores  or,  perhaps,  under  more  general  communication 
control  primitives. 

It  was  noted  in  the  discussion  of  mutual  exclusion  (Subsection  2  of  Section  III.B)  that  implementation 
of  communication  control  mechanisms,  which  we  called  communication  primitives,  requires  special  indivisible 
operations.  An  operation  was  said  to  be  indivisible  if  its  execution,  in:ludiug  the  accessing  of 
memory  during  its  execution,  cannot  be  interrupted.  An  indivisible  operation  may  be  implemented  on 
several  different  lev.als:  it  may  be  a  single  machine  language  instruction  such  as  a  test-ami- set 
instruction,  an  unlnterruptable  sequence  of  machine  language  instructions  resulting  from  the  compilation 
of  a  single  or  several  high-level  language  statements,  a  single  subroutine,  or  a  pair  of  subroutines 
and  a  program  segment  between  such  a  pair. 


As  an  aside,  the  following  two  unlnterruptable  test-and-set  instructions  are  useful  in  construction 
of  communication  primitives.  The  first  tests  whether  its  operand  flag  (the  contents  of  a  memory  locution) 
is  nonxero;  if  it  is,  then  this  instruction  sets  the  flag  to  tero  and  skips  the  next  instruction; 
else,  it  proceeds  to  che  next  instruction  without  having  changed  the  value  of  the  operand  flag.  The 
second  instruction  type  complements  the  first  in  the  following  sense:  it  testa  whether  the  operand 
flag  is  xcro;  if  it  is,  then  the  instruction  changes  the  value  of  the  flag  to  1  and  skips  the  next 
instruction;  if  not,  it  goes  to  ttie  next  instruction. 

A.  Time-Separated  Communications  wider  the  Control  of  a  Flag 

Use  of  a  single  flag  to  control  one-way  communication*  between  a  writer  process  W  and  «  reader 
process  R  is  the  simplest  of  all  three  communication  techniques  presented  here.  It  attains  mutual 
exclusion  of  processes  because  of  the  restriction*  which  it  impose*  on  the  participating  processes. 

To  define  one  possible  Implementation  of  this  communication  method,  let  BUFFER  be  the  name  of  the 
communication  buffer  atxl  CFLAG  he  the  namts  of  the  variable  reoresenting  the  control  flag.  The  write 
procedure,  executed  by  the  writer  proce**,  la  **  follow*: 


write: 

begin 

Tf  CFLAG  «  0  then  goto  alpha; 

elae  goto  beta; 

alpha: 

write  BUFFER; 

CFOS  i.  1; 

end 

beta: 

(the  next  executable  statement) 

With  the  same  variable  name*,  the  read  procedure,  executed  by  a  reader  process  is  oe  follows: 

read :  begin 

if  CFLAG  -  t  then  goto  alpha; 
else  goto  bate; 
alpha;  read  from  frOfFEU; 

CFtXo  :«  0; 
end 

bats:  (the  next  executable  statement) 


Despite  their  structural  simplicity,  the  cbovt  writ*  and  read  procedure*  should  to  be  used  only 
with  coetlta,  for  they  may  prevent  new  date  from  being  eotoreu  Into  the  communication  buffer  until  ite 
previous  content*  have  been  read.  On  the  ether  hand,  this  simple  technique  enforce*  mutual  exclusion 
of  caenunloatlng  process**  without  assuming  anything  about  their  relative  speeds.  It  i*  useful  when 
the  writer  and  reader  procaatau  are  cyclically  executed  at  the  earn*  average  rate,  because  information 
will  rarely  be  loet  then.  In  *ueh  e  cue*,  It  1*  convenient,  although  not  absolutely  ncceeaery,  to 
have  the  writer  process  precede  the  reader  process  1a  each  cycle. 


5.  Communication*  Vie  Multiple  Buffer* 


A  multi] 
structure  am 


■la  buffer  of  multiplicity  M  contains  M  date  areas  (each  celled  u  buffer!  of  identical 
t  eite.  Such  buffer*  ere  usually  circularly  arranged  lu  the  sense  that  the  writer  process , 
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■safe 
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after  writing  into  the  Mth  buffer,  next  write#  into  the  first  huffer;  similarly,  the  reader  process, 
after  reading  from  the  Mth  buffer,  switches  to  the  first,  Renee,  such  a  storage  scheme  is  often 
called  a  circular  buffer.  In  presence  of  fluctuations  in  execution  ratea  of  two  communicating  processes, 
coMumications  via  a  multiple  buffer,  due  to  extra  storage  capacity,  are  less  likely  to  loose  information 
than  communications  based  on  the  method  described  in  the  preceding  subsection,  ttitual  exclusion  is 
now  attained  by  not  allowing  the  reader  process  and  the  writer  process  to  simultaneously  access  the 
same  buffer.  However,  unless  the  communicating  concurrent  piocesses  mutually  exclude  one  another 
through  scheduling  control,  aafe  implementation  of  multiple  buffers  requires  the  use  of  protected 
critical  regions. 

6.  Uninterruptible  Communications  Via  Protected  Critical  Regions 

If  one  cannot  or  is  not  willing  to  make  any  assumptions  about  the  relative  speeds  or  execution 
times  of  communicating  processes,  then  the  following  technique  constitutes  a  general  approach  for 
controlling  the  access  to  s  global  data  set  by  two  or  several  concurrent  processus,  which  always  works 
(although  we  will  later  qualify  "always"): 

•  Kith  each  global  data  set  D,  accessed  by  at  least  two  processes,  associate  an  integer 
valued  acceaa  control  flag  S. 

a  Allow  each  proceaa,  Pj,  communicating  through  D,  to  access  D  only  from  within  critical 
regions,  CRj(S),  each  protected  (enclosed)  by  a  pair  of  communication  (synchronisation) 
primitives  operating  on  S.  Let  these  two  synchronization  primitives,  one  at  the  entry 
point  to  a  critical  region  and  the  other  at  its  exit  point,  be  integral  parts  of  the 
critical  region. 

•  Make  the  process  which  enter#  a  critical  region  unlnterruptable  from  the  moment  it 
starts  to  execute  the  first  instruction  of  the  entry  point  primitive  until  it  completes 
the  execution  of  the  last  instruction  in  the  companion  exit-point  primitive. 

Thus,  a  pair  of  corresponding  communication  primitives  is  a  mechanism  operating  on  S,  which, 
after  having  been  started,  is  executed  to  completion  without  interruption  and  which  can  be  executed  by 
only  a  single  process  at  a  time.  Flag  S  is  sn  integer  that  may  be  operated  on  only  by  a  synchronization 
primitive  or  by  a  special  procedure  (the  latter  could  be  part  of  compilation  or  part  of  cold-start 
initialisation)  designed  '0  initialise  S. 

It  can  be  shown  that  the  following  primitives,  derived  from  Dljkaira'a  F  and  V  synchronisation 
semaphores,  handle  til  mutual  exclusion  and  synchronization  problems  encountered  in  multi programmed 
aultiproraasor  systems  ox  the  type  considered  here  (following  the  notation  given  on  p,  U9  of  (FreJ, 
all  operation’}  enclosed  below  within  a  pair  of  brackets  are  assumed  to  be  unlnterruptable ) : 

WAtT(S)  :  (S  :•  S  -  Is  if  Srf>  than 

place  the  process  which' "called  WAIT  on  a  wait  queue, 

Q, ,  and  release  the  processor  to  another  proceaa:  elaa 
enter  the  critical  region.) 


SICHAL(S)  »  JS  i*  8  ♦  Is  it  S<0  then  remove  a  process  from  %  and 
change  its  state  to  "'""ready”  for  processor  allocation.) 


For  such  a  pair  of  synchronisation  primitives,  on*  can  define  an  initialisation  function,  INIY(*,v), 
which  Initializes  semaphore  x  to  value  v.  thue,  if  S  is  initialized  to  )  by  executing  INIT(S,i),  the 
write  process  H  and  the  te«d  proceaa  R  can  use  the  following  procedures  to  coasts:  lent#  via  the  date 
written  into  (road  from)  the  date  *et  0.  Barf:  time  that  process  V  wants  to  write  into  8UFH5R,  it 
execute*  a  cods  segment  of  the  font: 


begin 

"WUT(S); 
write  into  D; 

mm$n 

end: 


A  criticcal  region 
for  pro©***  W 


Similarly,  each  time  when  proceed  *  wants  to  read  from  data  set  ft,  it  must  execute  a  cod*  segment, 
such  sat 


WAIT  (S); 
read  free  ft; 

Timm* 

end 


A  critical  ration 
for  prone##  k 


it  it  steuttftd  that  the  global  variable  S  in  known  as  *  semaphore  to  both  of  the  shove  code  augment*, 
For  a  discussion  of  other  (*t*i  tor*  general)  uses  of  the  synchronisation  primitives  WAIT  end  SICHAL, 
a*  ’*ell  «»  for  definitive#  of  other  synchronization  primitives,  the  Interested  reader  is  referred  to 
Chapter  A  of  (Fre). 

in  a  foregoing  paragraph,  we  assarted  that  the  above  described  communications  control  mechanism 
is  sufficient  for  the  processing  environment  assumed  hers,  the  key  to  that  la  the  assumptions  (deaigr. 
principles)  stated  in  subsection  I  of  this  section,  especially  the  second  assumption,  ercntdlig! 
to  rfsirft  a  process  i*  not  supposed  to  remain  in  •  critical  region  longer  than  some  «  priori  defined 
length  of  time.  Adherence  to  this  principle  prevent#  deadlock*.  This  principle  must  be  enforced  at 
design  time  (at  ewcut too  time  a  process  is  untntsrruptable  rfwn  it  la  inside  a  critical  region  and  so 
cannot  be  forced  out  of  it)  by  exercising  ears  about  the  executable  coda  which  is  places. inside  critical 


regions.  In  applications  considered  here,  only  the  code  segments  requiring  a  limited  amount  of  processor 
time  and  needed  for  writing  data  into  a  communication  buffer  (or  for  copying  it  from  such  a  buffer) 
are  allowed  within  a  critical  region. 

Finally,  we  want  to  say  a  few  words  about  the  implementation  of  communication  (or  synchronization) 
primitives.  First,  the  form  in  which  they  are  implemented  depends  on  the  programming  language  used. 

If  assembler  language  or  a  high-level  language  such  as  Fortran  is  used,  these  primitives  should  be 
implemented  as  an  integral  part  of  the  real-time  executive  service  facilities  in  the  form  of 
unlnterruptable  subprograms.  If  a  programming  language  such  as  ADA  [Weg],  tdilch  is  designed  for 
multiprogrammed  task  execution,  is  used,  then  these  or  similar  primitives  are  furnished  as  facilities 
built  into  the  programming  language. 

IV.  ALGORITHMIC  AND  PROCEDURAL  ISSUES  IN  DESIGN  OP  REAL-TIME  ESTIMATORS 
A.  INTRODUCTION 


In  the  first  pert  of  present  exposition,  we  examined  computer  implementation  aspects  of  real-time 
control  system  design.  In  the  course  of  doing  it,  we  described  software  architecture  for  the  real-time 
estimators  to  be  implemented  on  a  distributed  system  of  small  computers.  The  ideas  on  implements! 
design  presented  up  to  this  point  applied  not  only  to  realtime  estimators  but,  more  generally,  to  a 
variety  of  real-time  control  or  communication  systems. 

In  the  second  part,  i.e.,  in  Sections  IV  and  V,  we  narrow  down  our  focus  to  real-time  estimators. 

We  do  it  in  two  steps.  The  present  section  reviews  selected  Issues  pertaining  to  the  design  of  computational 
algorithms  and  procedural  logic  for  Kalman  filters,  although  nearly  all  ideas  will  also  apply  to  other 
types  of  recursive  real-time  estimators.  Finally,  Section  V  will  illustrate  filter  mechanizations 
resulting  from  several  known  workload  partitioning  schemes.  Typically,  we  end  up  with  schemes  requiring 
multiprogrammed  processing  environment.  Concepts  and  techniques  discussed  in  Sections  II  and  III  can 
then  be  applied  to  complete  the  implemental  design  of  estimator  software. 

It  is  difficult  to  be  objective  and  sufficiently  broad  in  selection  of  algorithm  and  procedure 
design  Issues:  what  is  Important  to  one  designer  often  is  determined  by  his  background  and  Interests, 
and  may  appear  to  be  insignificant  to  another.  In  our  selection,  we  were  guided  by  what  we  viewed  as 
being  critical  to  the  real-time  estimators  of  the  type  considered  here.  These  factors  are  (1)  modeling 
of  the  estimation  problem,  (11)  computational  algorithms  for  implementing  the  covarlance/galn  filtering 
portion  of  the  estimator,  (ill)  system  Identification  in  real-time,  (lv)  increasing  the  robustness  of 
the  estimation  process  against  the  perturbations  such  as  bursts  of  high  amplitude  noise  due  to 
environmental  disturbances,  sensor  failures,  or  sudden  and  drastic  changes  in  the  system  model. 

It  Is  difficult  to  exaggerate  the  importance  of  modeling.  But  modeling  depends  on  s  particular 
problem.  Hence,  we  shall  not  discuss  it  here.  The  purpose  of  the  present  section  is  to  remind  the 
resdur  about  and  to  comment  on  ths  other  issues  identified  above.  However,  since  they  ere  well  covered 
in  recent  literature  and  really  do  not  belong  to  our  main  theme,  we  shall  not  discuss  any  of  them  in 
detail.  Rather  we  shall  refer  the  interested  reader  to  recent  literature  on  topics  related  to  these 
Issues. 

B.  FILTER  ALGORITHMS 


In  s  Kalman  filter,  filter  algorithms  sra  ths  computational  algorithms  which  perform  covariance 
and  gain  processing  and,  aftsr  having  bsen  programmed,  implement  s  critical  kernel  of  real-time  estimation 
software.  They  are  critical  mainly  for  two  reasons:  (1)  they  may  potentially  destabilise  the  estimation 
process  or  prevent  it  from  converging  end  (11)  they  may  require  an  excsaslve  amount  of  processing  time 
and  memory. 

Covariance  and  gain  (C/C)  processing  (filtering)  algorithm*  perform  the  following  fractions: 
time  propagation  of  state  error  covariance  matrix  P,  computation  of  Kalman  gains,  and  measurement 
updating  of  P.  The  current  practice  is  to  structure  an  estimation  scheme  so  that  the  measurements  in 
an  actuation  cycle  arc  processed  sequentially  one  by  one  and  the  Kaleen  gains  ere  computed  end  atete 
error  covarlencea  updated  for  each  measurement  separately.  Such  *  scheme  la  called  acquentlal 
processing  of  measurements. 

Appendix  A  summarises  the  original  form  of  C/C  processing  algorithms  for  s  linear  Kalman  filter 
with  discrete  measurements.  Unfortunately,  the  mass  virement -update  operation  in  ths  original  form  of 
Kalman  filter  algorithms  (Equation  (7)  in  Appendix  A)  is  potentially  unstable.  Roundoff  error*  may 
eventually  make  the  state  error  covariance  matrix  acquire  negative  characteristic  roots  and,  thus, 
loss  its  positive  definiteness.  Rsncs,  the  criticality  of  numerical  stability  requirement  In  applications 
considered  harm  motivates  the  use  of  square  root  filtering  algorithms  for  coverienee/galo  processing. 

Severe!  variations  of  square  root  filtering  ere  known.  The  version  which  has  been  defined 
end  refined  largely  by  Bleraan  la  summarised  In  Appendix  B.  (References  (Bis)  end  (Tho|  describe  to 
whet  m  shall  refer  *a  Herman's  method  and  give  timing  end  elxlng  models  for  it.  The  first  of  those 
references  also  discuses*  applications  of  square-root  filtering  technique*  to  information  matrix 
estimation.)  There  it  .wore  controversy  In  Utt-raturs  about  which  particular  form  of  square-root  filtering 
should  *j*  used.  Carlson  (Car)  describee  whet  could  be  viewed  as  an  alternative  to  Blarman's  method. 

Which  1*  also  attractive.  Our  selection  of  Bletman't  method  has  been  motivated  mslnly  by  year*  of 
satisfactory  experience  with  It  in  applications  to  navigation  problems.  In  any  ease,  saving  just  a  few 
percent  of  processing  time  should  net  be  the  decisive  criterion  for  using  one  sat  of  algorithms  Instead 
of  another. ■ 

For  a  "neutral*  overview  of  available  options  in  square  root  filtering,  the  Interested  reader  le 
referred  to  Chapter  1  of  (flay),  Which  also  summarise*  comparative  timing  data  for  better  known  variants. 


One  benefit  derived  fro*  the  use  of  numerically  stable  covariance  and  gain  processing  algorithms 
Is  the  feasibility  to  Implement  them  and  to  make  them  perform  In  single  (or  reduced)  precision 
floating-point  arithmetic  except  for  computations  of  some  dot  products.  In  a  microprocessor,  this 
often  saves  not  only  memory  but  also  processing  time  because  of  the  relative  disparity  In  the  speeds 
of  single  and  double-precision  floating-point  operations.  This  disparity  in  processing  speeds  becomes 
especially  large  If  the  microprocessor  does  not  have  floating  point  arithmetic  Implemented  in  hardware 
form. 

C.  SYSTEM  IEENTIFICATIOH 

1.  Identification  Problems  In  Real-Time 

We  use  the  term  "system  Identification”  in  a  restricted  eense  to  indicate  acquisition  of 
knowledge  about  the  distributional  properties  of  the  stochastic  processes  representing  the  process  and 
measurement  noise  sources  In  a  Kalman  filter. 

In  practice,  one  usually  assumes  that  each  noise  source  Is  represented  by  a  stochastic  process 
from  some  particular  class  of  processes.  In  such  a  case,  the  Identification  problem  reduces  to 
determination  of  the  parameters  which  define  a  particular  process  in  the  assumed  class.  Two  distributional 
parameters  that  are  usually  of  Interest  are  the  mean  and  the  covariances  of  the  stochastic  process. 

These  quantities  may  not  be  time  Invariant  and  so  their  values  nay  have  to  be  updated  repeatedly.  As 
the  system  model  of  a  Kalman  filter  in  Appendix  A  Indicates,  each  noise  source  is  typically  modeled  as 
a  white,  aero-mean  gaussian  process  with  unknown  variances  or  covariances. 

Should  there  be  any  suspicion  that  the  process  representing  a  noise  source  has  a  nonzero  mean  of 
unknown,  but  significant  value,  the  unknown  mean  should  be  Included  in  the  system  model  as  a  state 
variable  and  estimated.  Most  often,  the  unknown  parameter  whose  value  la  sought  is  a  noise  variance 
(for  a  scalar-valued  process)  or  a  noise  covariance  matrix  (for  a  vector-valued  process). 

In  cases  of  sequentially  correlated  noise,  one  also  must  effectively  estimate  autocorrelation 
coefficients  In  order  to  whiten  the  noise. 

2.  Identification  Methods  for  a  Kalman  Filter 

In  the  presence  of  colored  noise  (as  is  pointed  out  In  Chapter  11  of  [And]),  retention  of  optimality 
properties  of  the  filter  is  usually  possible,  although  at  the  expense  of  Increased  complexity.  This 
reference  Illustrates  a  few  special  cases  (such  as  a  situation  In  which  the  measurement  noise  process 
is  Markov)  and  techniques  for  handling  them  which  save  the  optimality  properties  of  filter  without 
increasing  the  dimensions  of  the  state  vector.  Another  approach  Is  to  replace  the  filter  with  one 
that  la  less  complex  by  means  of  model  order  reduction.  References  [And]  and  [May]  probably  are  the 
best  Introduction  to  the  subject. 

Methods  for  Identifying  the  unknown  covariances  of  noise  processes  csn  be  roughly  divided  Intot 
(1)  adaptive  (l.e.,  estimation  time)  methods,  (2)  heuristic  on-line  methods,  and  (3)  a  priori  modeling 
methods.  Occasionally,  several  methods  are  combined. 

3.  Adaptive  Identification  Methods 

In  Kalman  filtering,  the  ten  "adaptive  estimation"  usually  refers  to  on-line  estimation  techniques 
which  include  estimation  of  unknown  distributional  parameters  In  noise  models.  During  the  past  decade, 
many  adaptive  estimation  echamaa  ware  Investigated  and  the  results  of  this  research  reported  In  literature, 
e.g.,  refer  to  Brewer  [Bre]  or  to  Ohap  and  Stubberud  [Ohs]. 

Unfortunately,  these  on-line  system  Identification  technique*  nontrlvlally  lncreeae  the  proceeslng 
load  In  almost  all  casta  and  to  may  become  prohibitively  expensive  with  reapect  to  proceaalng  time. 

For  example,  any  meaaurement  biaa  (l.e.,  nontaro  mean  of  measurement  noise)  In  principle  can  be  estimated 
by  modeling  It  ea  a  component  of  ths  system  stats  vector)  but  the  computational  load  due  to  the  processing 
Kalman  gains  and  covariances  in  a  filter  la  roughly  proportional  to  Kn^  operations,  idler*  n  1*  the 
length  of  mtatm  vector  and  K  Is  a  scaling  factor  which  depends  on  a  particular  algorithm  used. 

The  adaptive  techniques  become  even  more  ccaputetlonally  expensive  when  the  nolee  rendem  process 
under  consideration  la  nonstationary  or  sequentially  correlated.  Then  It  is  not  enough  to  estimate 
the  unknown  distributional  parameters  once,  say  as  the  start  of  ths  estimation  process,  and  than  to 
continue  using  the  obtained  parameter  estimates  throughout  the  remaining  part  of  the  estimation  process) 
but  there  la  a  need  than  to  continue  the  eetlaetlon  of  chenglng  dletrlbut tonal  parameters  throughout 
the  estimation  process.  Furthermore,  In  some  applications  such  me  missile  dynamic*  during  powered 
flight,  the  noise  characteristic*  may  change  so  rapidly  that  evan  with  almost  unlimited  processing 
resources  It  t*  impossible  to  input  ths  measurements  needed  for  system  identification  at  a  sufficiently 
high  rat*. 

4.  Heuristic  On-Line  end  A  Priori  Modeling  Methods 

Current  microprocessor  end,  In  particular,  special  chip  technologlaa,  aided  by  modern  methodology 
of  software  design  end  Implementation,  have  mad*  heuristic  approach  to  eye  tea  Identification  attractive. 
This  approach  utilize*  the  following  two  Ideas. 

First,  with  some  planning,  on*  can  design  the  sensor*  and  other  measurement  Input  port*  (or  st 
least  the  digital  controller*  of  these  devices)  for  e  reel-time  estimation  system  so  as  to  meka  them 
produce  amtr*  information  In  addition  to  the  "regular"  measurement*  specified  In  the  systw  model. 

Usually,  such  extra  Information  can  be  obtained  at  little  additional  coat  am  a  byproduct  of  regular 
measurement*.  This  extra  information  la  often  Intend  ad  to  help  the  estimation  process  (1)  promptly 


detect  a  change  In  the  characteristics  cf  a  noise  process  or,  more  generally,  in  system  state; 

(2)  accurately  approximate  the  values  of  process  and  measurement  noise  covariances. 

Secondly,  real-time  test  equipment  (capable  of  creating  a  vide  spectrum  of  possible  operational 
environments  and  producing  close-to-real-life  measurements  plus  their  extras)  can  be  utilized 
to  calibrate  the  noise  parameters  as  a  function  of  the  received  extra  Inputs  for  quick  computation  of 
covariances. 

Next  we  illustrate  applications  of  the  heruristic  modeling  techniques  outlined  above  to  two  estimation 
problems  in  GPS  navigation.  (Appendix  C  defines  a  simple  version  of  the  estimation  problem  for  GPS 
navigation.)  Several  models  of  GPS  navigation  equipment  have  been  or  are  being  developed.  A  typical 
set  of  GPS  user's  equipment  is  built  around  a  system  of  microprocessors  and  utilizes  a  specially 
designed  receiver  for  obtaining  pseudo-range  and  delta  pseudo-range  measurements  at  a  high  repetition 
rate.  The  receiver  passes  these  measurements  to  a  microprocessor  based  estimation  system.  The  latter 
recursively  produces  a  primary  navigation  solution  (i.e.,  estimates  the  state  vector)  from  which  other 
navigation  quantities  of  interest  can  be  derived  as  byproducts. 

EXAMPLE  1.  One  type  of  GPS  navigation  equipment  was  developed  as  part  of  test  instrumentation  for  a 
long-range  missile  (Gyl  80).  Analysis  of  the  process  noise  showed  that  all  dynamics-related  elements 
of  process  noise  covariance  matrix  were  expressible  in  terms  of  a  single  parameter,  the  acceleration 
variance.  During  a  short  powered  flight,  each  of  the  three  engines  of  the  missile  undergoes  an 
acceleration  peak  and  an  acceleration  valley;  after  the  missile  goes  into  the  coasting  flight,  nearly 
all  acceleration  is  due  to  gravitational  attraction,  which  thereafter  changes  very  slowly.  In  early 
design,  several  adaptive  process  noise  covariance  identification  techniques  were  tried.  They  responded 
too  slowly  and  were  too  expensive  computationally.  Thereafter,  it  was  decided  to  Instrument  the  system 
so  as  to  provide  the  estimator  with  a  discrete  warning  (completion)  signal  before  (after)  each  event 
that  drastically  affected  the  acceleration  (e.g.,  liftoff  or  a  change  in  engine).  Special  real-time 
test  equipment  —  producing  not  only  realistic  GPS  pseudo-range  and  delta  pseudo-range  measurements 
throughout  a  test  mission  but  also  the  above  described  discrete  event  warning  (completion)  signals  — 
was  used  to  select  experimentally  the  best  possible  acceleration  variance  for  each  segment  of  a  test 
mission. 

EXAMPLE  2.  This  example  deals  with  GPS  navigation  equipment  for  a  medium-dynamics  user  [War].  The 
GPS  receiver  of  this  navigation  equipment  is  designed  to  produce,  in  addition  to  GPS  satellite  pseudo¬ 
range  and  delta  pseudo-range  measurements,  several  parameters  for  computing  the  measurement  noise 
variances;  also  the  velocity  variance  from  which  all  dynamics-related  process  noise  covariances  can  be 
directly  computed. 

D.  INCREASING  TEE  ROBUSTNESS  OF  AN  ESTIMATION  PROCESS 

Judging  by  a  large  number  of  current  publications,  much  Interest  has  been  recently  shown  in  robust 
statistical  inference,  including  robust  estimation  (or,  to  be  more  specific,  regression),  the  objectives 
of  which  are  to  handle  the  situations  in  which  classical  methods  do  poorly;  for  extmple,  classical 
regreasion  methods  have  difficulties  with  outlier  and  collinear  data.  What  is  nice  about  "off-line" 
statistical  analysis  is  that,  if  one  method  of  Inference  leads  to  suspicious  results,  the  statistician 
can  always  try  another  one  on  the  original  data.  In  real-time  estimation,  however,  we  do  not  et\joy  «*- 

this  luxury:  data  is  processed  at  about  its  arrival  rate;  if  the  on-line  analysis  of  data  falls,  it 
may  be  physically  impossible  or  too  expenalve  to  repeat  the  experiment.  This  strongly  motivate*  us  to 
Increase  the  robustness  of  an  on-line  estimation  process.  Proper  preprocessing  and  screening  of 
measurements  contribute  to  it.  Thus,  while  designing  a  real-time  estimation  scheme,  one  should  always 
examine  whether  the  considered  appllcttlon  requires  apecial  procedures  for  (1)  screening  the  measurements 
against  isolated  outliers,  (2)  detecting  th*  leading  and  trailing  edges  of  high-amplitude  noise  bursts, 

(3)  detecting  the  onset  of  and  then  taking  appropriate  measures  against  nonwhlteneaa  in  measurement 

noise,  and  (4)  censoring  (imposing  bounds  on)  measurements.  One  should  also  examine  whether  any  special  ** 
procedure*  are  required  for  detecting  the  onset  of  a  drastic  change  in  the  system  model  and  for  taking 
appropriate  measure*  against  detected  changes. 

\ 

One  area  which  should  b*  examined  for  each  application  i*  whether  it  is  necessary  to  have  procedure* 
for  monitoring  the  estimates  and  for  altering  them  in  case  their  value*  exceed  a  predetermined  range. 

This  simple  heuristic  technique,  known  to  statistician*  as  censored  estimation,  has  aaved  the  situation 
in  several  known  applications. 

(Sch)  la  a  highly  readable  reference  which  complement*  the  discussion  of  algorithm  and  procedure 
design  laaues  in  th*  present  section.  Its  discussion  of  the  balancing  of  covariance  matrices  for 
filter  convergence  end  stability  is  especially  noteworthy. 

V.  IE  COMPOSITION  OF  A  KALMAN  FILTER  INTO  CONCUSS*  NT  PROCESSES 

A.  MOTIVATION  AND  OVERVIEW 


Sections  II  and  III  outlined  software  engineering  techniques  for  decomposing  a  real-time  control 
problem  into  concurrent  process**.  In  applications  considered  hare,  such  decomposition  may  enable  an 
estimation  schema  to  satisfy  eh*  real-time  constraints  of  the  problem  on  a  distributed  eystsm  of  small 
computers.  Specified  limitations  on  the  weight,  the  volume,  the  power  consumption,  or  cost  of  equipment 
often  do  not  allow  extending  a  distributed  eye  tea  through  addition  of  extra  processing  element*. 

Thus,  trying  to  satisfy  th*  real-time  constraint*  by  incrementing  the  equipment  la  often  unacceptable. 

In  such  a  situation,  decomposing  th*  workload  into  concurrent  process**  ia  the  only  racouree. 

The  present  section  illustrates  this  approach  by  eeena  of  several  schemas  for  decomposing  Kalman  filters. 
Such  decomposition  of  a  Kalsan  filter  (or  of  any  recursive  estimator)  into  concurrently  executable 
procedure#  often  constitute*  pert  of  whet  is  known  aa  filter  mechenleetlon.  *  term  we  mainly  reserve 
lor  filter  structuring. 


In  the  sequel,  we  first  Introduce  two  basic  structural  formulations,  direct  and  indirect,  of  a 
Kalman  filter/  Each  can  be  used  as  a  basis  for  decompositions  presented  subsequently.  Next,  in  orde. 
to  establish  rationale  and  common  reference  for  discussion  of  decompositions,  we  th*  ;ng 

tasks  comprising  a  single  estimation  cycle  of  a  sequentially  structured  filter.  Finally,  we  examie 
several  scheL.  for  decomposing  the  computations  of  such  a  filter  into  concurrent  processes. 

B.  DIRECT/INDIRECT  FORMULATIONS  AND  FEEDFORWARD/FEEDBACK  MODUS  OF  USE 

1.  Direct  and  Indirect  Mechanizations 

Two  alternate  approaches  for  formulating  a  Kalman  filter  are  known.  In  the  £i”J' 

(or  total  state)  formulation,  the  state  vector  s,  which  describes  the  total  state  of  the  system,^ 
directly  estimated;  i.e.,  in  each  estimation  cycle,  s  is  first  time-propagated  and  then  measur^nt-updated . 
In  the  Second  approach,  called  indirect  (or  state^rror)  formulation,  a  Kalman  *stimates  not 

the  system  total  state  vector  s  but  the  error  5s  in  a.  Thus,  if  an  indirectly 

used,  each  estimation  cycle  involves  three  major  steps:  time-propagation  of  ±»  ^  °  p”  the 

is  in  e;  and  updating  of  the  propagated  value  of  s  by  subtracting  frcm  it  the  estimate  of  fle¬ 
es  timation  cycle  with  reference  time  tk,  the  last  step  can  be  symbolically  written  as 

s(k|k)  -  3(k|k-l)  -  fi£(k|k). 

Literature  suggests  that  indirectly  formulated  Kalman  filters  were  first  introduced  in  navigation 
systems,  although  such  a  filter  can  be  used  in  nearly  any  situation  to  which  a  directly  formulated 
filter  also  applies. 

2.  Feedforward  and  Feedback  Modes  of  Use 

There  are  two  basic  modes,  illustrated  in  Figure  V.B-1,  for  using  a  Kalman  filter  (or  any 
recursive  estimator)  in  a  control  system:  feedforward  use  and  feedback  use. 


T*H  1M*Y  l«  UMO 


Figure  V.B  1.  Three  Formulation  and  Uwe  Mode  Combinations 
of  a  Kalian  Filter 

A  recursive  estimator,  used  In  feedback  mode,  feeds  some  of  Its  estimates  beck  Into  ^control 
mtm  ttm  which  it  la  r««iviaf  waaurwatnta*  *««db*ck  aachtnitatlon  la  *14«ly  utad  in  ihtagratad 
navigation  systems,  l.e.,  in  navigation  systems  built  around  a  recursive  estimator  which  °P*"t** 
measurements  from  several  different  types  of  sensors  or  measurement  subeyeteme.  Consider,  for  example, 
an  integrated  navigation  system  operating  on  two  typee  of  measurements:  (1)  position,  velocity,  and 
acceleration  outputs  of  an  inertial  measurement  subsystem  end  (2)  pseudo-range  end  delta  pseudo-range 
Masuramenta  of  CPS  satellites.  A  loop  tdilch  feeda  back  the  estimation  outputs  into  the  inertial 
measurement  eubeystme  could  be  used  to  recalibrate  the  Inertial  eubeyst*  tdtlle  the  0P9  measurements 
are  belt*  received  art  when  they  produce  excellent  navigation  data.  Another  feedback  loop  ctxild  be 


used  for  feeding  aiding  data  to  the  GPS  sensor  (a  special  receiver)  for  quick  reacqulsltlon  of  GPS 
space  vehicles  (SVs) .  (Estimates  of  SV  ranges  and  their  tine  derivatives  are  the  quantities  which  aid 
acquisition  of  a  new  SV  or  reacqulsltlon  of  an  SV  tdiose  track  had  been  lost.)  With  feedback  mechanization, 
It  Is  often  more  convenient  (although  not  mandatory)  to  use  an  indirectly  mechanized  Kalman  filter. 

Part  c  of  Figure  V-B-l  graphically  summarizes  the  above  described  filter  mechanization,  with  the  “AIDED 
SENSORS  AND  tCASCRElCNT  SUBSYSTEMS"  box  representing  both  the  inertial  measurement  subsystem  and  the 
GPS  receiver  and  with  the  "UNAIIED  SENSORS  AND  MEASUREMENT  SUBSYSTEMS"  box  standing  for  other  possible 
(unidentified)  measurement  sources,  say,  such  as  an  altimeter. 

In  the  example  of  the  preceding  paragraph,  if  the  user  of  equipment  were  Interested  only  in  accurate 
estimates  of  navigation  quantities  (such  as  position  or  velocity)  and  if  he  were  indifferent  to  the 
calibration  of  the  inertial  measurement  subsystem,  he  could  have  the  filter  mechanized  In  feedforward 
fashion,  as  illustrated  in  Part  b  of  Figure  V.B-1.  In  the  latter  case,  the  filter  estimates  errors  in 
the  navigational  outputs  of  the  Inertial  measurement  subsystem  (IMS),  vAiich,  in  Part  b  of  Figure  V.B-1, 
is  the  box  outputting  mi*  Estimates  of  these  errors  are  then  subtracted  from  the  navigational  outputs 
of  the  IMS.  Note  that  in  the  present  case,  (1)  the  outputs  of  the  IMS  are  shown  to  be  used  by  the 
Kalman  filter  for  time-propagation  of  the  navigation  state  vector  e  and  (ii)  the  GPS  receiver,  the 
outputs  of  which  are  represented  by  mj,  is  not  being  aided  (which  is  not  a  realistic  assumption)  by 
the  Kalman  filter.  Similarly  to  the  feedback  mechanization.  It  Is  often  more  convenient  (as  Indicated 
In  Part  b  of  Figure  V.B-1)  with  the  feedforward  mechanization  to  use  Indirect  formulation  of  a  Kalman 
filter. 

Care  must  be  exercised  In  the  design  of  a  recursive  estimator  used  in  feedback  fashion,  A  feedback 
loop  may  become  unstable,  which  sometimes  will  manifest  itself  through  sequentially  correlated  (nonwhite) 
measurement  noise. 

3.  Pros  and  Cons  of  Indirect  Formulation 

In  many  applications  of  the  type  considered  here  (such  as  navigation  systems),  an  indirectly 
formulated  Kalman  filter,  designed  to  estimate  low-frequency  errors,  can  be  executed  at  a  considerably 
lower  rate  than  the  rate  at  which  a  direct  filter  would  have  to  be  run  in  order  to  perform  comparably. 

This  la  because  a  linear  model  is  uften  adequate  for  representation  of  low-frequency  error  dynamics. 

On  the  other  hand,  indirect  formulation  of  a  parallely  structured  filter  usually  costs  more  In  processor 
time  overhead  than  does  direct  formulation,  because  the  error  estimates  in  such  a  filter  generally 
must  be  time-propagated  in  order  to  match  their  reference  times  with  those  of  the  state  vector  at 
each  update  of  the  latter. 

C.  REVIEW  OP  PROCESSING  TASKS  IN  A  SEQUENTIALLY  STRUCTURED  KALMAN  FILTER 

1.  Sequentially  Structured  Filter 

Before  discussing  parallely  structured  estimators,  we  briefly  review  the  processing  teeks  comprising 
a  single  estimation  cycle  of  a  sequentially  structured  Kalman  filter.  We  assume  that  auch  a  filter 
processes  measurements  sequentially  as  scalars  and  that  It  la  Indirectly  formulated.  The  standard 
Kalman  filter  equations  summarised  in  Appendix  A,  then,  suggest  decomposition  of  work  at  time  tvt  into 
the  following  tasks i 

(1)  Time-propagation  of  state  vector  9  from  t^.j  to  t^;  i.e.,  computation  of  S(k>”  ^  S(k|k-1). 

(2)  Tlma-propagetion  of  state  error  covariance  matrix  P[  i.a.,  computation  of  P(k)“  ^  P(k|k-1). 

(3)  Settingmp  of  the  measurement  processing  loop,  which  (depending  on  ths  type  of  filter 

.  'mechanisation)  may  Include  activities  such  at  the  clearing  of  state  error  estimate  vector  i_t, 
the  saving  of  state  data,  ate. 

(A)  Measurement  processing  loop,  each  pax?  through  which  proteases  scalar  measurement  ej(k)  (1 
•  l,2...n»)  end  requrles  execution  of  the  following  cask*! 

.  (4.1)  Battiog-up  of  indices  sod  logic  for  the  1th  pass  through  the  loop. 

(4.2)  Preprocessing  of  m^(k),  which  may  Include  its  conversion,  transformations,  end 
prefilter lug. 

(4.3)  Computation  of  predicted  measurement,  linear  (or  linearised)  state-to-eeeeurement 
transformation  vector  h(k),  and  measurement  residual. 

(4.4)  Screening  of  measurement  m*(k)  for  acceptance/rsjectiuo  by  means  of  residual 
analysis. 

(4.5)  Maakursmant-updaftng  of  state  error  covarisnce  matrix  P[*uch  that  the  update 
P(k)+  •  P(k|k)  is  completed  in  the  laet  pmse  through  the  measurement  loop)  end 
computation  of  Kalman  gain  vactor  Kt(k)  for  mi(k). 

(4.6)  Updating  of  the  estimate  of  state  error  Sjrik)  [such  that  tha  output  of  thia  step 
in  tha  last  pass  through  tha  measurement  loop  is  6?(k|k)). 

(4.7)  End-o£-itaratioo  procaaalng. 

(5)  Measurement  updating  of  state  vector,  i.e.,  computation  of  i(k)+  A  £(k|k)  -  i(k|k-l)-6i(k,k). 


In  the  sequel,  the  above  talks  (4.1)  through  (4.4),  (4.6),  and  (4.7)  will  be  collectively 
referred  to  as  measurement  Incorporation;  the  above  tasks  (2),  (4.5),  and  modified  forns  of 
(4.1),  (4.3),  and  (4.7),  as  covarlance/galn  filtering. 

2.  Departure  from  Sequentially  Structured  Filters 

After  decomposition  of  workload  Into  the  above  Identified  or  similar  processing  tasks  and  after 
obtaining  tlnlng  and  siting  estimates  for  each  task,  these  tasks  can  be  recombined  Into  concurrently 
executable  processes  in  many  different  ways.  With  timing  and  sizing  data  for  each  Identified  task 
available,  It  Is  easy  to  predict  not  only  the  processor  time  requirements  for  processes  formed  from 
these  tasks  but  also  the  memory  requirements  for  the  programs  and  data  sets  Implementing  these  processes. 

This  leads  to  candidate  filter  structures.  The  designer  next  faces  the  problem  of  selecting 
the  most  suitable  structure  (or  set  of  structures)  for  the  problem  at  hand.  Re  may  select  a  set  of 
structures,  each  suitable  to  a  particular  mode  or  phase  of  his  estimation  problem,  and  then  apply  the 
process  control  facilities  of  the  real-time  executive  outlined  in  Section  III  to  have  processes  adaptively 
reconfigured  (redefined)  In  real-time. 

The  price  to  be  paid  for  solving  the  scheduling  problem  via  structures  of  concurrent  processes  is 
poorer  functional  performance,  because  It  leads  to  algorithms  which  usually  are  less  "optimal"  than 
their  original,  sequentially  executable  forms.  Thus,  one  task  which  the  designer  now  confronts  is  to 
determine  by  how  much  the  performance  actually  suffers.  This  Is  normally  done  by  means  of  simulations. 

D.  PARALLEL  STRUCTURES 

1.  Introductory  Remarks 

Next,  we  Illustrate  several  solutions  to  the  real-time  constraint  problem  through  workload  decomposition 
Into  concurrent  processes.  For  this  purpose,  we  outline  several  ways  for  parallel  structuring  of 
Kalman  filters. 

Figure  V.D-1  Illustrates  how  a  slight  restructuring  of  the  procedure  used  In  the  sequential  model 
of  Section  V.C  changes  «  sequential  schema  Into  a  parallely  structured  estimator  with  two  concurrent 
processes:  one  for  propagatlon/update  of  state  vector  ji;  the  other  for  processing  of  measurements  and 
covarlance/galn  computations.  The  outputs  of  the  Kalman  filter,  s_(j|j),  Identified  as  user's  estimate 
in  Figure  V.D-1,  are  not  "strictly"  Kalman  in  the  sense  that  every  second  time  they  are  computed  while 
using  a  time-propagated  value  of  6j».  For  an  Indirectly  formulated  filter,  the  eecond  process  estimates 
the  error  in  state  vector  a. 

Further  separation  of  measurement  processing  and  covarlance/galn  computations  by  Introduction  of 
an  additional  concurrent  process  leads  to  a  triply  parallel  filter  structure  consisting  of  three  concurrently 
executable  processes:  one  for  state  propagatlon/update;  the  second  for  measurement  processing;  the 
third  for  covarlance/galn  computations.  The  resulting  scheme  la  Illustrated  In  Figure  V.D-2. 

If,  Instead  of  decompoalng  the  workload  Into  two  procasaea  as  In  Figure  V.D-1,  we  allowed  one 
proceee  to  propagate  the  state  vector,  process  the  measurement u ,  end  update  the  state  vector  (with  the 
Kalman  gains  computed  by  the  other  proceea),  while  aaslgntiy;  to  the  other  the  processing  of  covarltncee 
and  the  computation  of  Kalman  gains,  then  we  would  obtain  a  doubly  parallel  estimation  scheme  with 
concurrent  coverlance/gein  filtering,  Which  is  described  In  Subsection  4. 

Next,  we  examine  the  above  introduced  three  parallel  filter  structurea  in  greater  detail.  We 
assume  that  only  one  processing  element  is  available  for  filter  functions.  Hence,  concurrent  processes 
resulting  from  filter  decompositions  must  be  exacuted  in  Interleaved  time  fashion  on  a  single  processor, 

2.  Parallel  Estimation  of  State  Error 

This  la  the  doubly  parallel  structure  shown  in  Figure  V.D-1,  For  this  structurs,  «M  now  assmoa 
indirect  filter  formulation.  Ranee,  as  mentioned  previously,  one  process  (called  the  error  estimation 
process)  astimetes  the  error  As  In  state  vector  _s  (and  actually  performs  all  functions  normally'  ascribed 
to  a  Kalman  filter),  while  the  other  process  (caTlsd  the  state  propagation/updating  process)  propagates 
the  state  vector  s  and  than  updates  a  by  subtracting  from  It  fKs  estimate  of  £Y  passed  by  the  error 
estimation  procesa. 

Compared  to  the  other  two  parallel  filter  structures  discussed  in  the  sequel,  this  schema  yields 
high-rats  propagation/updating  of  the  state  vector.  However,  somewhat  stale  estimates  of  6_«  (although 
properly  time-aligned  by  propagation)  will  in  general  be  used  ror  updatlt*  of  £. 

This  scheme  may  be  the  only  recourse  If  it  la  required  that  the  state  vector  _•  be  propageted/updeted 
at  a  rate  much  falter  than  that  at  which  the  whole  filter  can  be  executed.  For  example,  thle  may  be 
required  in  the  navigation  applications  (where  £  represents  the  navigation  solution)  to  computations 
such  as  aerial  cargo  drops  or  weapon  dellwerlesT  With  this  filter  structure.  It  is  helpful  to  aid 
the  propagation  of  *  with  outputs  from  a  measurement  system  (such  as  the  velocity  and  acceleration 
inputs  from  an  Inertial  eubayetri),  complementing  the  primary  measurements  on  which  the  filter  la 
operating. 

This  doubly  parallel  scheme  reduce#  to  a  sequentially  structured  filter  when  the  execution  rates 
of  the  state  P/0  process  and  of  the  error  aatlmatlon  procesa  are  equal.  In  such  e  case  (or  when  the 
rata  of  error  estimation  procesa  la  not  significantly  lower  than  that  of  state  F/U  process),  this 
•chose,  if  properly  handled,  displays  many  good  propartlaa  (such  aa  robust  initial  coovsrganca)  of  a 
Kalman  flltar. 


The  parallel  error  estimation  acheae  easily  leads  Itself  to  measurement  screening  and  nloely 
responds  to  real-time  changes  In  the  system  model.  Its  chief  disadvantage  Is  a  relatively  low  rate  of 
measurement  Incorporation,  which  may  prohibit  Its  use  In  dynamically  lively  applications.  However,  by 
Introduction  of  a  third  concurrent  process  for  covarlance/galn  filtering  (and  thus  by  separation  of 
these  functions  from  error  estimation),  the  problem  of  low  rate  of  measurement  incorporation  can  be 
alleviated,  but  at  the  coat  of  an  additional  Increase  in  dissimilarity  from  the  structure  of  canonical 
Kalman  filter  algorithms.  Such  a  triply  parallel  scheme  Is  discussed  In  Subsection  3. 

If  the  state  propagation/update  and  error  estimation  processes  are  the  only  two  processes  assigned 
to  a  processing  element,  they  can  be  implemented  by  defining  the  first  process  as  a  cyclic  process  and 
the  second  as  a  background  process.  The  latter  will  be  given  all  free  processor  time  remaining  after 
execution  of  the  cyclic  process.  Should  there  be  other  background  processes  assigned  to  the  same 
processing  element,  the  following  tw>  options  «re  available:  (i)  define  the  error  estimation  process 
as  one  of  several  background  processes  but  allow  only  one  such  process  to  be  active  at  a  time;  (2) 
define  the  error  estimation  process  as  a  deadline  process. 

3.  Triply  Parallel  Estimation:  Covarlance/Galn  Filtering  Performed  Concurrently  with  Error 
Estimation 

This  estimation  scheme  (eutaaarized  In  Figure  V.D-2  and  obtained  as  Indicated  In  the  preceding 
subsection)  processes  messurements  st  a  higher  rate  than  the  doubly  parallel  scheme  with  concurrent 
error  estimation,  but  does  It  at  the  coat  of  performing  ihe  covariance  filtering  and  the  (rain  computations 
at  a  much  lower  rate.  It  should  be  used  when  a  high  rate  of  measurement  incoporatlon  Is  more  essential 
to  performance  than  optimal  gains. 

It  is  not  difficult  to  see  that  this  filtering  scheme  Is  awkward  In  screening  and  rejecting 
measurements  and,  at  best,  only  sluggishly  responds  to  changes  In  system  model  and  poorly  converges 
after  initialization.  All  these  poor  properties  are  due  to  low-rote,  autonomous  pr. -ceasing  of  galas 
and  covariances. 

To  implement  a  triply  parallel  estimation  scheme  on  a  processing  element,  three  processes  need  to 
be  defined:  (I)  a  cyclic  state  propagation/  update  process,  which  can  alto  be  designed  to  serve  as  a 
logical  controller  of  the  entire  Kalman  filter;  (2)  a  measurement  incorporation  process,  which  (depending 
on  the  measurement  acquisition  mechanism)  may  be  implemented  either  as  a  cyclic  process  or  an  almost 
cyclically  scheduled  deadline  process;  and  (3)  a  covariance/gain  filtering  process,  defined  a n  a  background 
process,  which  will  be  given  all  free  processor  tlaa  remaining  between  repeated  executions  of  the 
other  two  processes.  However,  should  there  be  other  background  processes  on  'ne  tme  oroceasing  element, 
then  either  one  background  process  at  most  would  be  kept  active  at  a  time  or  the  covarlance/galn  iilterlng 
proceaa  would  have  to  be  defined  as  a  deadline  process. 

A.  Doubly  Parallel  Estimation  with  Concurrent  Covariance/Cain  Filtering 

This  scheme  Is  based  on  two  processes.  In  each  estimation  cycle,  the  first  (state  est  1 -r.it loo) 
process  first  propagstea  the  state  vector  £,  then  sequentially  processes  the  current"  measurements' 
while  using  ths  gains  computed  by  the  otheT  process,  and  finally  updates  the  state  v«,  ccr.  Any  of  the 
two  filter  formulations  (direct  or  Indirect)  can  be  used  In  state  estimation.  The  other  (coverlence/getn 
filtering)  proceae  propagates  the  state  error  covariance  matrix  P,  then  enter*  e  loop  (structurally 
similar"  to  the  measurement  processing  loop  In  state  error  estimation  process)  to  update  P  end 
compute  gains. 

This  scheme  la  similar  to  ths  triply  parallel  estimation  scheme  Introduced  earlier  except  that 
ths  propsgatlon/updstlng  of  state  end  the  processing  of  measurements  presently  are  done  within  the 
tame  process.  Thus,  (due  to  low-rete,  autonomous  computation  of  gains)  this  scheme  has  properties 
similar  to  those  of  triple  estimation,  the  only  main  difference  being  that  atats  propagatlon/updatlng 
and  maaauremant  Incorporation  are  now  performed  within  the  earns  proceae.  This  limits  the  rate  at 
which  £  can  be  propagated  end  updated. 

This  type  of  filter  mechanisation  has  baan  used  successfully  in  CPS  navigation  applications 
described  in  (Deal  and  (Cyl  80|.  In  thsse  applications,  the  Unearlted  stete-to-meseureemnt  transformations 
change  slowly  after  the  Initial  convergence  (which  justified  low  rate  of  gain  computation)  and  good 
performance  depends  on  A  high  rsts  of  messursmsnt  Incorporation.  Ths  Initial  convergence  and  auddmn 
chaogts  in  the  system  modal  or  to  measurements  still  rmqulra  special  care. 

To  Implement  this  scheme,  one  can  define  the  etate  estimation  process  as  a  cyclic  process  and  the 
covarlance/galn  filtering  process  as  a  background  process.  Ths  latter  will  be  given  all  free  processor 
time  remaining  between  cyclic  c as cut ions  of  the  stats  estimation  process.  Should  there  be  other  background 

VI.  SUXHAXf  AMD  CONCLUSIONS 

In  this  article,  we  addressed  the  control  specialist  who  la  facing  the  problem  of  hsvlrg  his 
estimation  algorithms  Implemented  as  a  working  real-time  system.  Us  attached  the  name  'lapleatntei 
design*  to  the  activities  cone* road  with  adaptation  and  restructuring  of  algorithms  for  computsr 
Implement st loo. 

Prom  the  etart,  we  assumed  that  distributed  systems  of  microprocessors  (or  just  of  Mil  computers) 
wars  the  type  of  hardware  on  which  laplemeotal  design  was  to  be  executed,  but  this  did  not  rule  cur 
eyateme  consisting  of  a  elngle  computer.  With  this  aaeuaptlon,  we  narrowed  the  dlecueelon  to  a  generic 
class  of  sat  1ms t loo/ control  real  systems  coosldsred  to  be  suited  to  Implementation  on  distributed 


microprocessors .  We  characterised  them  as  small-to-aedium-scale  control  systems  designed  to  be  boundedly 
loadable,  i.e,,  to  accent  a  processing  load  not  exceeding  the  bounds  established  at  design  time.  To 
this  category  belong  practically  all  real-time  control  systems  which  are  equipped  with  estimators  and 
which  are  to  be  Implemented  on  hardware  of  tho  type  considered  here. 

Next,  we  Introduced  multiprogramming  as  a  suitable  processing  environment  and  deilned  s  structural 
model  of  software  architecture  for  this  environment.  Such  a  model  Is  needed  for  eoftware  timing  and 
sizing,  which  was  Identified  as  an  important  task  In  implementel  design  of  real-time  software.  It  Is 
also  needed  for  design  of  a  real-time  operating  system,  called  real-time  executive. 

We  also  introduced  process  as  a  fundamental  concept  from  theory  of  operating  systems  and  defined 
it  to  represent  a  program  In  execution,  but  rot  necessarily  executing  at  the  moment.  To  simplify 
the  processing  environment  model,  we  also  required  preservation  of  one-to-one  correspondence  between 
programs  and  processes. 

Our  next  topic  was  real-time  executive.  Selected  Issues  of  process  management  and  resource  allocation 
(such  as  deadlocks  end  their  prevention.  Interprocess  ccemunicatlons ,  and  process  synchronization) 
were  reviewed.  Thereafter,  our  attention  turned  to  two  issues  In  real-time  executive  design  which 
very  much  affect  a  control  specialist  engaged  In  implements!  design  of  estimation  algorithms  for  real-time 
operations)  process  scheduling  and  Interprocess  communications. 

At  the  start  of  the  second  part  of  the  present  exposition,  we  turned  to  the  issues  directly  associated 
with  estimators.  In  Section  IV,  we  reviewed  several  algorithmic  and  procedural  aspects  of  estimator 
design,  which  mast  be  considered  If  the  resulting  real-time  estimator  la  to  be  numerically  stable, 
computationally  efficient,  and  robust  to  disturbances  In  measurements.  Discussed  were  stable  algorithms 
for  covarlance/galu  processing  In  a  Kalman  filter  and  real-time  aystea  Identification  techniques.  In 
Section  V,  by  means  of  Illustrations,  we  looked  Into  practical  schemes  for  decomposing  estimators  of 
the  Kalman  filter  type  Into  structures  of  concurrent  processes.  Two  basic  filter  mechanization  schemes 
were  compared,  underlying  aueh  parallel  structures,  were  compared.  They  are  direct  and  indirect  filter 
formulations.  Also,  two  modes  of  filter  use,  feedforward  and  feedback,  were  Introduced. 
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APPENDIX  A:  KALMAN  FILTER  ALGORITHM  TOR  A  DISCRETE  LINEAR  5YSTW  WITH  SAMPLED  MtUSURBREkTS 

In  this  appsodla  ee  summarise  for  reference  purposes  the  standard  form  of  Reiman  filter  algorlths, 

Inc  lull  n*  the  ay  at  am  audsl  which  this  algorithm  sssuaes,  for  a  discrete  linear  system  with  sampled 
memeuremeata. 

A.  SYSTEM  MDORL 

1.  Propagation  of  the  system  state  vector  frem  t  «  tg_j  to  e  •  tgt 


e(k)  *  F(k,  k-l)s(k-l)  ♦  C(k-l)w(k-l). 

(1) 

Rmsrmsiti  at  t  •  tgt 

m(k)  -  Wk^k)  ♦  u(k). 

O' 

Initial  eomditiona  at  t~”  tot 

*[•(0)1  •  S(0),  Covts(0)  -  i(0)|  •  P<0). 

«) 

4.  Aaausptlon*  about  system  statistics: 

a.  The  processes  { w(k)j  and  {u(k)|  are  aero  Man,  autually  independent  Gaussian 
processes  with  covariances 

B(w(k)w(j)T  J  -  Q(k)  6kJ 

and 

B[u(k)u(j)TJ  -  K(k)  «kJ. 

b.  Furthermore,  t(0)  la  Independent  of  w(k)  and  uf’0  for  any  k. 

B.  ESTIMATION  PROCEDURE 

1.  Propagation  of  eatlnates  from  t  *  tk-i  to  t  ■  t^s 


a(k)~  -  P(k,k-l)i(k-l)+,  (4) 

P(k)‘  »  r(k,k-imk-l)-hr(k,k-l)T  +  G(k-l)Q(k-I)C(k-l)T.  (5) 

2.  Updating  of  estlantes  at  t  *  tfc« 

K(k)  -  P(k)-H(k)T|H(k)P(k>"H<k)T  ♦  *(k)|’1,  (6) 

P(k)+  -  (I  -  K(k)H(k)lP(k)“,  (D 

i(k)+  •  i(k)~  ♦  C(k)(a(k)  -  B<k)i<k)k'|.  (8) 

C.  EXTINSIONS  TO  HOHL  TWEAK  KBASUkEWSKT  EQUATIONS 

Tt-e  aeaeureaent  equations  (2)  are  replaced  wttha(k)  *  Me(k),  tk|  *u(k).  (J) 

The  linearised  state-to-aaaaureaent  transformation  H  appearing  In  filter  equations  (6) 
through  (8)  are  now  obtained  by  aeaoa  of  vector  differentiation  of  h(s,t]  with  respect  to  a: 

B(k)  »  (4hla<r),r J/ja)r«t  -  ””  ~  (10) 

k 

0.  NOTATION  USED  MOVE 

a  Upper-case  letters  represent  aatrlcaa 


a  lower-case  letter*  represent  seal  ore  or  (It  aetked  with  bare  underneath)  column  vector*. 

•  *(k)  represent*  the  vtlue  of  coluan  vector  *  et  t  •  tgj  e taller  notation  la  used  to  t lee-t eg 

scalars  and  aatricee. 

a  A7  represents  the  transpose  of  aetrla  A;  If  a  la  a  coluan  vector,  then  X*  represents  the 
transpose  uf  *.  which  la  *  row  vector, 

e  e(k|l)  denotes  an  estlaste  of  a  st  tie*  tk  that  has  been  obtained  by  using  a  history  of 
wtaeuraaeote  up  to  snd  including  the  time  tjCitk). 

APPtKbU  8:  U-0  FACTOg  COVAKlAMvE/CAUi  MQC*S$tg6  AUJOAtTWC  KALMAN  FlLTWi 

A.  imOOOCTtOM 

Cover  lance  and  pin  processing  algorithm),  operating  on  V  «d  0  fectotn  of  state  error  covariance 
a>trtr  P,  are  e  technique  for  implementing  ‘square  root  filtering*  without  requiring  computation  of 
square  roots.  These  el  get  it  has  offer  e  receaaaalad  approach  for  overcoming  the  turneries!  Instability 
Inherent  in  the  original  formula. loo  of  Kslaan  filter  algorlthaa.  The  latter  la  ewmartsed  la  Appendix 
A,  Numerical  Instability  problem*  are  caused  by  the  repeated  use  of  formula  (1)  In  Append  1*  A  for 
aeaaureeant  updating  of  state  error  covariance  Matrix  f.  examination  of  that  foraule  shew*  that 
accnaulatlon  of  roundoff  errors  nay  eventually  enuaa  astrln  P  to  lose  Ita  positive  definiteness,  tt 
should  also  be  mentioned  et  this  polot  that  the  tHB  covariance  factor last loa  algorlthaa  are  Just  • 
oaaertcatly  stable  and  computationally  efficient  netted  for  leplmntltg  Kalman's  estimation  procedure 
(outlined  In  Appeals  A),  but  not  an  eetlaator  different  froa  Kalman  filter. 

The  tela  ”U-t>  ctver  lance  fectorlaatloo*  cases  froa  a  property  ^  nonnagatlve  doftnlta  eynattrlc 
aatrlcca,  according  to  which  positive  (rei definite  matrix  p  can  be  fact' **d  into  t‘  *  tW>t)T,  tdtare  0  la 
an  upper  triangular  aetrla  with  unit  ateaanta  on  Its  aala  diagonal  and  0  la  a  diagonal  aatrla, 

for  convenience,  we  next  tuaaatlre  the  baste  IH)  covariance  factorisation  algorlthaa.  for  details, 
the  reader  la  refereed  to  (Bis)  and  (Tho),  the  second  belt*  as  updated  review  of  the  topic,  altboucb 
narrowar  in  scope  than  the  first. 

b.  jjouneg 

Suppose  that  the  reference  t!ar  of  the  current  eat last ton  cycle  la  tk.  TO  atapllfy  the  notation 
used  In  Appendix  A  (and  to  aake  It  note  compatible  with  the  notation  used  In  (Thol),  we  drop  explicit 
referencing  to  tlaa  and  write  J  for  8  for  3(b|k);  Slatlerly,  ?  for  f(k|k-l)  and  P  for  P(k|k). 

Cetwlts,  we  use  the  aaae  Utter*  as  io  Appendix  A  To  denote  the  quantities  1e  tone  in  which  the 
*?*t«a  aedii  and  the  Calaan  filter  algorlthaa  are  formulated. 


In  specification  of  computational  algorithms,  we  use  (in  agreement  with  the  current  practice  of 
algorithm  definition)  Instead  of  ”*■"  to  connect  the  left-  and  right-hand  sides  of  assignment 

statements.  We  do  it  to  emphasise  the  idea  of  value  replacement. 

The  following  symbols  are  used  to  specify  the  sire  of  system  model:  "n”  to  denote  the  length  of 
state  vector  jj;  "tu",  I*1*  length  of  process  noise  vector  w  ("q”  here  relates  to  process  noise  covariance 
matrix  Q);  and  "n^  ,  the  length  of  measurement  vector  m,  although,  in  the  sequel,  we  consider  only 
the  sequential  processing  of  scalar  measurements. 

C.  U-D  FACTOR  JSASUREiBNT-UPDATE  ALGORITHM 


Suppose  that  the  following  input  quantities  are  given: 

H  «  lxn  matrix  representing  the  linearized  state-to-iaeasurement  transformation  for  the 
scalar  measurement  to  be  processed; 

R  «  noise  variance  (=Oq)  of  the  measurement  to  be  processed; 

U,D  »  U-  and  D-factors  of  P  (time-propagated  to  tfe  or  measurement-updated  at  tfc  for  all 
measurements  so  far  processed  at  t^). 

Proceed  as  specified  by  the  algorithm  contained  in  Figure  Ap.B-1  in  order  to  perform  covariance/gain 
update  processing  for  a  scalar  measurement.  To  process  a  vector  of  measurements,  this  algorithm 
would  have  to  be  embedded  in  a  measurement  processing  loop  (e.g..  Section  V.C)  and  then  executed 
iteratively  nm  times. 

This  algorithm  uses  a  scalar  X  and  n-dimensional  vector  _f  and  £  as  intermediate  variables. 

The  outputs  are: 

It  *  n-dimensional  Kalman  gain  vector  or  ( following  the  notation  of  Appendix  A)  nxl  matrix 

k; 

U,D  »  U-  and  D-factors  of  P  (measurement-updated  for  all  measurements  so  far  processed  at 
tk. 

Instead  of  outputting  k,  it  is  often  preferable  to  output  separately 
v  »  n-dimensional  normalized  Kalmen  gain  vector  and 

o  «  the  innovations  variance,  which,  following  the  notation  of  Appendix  A,  can  be  expressed 
as  [H(k)P(k)~H(k)T  +  R(k)J. 

This  is  because  a  is  used  in  the  measurement  processing  loop  for  screening  a  processed  •seaaureoant  by 
testing  whether  its  residual  lias  within  \n  acceptance  interval  of  the  form  (-b»n,  bon),  where  b(>0) 
is  a  scaling  parameter. 


UDMUPD:  begin 

f~:-HU  (where 

£:-Bf;  (where 

for  j-1 . .  do; 

!*  “J-l  +  (where 

if  0  4  “0,  then  Dj  :»  D.; 

else  6j  )Dj ! 

‘Mfgji 

if  j  •  1 ,  then  go  jo  ELj  ; 

«,  j—i  “  0,  thf  T  X  0; 

else  X:« 


.*n]> 


oq  •  R  and  o-  on) 


for  1  -  do: 

^ij  ffij  +  v’jX: 
vi  :■ 
end 

ELJ  x  end 

FT-  k(l/«n)5 
end  UttOJPD 


(Recycle  if  i<J  -1 ) 
(Recycle  if  J<n> 
(h“(Rl  •  •••  »hp ! ) 


Figure  Ap,B-l.  Executable  (computational)  part  of  U-D  Heasuremeut-tfpdato 
Algorithm,  defined  in  (Thol,  pp.  198-199.  Inputs,  outputs, 
and  Intermediate  quantities  are  specified  in  Section  C. 


D.  COMPARISON  WITH  TIE  ORIGINAL  FORM  OF  KALMAN  FILTER 


Next,  we  restate  the  coverlance/geln  filtering  part  of  the  original  fora  of  Kalman  Filter  {defined 
in  Appendix  A  by  equetions  (5)  through  (7)]  for  processing  a  single  scalar  measurement  in  terms  of  the 
notation  Introduced  in  the  present  appendix.  We  do  it  in  order  to  facllltete  comparison  betvetn  the 
original  algorithms  and  the  U-D  factor  covariance/gain  processing  algorithms  summarized  in  the  present 
appendix. 


In  ten*  of  the  simplified  notation  used  in  the  present  appendix,  equations  (5)  through  (7)  of 
Appendix  A  yield  the  following  procedure: 


{£?=£FfFT-«!QGT5 

(Covariance  p-oparation  to  t^) 

v  !-  PHT; 

(Normalized  Kalman  gain) 

a  :•  %+  R; 
k  IS*"1} 

(Innovations  covariance) 

(Kalman  gains  at  t^) 

P  .—  P  -  kvT; 
end 

(Covariance  measurement-update  at  tk) 

The  above  form  of  covariance  measurement  update  [or  equivalently  formula  (7)  in  Appendix  A)  is  known 
to  be  computationally  unstable  in  the  sense  that  it  may  make  P  acquire  negative  characteristic  roots 
as  a  result  of  roundoff  errors  and  overconvergence  of  P,  A  stabler,  and  although 
computationally  more  expensive,  version  of  that  formula  for  a  scalar  measurement  (with  H  being  lxn 
matrix)  is 


P  :«  (I  -  kH)P(I  -  kH)T  +  kRkT. 

Its  vector  measurement  version  is 

V  !-  (I  -  KH)P(I  -  KH)t  +  KRKt 

where  K  now  is  an  nxn,,  matrix  and  H  is  an  nj,xn  matrix.  Due  to  the  amount  of  processing  required, 
this  stabler  form  is  rarely  used  in  real-time  applications. 

E.  Til* -UPDATING  PROPAGATION  OF  STATS  ERROR  COVARIANCES 


When  the  U-D  factorization  algorithms  are  used  for  measurement  updating  of  atate  error  covariance 
matrix  P  and  computation  of  Kalman  gains,  the  following  two  alternative  approaches  for  performing  the 
time -propagation  of  P  are  available!  the  Conventional  Propagation-of-Covaritnce  Algorithm  and  the 
U-D  Factor  Time-Update  Algorithm.  In  the  sequel,  theae  two  approaches  are  outlined  for  propagation  of 
P  from  time  t^  to  ....  l  »  t^  +  AtR,  i.e.,  for  computing  P(k+1)“  6  P(k+l|k)  «  ?  from  the  measurement-update 
of  P  from  the  preceding  estimation  cycle. 

F.  Tig  CONVENTIONAL  PROPAGATION-OF-COVA.RIANCB  ALGORITHM 

Thie  algorithm  for  computing  P,  given  P  from  the  preceding  cycle,  in  tetma  of  factor  matrices  U 
and  D  is  based  on  the  following  procedure: 

«.  Compute 

f  !•  <F0)D(PU)T,  (i) 

which  yields  the  canonical  product  representation  of  matrix  V;  here,  F«F'k*l|k)  .1*  the  atate 
transition  matrix  for  propagation  of  the  state  vector  a  from  t^  to 

b.  Compute  the  procean  covariance  matrix  0  *  QCh.At^),  which  way  be  a  function  of  time  and  of 
the  propagation  stap  else  Air. 

c.  Compute 

P  •  P  ♦  Ot)CT,  (2) 

where  C^C(k)  is  as  defined  by  equation  (1)  In  Appendix  A. 

d  Factor  5*  into  0  and  t>  by  sear*  of  the  U-D  Factorisation  Algorithm,  specified  in  Figure  Ap.B-2. 

Although  th*  computing  Implied  by  the  above  steps  a  through  d  is  thought  of  a*  a  stable  process, 
it  1«  noted  on  138  of  (TboJ  that  there  exlet  Important  excel  ions  when  F  la  large  and/or  P  la 
ill-conditioned.  In  such  situations,  the  resulting  matrix  F  may  have  serious  errors.  Problems  may 
alao  arise  when,  doe  to  roundoff  errors,  eoee  characteristic  values  become  ell$hrly  negative. 

On  the  positive  tide,  the  above  covariance  propagation  algorithm  yields  (after  Stap  c)  P  expressed 
in  the  canonical  product  form,  various  parts  of  which  (especially  ita  main  diagonal  elements)  are 
often  ueed  for  both  on-line  and  poatmiaslon  parfomance  enalytla.  Alto,  the  appearance  of  slightly 
negative  eharactariatie  roots  In  P  can  ha  avoided  by  keeping  GQCT  euff iclently  ‘large'  compared  to  ? 
and/or  by  Monitoring  and  than  boosting,  on  th#  basis  of  naed,  tie  elaments  of  diagonal  matrix  D. 

Another  technique  (la  case  Q  W  not  an  identity  matrix),  borrowed  from  ridge  regression,  la  to  add  to 
thn  right  hand  Side  of  (2)  a  positive  detinue  diagonal  matrix  Dj  on  detection  of  the  need  to  boost  P. 

G.  U-D  FACTO!  ttOFACATION  (TU* -UPDATE )  AUUKITM 

This  algorithm  is  based  on  rcdKlSd  Crar-Sehmldt  arthogoooUtacloo.  It  la  described  on  pages 
200-202  of  (Thol. 

to  order  to  auamarlte  It,  me  first  need  to  define  a  weighted  Inner  product  of  two  ii-camponemt 

vectors  £  and  £,  weighted  (normalltad)  by  the  mat  -diagonal  element#  of  an  tun  matrix  •  -  diag  Jbj . bn) . 

Us  dafioi a  thia~laaev  product  aa 

(a,  £,)  g  *£T»c  «  ajbjCj  ♦...4mnbtlcIl.  (3) 


Input:  nxn  symmetric  Mtrlx  ?,  with  main-diagonal  and  upper-triangular  eleaents  a  to  red  In  an 

nxn  array  P. 

Output:  nxn  unit -diagonal,  upper-triangular  matrix  u,  with  its  upper  triangular  portion  stored 

in  nxn  array  U  (which  optionally  can  be  “equiva lanced"  with  array  P  so  that  the  original 
?  is  destroyed). 

Output:  the  Min-diagonal  eleaents  of  nxn  diagonal  matrix  if  stored  in  vector  D  (which  optionally 
can  be  stored  in  locations  of  the  oain-diagonal  eleaents  of  array  P). 

Remark:  the  algorithm  does  not  explicitly  generate  the  main-diagonal  unit  elements  of  U. 

UDPTCTR:  begin 

for  j  "  n,  do: 

PJ.J: 

“  1/Bj; 

for  k  "l,...,j-l  do: 

P  i 

5k, * 

for  i  «  1 , . . .  ,k  do: 

pi,k  pi,k  ”  P*ui,J» 
end 
end 

5l  *  Pl.li 

end  UDFCTR 

Figure  Ap.  B-2.  U-D  Factorisation  Algorithm 

Next,  we  use  two  matrices  from  the  system  model  defined  in  Appendix  A,  the  nxn  stste  transition 
matrix  F  «  F(k+l,  k)  and  the  nxnq  process  noise  transformation  matrix  G,  to  define 

V  -  (FU | C | .  (*) 

(Here,  U  and  D  denote  the  measurements  updates  (from  the  preceding  estimation  cycle)  of  1)  and  t>. 
respectively. j  Thus,  V  is  an  nxH  *  nx(n  *  i^)  matrix,  the  jth  row  of  which  will  be  denoted  by  WjT. 

Finally,  we  combine  In  the  indicated  order  the  ms'n-dlsgonsl  elements  of  nxn  diagonal  Mtrlx  5 
with  those  of  n^joiq  process  noise  Mtrlx  0  to  define  an  NxN  diagonal  Mtrlx  B  (whore  sfeln  fi  *  n  + 

»q)  as 

5  •  dlsg  (Hi . fyl 

•  din*  Qltj,...,ftvt,n4|.  (5) 

With  the  needed  definitions  etwpletsd,  wc  are  ready  to  summarise  the  tH>  Fsetor  Propagation 
Algorithm,  Which  we  do  in  Figure  Ap.  B-J. 

H.  OOHCtCPIKC  NOtSS 

Only  the  very  basic  forms  (of  ‘Sieasan's  method")  of  U-0  factor  cevarlaoce/galn  proceamlng  algorlthmo 
have  bean  summer  lead  and  compered  here  with  the  original  fora  of  KaUnn's  filter.  Fur  »  more  complete 
so  count  of  •terfcan’s  approach  ra^er  to  {lie)  or  k.  !Th»|.  For  different  epproaches  to  "aqumte  root 
filtering*,  refer  to  Andre***  lArdr)  or  Carlson  (Cart.  Chapter  A  of  (And}  end  Chapter  2  of  (Nay)  contain 
textbook  introduction*  to  this  topic.  Comparative  timing  god  siting  of  filtering  algorithm*  are  discussed 
In  (11*1,  (Tho),  and  (Hay). 

A*  not  ad  in  Section  IV,  m  faplamental  design  of  real-time  fcslmte  filters  for  microprocessor  (or 
small  computer)  implementation,  the  value  of  "square  root  filtering "  algorithms  it  mainly  do*  in  (1) 

Chair  nuaaricsl  stability,  (2)  their  suitability  for  implementation  in  tingle-precision  floating-point 
arlthmatlc  (except  for  computation  of  enm*  dot  products),  and  (3)  their  reasonable  computational 
efficiency  compared  to  Kalman's  original  fotmulstion.  Criterion  (2)  is  important  lit  Implementation*  of 
realties  estimators  on  the  microprocessor*  Which  do  not  have  floating -poing  hardware.  The  disparity 
betwean  the  processing  spied*  of  slop  la-  and  double-precision  form*  of  floating-point  arithmetic  worses* 
as  on*  goes  from  hardwsrs  to  software  (i.e.,  interpretive)  lapleamratationof  this  arithmetic. 


Input : 

nxN  matrix  W  (with  rows  v[,...,w£). 

Input: 

NxN  diagonal  matrix  5  defined  by  equation  (5). 

Output : 

the  upper  triaugular  part  of  nxn  unit-diagonal,  upper-triangular 
matrix  (J. 

Output : 

the  main-diagonal  elements,  stored  as  a  vector,  of  nxn 
diagonal  matrix  D. 

Define : 

v/0*  -  Wj  for  J  »  l,...,n. 

UDFCTRPR : 

begin 

for  j  •  n,  n-1,.,.,2  do: 

f!  -m  (v  <n~J)  w  (o-J)i 

for  i  «  1,...,  j-1  jio: 

Ujj  (*j(nrJ). 

^(nrj+l)  ..  ^(n-j)  -  (0i,j )*“j(n"J) 5 

end 


end 

§  .  Cw  (n~l)  v  <n-lK_. 

B1  ij  'D> 

end  OEFCTRPR 

Figure  Ap»B-3.  U-D  Factor  Propagation  (Tiae-Updata)  Algorithm 

Conatrainta  in  the  available  processing  reaourcea  and  in  the  required  real -tine  constraints  usually 
Motivate  the  explnitation.  of  problem  structure  in  order  to  reduce  cne  processing  load.  There  art  three 
areas  which  should  always  b«  carefully  examined  and  possibly  exploited.  The  first  is  avoidance  of 
floating-point  operations  on  aeforvalued  operands.  This  can  be  attained  via  careful  programming  of 
alporithi  . .  The  second  is  structuring  of  vectors  *nd  matrices  in  the  system  model  so  ss  to  introduce 
aero  subvestora  and  suhwstricea,  which  would  In  turn  yield  an  estimation  problem  of  aaaller  size. 

This  can  often  be  accomplished  through  careful  structuring  of  system  model  and  mechanization  of  estimation 
algorithms,  finally,  all  kernel  algorithm*,  operatic  on  or  producing  matrices,  should  be  designed  to 
handle  oxtriee*  as  one  dimension*}  arrays.  Thjs  last  feature  allows  efficient  application  of  the  same 
algorithm  implementation  (subprogram)  to  system  models  of  varying  dimensions.  In  some  applications 
considered  here,  dimensions  of  the  system  model  may  change  in  teal  time. 

APflSKMX  Cf  eSTlKATUWt  FROBUJti  IN  CPS  USER'S  NAVIGATION 

A.  INTRODUCTION 

The  present  appendix  summarize*  tb*  OPS  estimation  problem  and  defines  a  system  model  for  it. 

This  problem  is  cited  se  an  illustration  several  times  in  the  tain  body  of  the  preaent  exposition. 

A  user  of  CPS  navigation  equipment  it  assumed  to  be  either  moving  or  staying  stationary  close  to 
the  surface  <>f  the  esrth.  Several  different  typae  of  CPS  uaer'a  navigation  equipment  for  various 
claeoeo  ol  users  (e.g.,  s  stationary  user,  a  land  vehicle,  an  aircraft,  or  a  ship)  were  recently 
deveio'ed  or  ere  still  under  development.  |Dsw),  |Cyl  $0),  and  (Up*)  are  samples  of  literature  describing 
CPS  user's  navigation  and/or  ltd  equipment.  |C«x|  discueeee  integration  of  CPS  with  inertial  systems; 
(Van)  describes  CM  message*.  The  estimation  problem  of  CPS  navigation  and  Its  t  ye  tarn  modal  actually 
depend  on  the  particular  type  of  equipment  under  consideration.  Poi  pedagogical  reason*,  we  overlook 
•any  technical  detail*  In  the  estimation  problem  end  define  an  overs  implied  system  modal  for  it. 

»„  oookm hate  stsmw 

An  eartb~c entered,  earth- fixed  (EOSF)  cooriinate  ay* tern  (with  coordinate  exes  denoted  by  x,  y, 
and  a)  ts  used  in  all  CPS  navigation  processing  described  hare,  the  t-axir  of  such  a  coordinate  frame 
coincide*  with  the  ptiar  axle  of  tha  refsrsne*  e?  Unsold;  x  and  y  lie  in  she  squs.torlal  plane.  The 
parUeule.r  version  of  KCEF  frame  ataumad  hers  has  Its  x-axlt  pointing  toward  the  Greenwich  meridian; 
the  y-axle  90*  cent  of  cite  x-axla. 

C.  KAVICATIOh  STAKE  VECTOR 

The  navigation  state  vector  (which,  in  ge  natal-,  H  a  fvnctlon  of  time/  is  defined  by 

*T  •  (b,f ,*,y,t,v_,vv,v,)  £(b,f,p*,vT), 
where:  ~ 

b  *  the  range  bias  in  range  measurements,  dua  to  a  bias  In  Cht  clock  of  user's  navigation 
equipment  aat  relative  to  the  CPS  rime; 


f  ”  db/dt  ~  frequency  drift  rate  of  user's  navigation  equipment  clock; 

j}T  »  (x,y,z)  “  ECEF  coordinates  of  the  antenna  phase  center  (PC)  in  user's  navigation  set; 

T 

»  (vx,Vy,vz)  ECEP  velocity  components  of  the  antenna  PC. 

In  GPS  navigation,  even  for  moderate  dynamics  users,  one  usually  models  acceleration.  For  simplicity, 
acceleration  is  not  modeled  in  the  present  case. 

D.  DISCRETE-TIMS  MO  EEL 

The  discrete  time  model  of  state  vector  dynamics  is 
s(k)  -  F(k,k-1)  s(k-l)  +  w(k-l). 

Let 


Atk  “  “  **-1* 

and  ideally  one  would  like  to  assume  that  w(k)  is  a  zero  mean  white  noise  Gaussian  process,  with 
E(w(k)wT(k)]  -  Q(k,Atfc), 

E[s(0)]  ■  9(0) ,  and 

E((s(0)  -  8(0)1 (s(0)  -  9(0)1T)  -  P(0). 

The  state  transition  matrix  F  is  defined  by  the  following  transformations: 
b(k)  -  b(k-l)  +  Atkf(k-1), 
f(k)  -  f(k-l)exp[-Atk/tjj] , 

with  the  range  bias  correlation  time  r ^  assumed  to  be  a  constant  or  a  slowly  changing  parameter;  furthermore, 
£(k)  -£(k-l)  +Atkv(k-1); 
v(k)  -  v(k-l). 

NAVSTAR-CPS  satellites  (on  the  pseudo-range  and  delta  pseudo-range  measurements  of  which  the 
navigation  filter  operates)  will  be  referred  to  as  space  vehicles  (SVs). 

For  each  tracked  SV,  the  navigation  filter  during  a  measurement  processing  cycle  receives  via  the 
GPS  receiver  in  the  navigation  set  a  pair  of  pseudo-range  and  delta  pseudo-range  (an  observed  change 
in  pseudo-range  over  a  count  period  of  fixed  length)  measurements.  For  the  SV,  SV,,  these  two 
measurements  will  be  denoted  by  PRj(t)  and  DPRj(t),  respectively.  A  pseudo-range  roughly  is  a  range 
that  has  been  synthesized  from  the  readings  of  two  distinct  clocks  (SV  clock  and  user's  navigation  set 
clock)  and  that  has  not  been  corrected  for  the  bias  of  user's  navigation  set  clock  with  respect  to  the 
SV  clock.  In  the  sequel,  it  will  be  assumed  that  incoming  pseudo-range  measurements  are  already  corrected 
(actually  they  are  not)  for  other  errors,  such  as  the  SV  clock  errors  with  respect  to  GPS  system  time 
and  atmospheric  signal  delays.  Thua,  if  b(t)  denotes  the  true  but  unknown  range  bias  at  time  t  and  if 
PRj(t)  is  the  corrected  pseudo-range  from  SVj  received  at  time  t,  then  the  range  of  the  signal  received 
from  SVj  by  user's  navigation  set  at  the  same  time  t  is  represented  by 

Rj(t)  -  PRj(t)  -  b(t). 

Hence,  the  predicted  pseudo-range  measurement  for  SVj  at  time  t  can  be  written  as 
PRj(t)  -  Rj(t)  +  b(t). 

Similarly,  the  delta  pseudo-range  measurement  DPRj(t)  for  SVj  at  time  t  is  defined  by 
DPRj(t)  -  PRj(t  +  6t)  -  PRj(t-  6t), 

where  Store  ■  2  St  is  the  duration  of  delta  range  count,  which  la  characteristic  of  the  receiver. 

Hence,  the  measurement  DPR,  is  predicted  at  time  t  by  means  of 
DP&j(t)  »  Pfij(t+’fe)  -  pftj(t-«t) 

•  Rj(t+St)  -  Rj(t-St)  +  b(t+St)  -  b(t-5t) 

•  Rj(t+6t)  -  Rj(t-6t)  +  f(t)StDPR> 

In  a  measurement  processing  cycle,  a  (PR,  DPR)  measurement  pair  ia  received  froa  each  of  four  (or 
occasionally  fewer)  tracked  SVa  and  subsequently  processed  by  the  navigation  filter. 

To  complete  the  system  model,  the  measurement  equation  at  tfc  is  written  as 

m(k)  -  h[m_(k)  ,tk)  +  u(k), 


where  we  assume  that: 


•  The  transpose  of  vector  m(k)  is  of  the  form  (PRi(k) . PRnltCk),  DPRj(k) , . . .  ,DPRnk  (k)], 

with  nk  (<4)  being  the  number  of  distinct  SVs  from  which  measurements  are  available  at  time 
ck5 

•  u(k)  is  a  zero  mean,  white  noise  Gaussian  process  with  E (u(k)uT(k) ]  *  diag[62pjj j , . . . , 

62pRnk>  S2DPR1 .  52DPRnk,:  ^ 

•  {  v(k) j  and  {  u(k) j  are  mutually  independent  stochastic  processes,  which  are  also 
independent  of  s(0). 


It  is  assumed  that  the  GPS  receiver  which  acquires  and  preprocesses  for  the  estimator  pseudo-range  awl 
delta  pseudo-range  measurements  is  designed  to  furnish  extra  observables  from  vtfiich  the  estimator 
directly  computes  £  2pRj  and£2opRj  .  Such  a  receiver  is  described  in  [War]. 

E.  EQUATIONS  FOR  PREDICTION  OF  RANGES  AND  DELTA  PSEUDO-RANGES 

The  signal  range  Rj(t)  from  SVj (received  at  time  t)  is  computed  by  means  of  range  equation 


where: 


Rj(t)  -  ( [ Xj  +  oyj-x]2+[yj-oxj-y]2+[Zj-z]2)  1/<2 

-  [(Axj)2+<Ay.))2+(Azj)2]  1/2 


(1) 


•  a*  (fl)(AtT(j) )  is  the  angle  by  which  the  ECEF  coordinate  frame  is  rotated  during  the 
signal  transit  time  from  SVj  ,  with  il  representing  earth's  sidereal  rate  and  At-p(j) 
being  the  duration  of  signal  transit  from  SVj  ,  a  quantity  viiich  is  unknown  and  must  be 
estimated. 

•  (Xj  ,  y j  ,  z j )  represents  the  ECEF  position  coordinates  of  Vj  at  time  t,  «*  t  ^AtT^,j; 

they  mustJbe  computed  by  means  of  an  extended  form  of  Kepler's  algorithm  while  using  the 
transmitted  orbital  parameters  (called  epheraeris  data)  of  SVj.  The  best  approach  is  to 
recompute  periodically  (at  a  relatively  low  rate)  the  least-squares  polynomials  for  predicting, 
as  a  function  of  time,  the  position  coordinates  of  SVj,  (xj ,  yj  ,  zj).  This  approach 
saves  processor  time  and  yields,  by  means  of  analytic  differentiation  of  polynomials, 
approximations  to  the  time  derivatives  (xj ,  yj,  zj)  of  (xj,  y j ,  zi ) .  These  time  derivatives 
are  useful  in  obtaining  the  time  derivatives  (Axj,  Ayj,  Azj)  of  (Axj,  Ayj,  Atj). 

•  (x,  y,  z)  are  the  ECEF  coordinates  of  the  position  at  time  t  of  the  antenna  phase  center  in 
user's  navigation  set. 

To  predict  Rj(t)  [i.e.,  to  obtain  Rj(t)],  the  quantities  (x,y,z)  in  equation  (1)  are  replaced  with 
their  estimates  at  time  t.  The  expression 


Rj(t+5t)  -  Rj(t-6t),  (2) 

needed  in  computing  the  predicted  value  of  DPRj(t)  at  time  t,  can  be  (as  experience  has  shown) 
efficiently  antf  with  sufficient  accuracy  approximated  by  means  of 

([dR(r)/dr)r»t)6tDPR  -  tAxjAxj4AyjAyj4AzjAZj  J/Rj  (t),  (3) 

where,  as  earlier  stated,  fitDPR  is  the  duration  of  delta  pseudo-range  count.  To  obtain  the  predicted 
value  of  (2),  one  substitues  in  (3)  the  estimates  of  Axj ,  Ayj  ,  Azj  ,  Axj  ,  Ayj  ,  and  Azj  at  time  t. 

F.  MODELING  OF  PROCESS  NOISE  COVARIANCES 


Suppose  that  the  process  noise  manifests  itself  only  through  the  unmodeled  acceleration  and  through 
the  range  bias  and  the  range  bias  rate  (i.e.,  f  -  db/dt)  components  of  state.  Let  At R  be  the  effective 
time  step  used  for  propagating  the  state  error  covariances  P  from  tk_i  to  tk.  Then,  the  process 
noise  covariance  at  time  tk  is  of  the  form 


Q(R»  Atk) 


°2x6_  _ 

^6x2 

Od(k»  tk) 

J 

where  subscript  "b"  refers  to  range  bias  and  its  rate  of  change  and  subscript  "d"  to  user’s  dynamics. 


The  6x6  user's  dynamics  process  noise  gubmatrlx  Oj  la  of  the  form 


Qd(fc»  t) 


qppl3 

5pvI3  J  ^vvI3 


Hence,  Qj  is  constructed  from  three  generating  scalar  parameters  qpp,  qpv,  and  qvv. 


Define  an  auxiliary  2x2  matrix  Qd(k,Atk),  constructed  from  these  three  parameters,  as 


[Fd(r)l  N  [Fd(r)]Tdr, 
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and  »  2  <r  /  r  represents  the  power  spectrum  density  of  velocity  noise  due  to  unmodeled 
acceleration,  <rv“  is  the  variance  of  velocity  noise,  and  rv  is  the  velocity  correlation  time  constant. 


In  general,  <rv^  and  rv  are  not  time  invariant. 
Suboatrix  Oj,  of  0  is  of  the  form 


QbCAt)  - 


<Ibb 

Ibf 

<Jbf 

<Jff 

where,  assuming  that  the  range  bias  rate  correlation  time  rj,  is  much  greater  than  At, 

%b  *  <nbb)At  +  (&ff/3)(At)3, 

“  (nf£/2)(At)  ,  and 
Iff  ^  ( Off) (At). 
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This  article  discusses  some  global  approximation  procedures  for  nonlinear  filtering. 
These  procedures  yield  algorithms  for  recovering  a  signal  from  a  measurement  containing 
signal  plus  noise.  The  celebrated  Kalman  filter  solves  problems  in  which  the  signal 
evolves  as  the  solution  to  a  linear  differential  equation  driven  by  white  Gaussian  noise 
with  Gaussian  initial  condition  and  where  the  signal  enters  linearly  into  a  measurement 
corrupted  by  white  Gaussian  noise.  For  problems  where  these  linear  Gaussian  conditions 
are  weakly  violated,  the  Kalman  filter  serves  as  a  good  approximation.  In  many  interesting 
problems,  however,  these  conditions  are  not  satisfied.  For  example,  one  of  the  random 
variables  may  have  a  multimodal  density  function  or  the  signal  may  enter  nonlinearly  into 
the  measurement.  The  techniques  discussed  here  give  the  designer  some  tools  to  use  when 
the  Kalman  filter  is  no  longer  a  reasonable  approximate  solution  to  his  problem. 


1.0  ORGANIZATION  OF  ARTICLE 

This  article  is  organized  as  follows.  The  first  section  is  devoted  to  a  brief  state¬ 
ment  of  the  general  problem  addressed  here,  and  a  discussion  is  presented,  by  means  of  an 
example,  of  the  difference  between  global  and  local  approximations.  The  extended  Kalman 
filter  will  be  seen  to  be  a  local  approximation,  and  the  hope  is  that  the  example  chosen 
will  illustrate  the  inherent  difference  between  the  extended  Kalman  filter  and  approxima¬ 
tions  to  be  developed  later.  The  example  problem  chosen  to  illustrate  the  difference 
between  local  and  global  approximation  is  that  of  analyzing  a  feedback  loop  with  a  non¬ 
linearity  in  the  forward  path.  The  local  approximation  for  this  problem  is  a  small  signal 
analysis  or  linearization.  The  global  approximation  chosen  is  a  describing  function  anal¬ 
ysis.  The  difference  between  these  two  is  that  the  global  approach  predicts  fundamentally 
nonlinear  phenomena  like  limit  cycles,  whereas  the  small  signal  gain  approach  cannot.  The 
particular  example  chosen  is  also  useful  in  that  it  will  recur  when  filtering  is  discussed 
in  the  second  section. 

Section  2.0  develops  some  global  nonlinear  filtering  approximations  and  illustrates 
their  use  with  a  radar  ranging  example.  The  first  part  of  the  section  is  devoted  to  the 
introduction  of  this  radar  example.  A  radio  ranging  problem,  similar  to  the  radar  prob¬ 
lem  discussed  here,  provided  the  authors  with  motivation  tu  explore  global  approximation. 
The  problem  is  one  that  cannot  be  solved  satisfactorily  by  an  extended  Kalman  filter. 

The  remainder  of  Section  2,0  is  devoted  to  the  introduction  of  particular  approximation 
methods  and  their  application  to  the  radar  problem.  The  applications  are  developed  and 
compared  and  their  performance  demonstrated. 

1.1  Problem  Statement 

The  object  of  this  article  will  be  to  give  some  approximate  filtering  algorithms  for 
the  problem  of  recovering  the  signal  from  a  measurement  containing  signal  plus  noise.  It 
will  be  assumed  throughout  this  article  that  the  signal  to  be  estimated  is  a  random  pro¬ 
cess  whioh  satisfies 


x  •  f (x, t)  +  g(x,t)  n<t)  (1) 

where  f  and  a  are  known  functions  and  n(t)  is  an  m-vector  white  noise  with  spectral  matrix 
Q(t).  The  signal  x{t)  is  an  n-vector.  It  is  further  assumed  that  the  measurement  from 
which  the  signal  is  to  be  extracted  is  of  the  form 

z(t)  -  h(x,t)  +  v(t)  (2) 

where  h  is  a  known  function  and  v(t)  is  a  white  noise  with  spectral  matrix  R(t)  and  is 
independent  of  n(t).  Section  1.2  presents  a  radar  tracking  problem  which  has  this  form, 
and  many  other  physical  problems  can  be  modeled  by  this  set  of  equations.  The  vector  x 
is  called  "the  state  vector"  or  "the  message,"  depending  on  the  situation.  The  vector  z 
is  called  "the  measurement."  The  object  is  to  give  some  algorithms  which  process  the 
measurements,  z(t),  to  determine  the  state,  x(t),  with  the  smallest  possible  errors. 

Soma  of  the  approaches  to  be  used  for  extricating  the  signal  from  a  noisy  measurement 
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require  the  equations  to  be  in  discrete- time  form.  Discrete-time  state  and  measurement 
equations  have  the  form 


x(i+l)  =  f(x{i),  i)  +  g(x(i) ,  i)  n(i) 


(3) 


and 


z(i  +  l)  =  h(x(i) ,  i)  +  v(i)  (4) 

where  x  and  z  are  the  discrete-time  state  and  measurement,  respectively.  The  functions 
f,  g,  h  are  assumed  to  be  known.  The  noise  sequences  (h(i)  and  v(i)  are  white,  independent 
and  Gaussian. 

1.2  The  Difference  Between  Local  and  Global  Approximation 

In  this  section  an  example  problem  illustrates  the  differences  between  gobal  and  local 
approximations.  The  problem  chosen  is  to  analyze  the  behavior  of  a  nonlinear  feedback 
loop.  The  loop  to  be  analyzed  is  diagrammed  in  Figure  1. 
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Figure  1.  A  Nonlinear  Feedback  Loop 

This  tracking  loop  is  first  order.  It  has  a  nonlinearity  in  the  forward  loop  which 
is  linear  for  errors  smaller  than  one  in  magnitude  and  which  goes  to  zero  for  errors  larger 
in  magnitude  than  two.  The  input,  to  be  tracked  by  the  loop,  is  a  Brownian  motion.  A 
Brownian  motion  is  a  Gaussian  random  process,  a  typical  time  history  of  which  might  look 
like  the  trajectory  in  Figure  2. 


Figure  2.  Possible  Trajectory  for  a  Brownian  Motion 

The  Brownian  motion  can  be  thought  of  as  the  integral  of  a  white  noise.  It  shall  be 
assumed  that  the  speotral  level  of  this  white  noise  is  q.  The  magnitude  of  q  determines 
how  difficult  the  input  is  to  track. 

The  form  of  the  forward  loop  nonlinoarity  is  an  obvious  source  of  difficulty  for  this 
tracking  loop.  Since  errors  larger  than  two  result  in  zero  restorative  force,  it  is 
expected  that  there  is  soma  level  of  q  for  which  the  loop  will  be  unable  to  follow  the 
input  with  a  finite  error.  If  the  input  were  a  ramp  and  q  its  slope,  then  it  would  be 
clear  that  above  some  magnitude  of  q  the  input  would  be  untrackable  by  the  loop  in 
Figure  1.  This  would  be  clear  because  the  tracking  loop  is  first  order  and  follows  a  ramp 
with  a  steady  state  error.  For  a  ramp  input  to  a  first  order  loop,  the  magnitude  of  the 
steady  state  error  in  proportional  to  the  ramp's  slope.  When  the  slope  gets  large  enough 
that  the  steady  stata  error  is  greater  than  one,  the  loop  in  Figure  1  will  no  longer  follow 
it  with  a  finite  steady  state  error.  The  object  is  now  to  demonstrate  the  same  kind  of 
dependence  of  the  loop  following  error  on  q  for  the  given  Brownian  motion  input. 

Two  approaches  shall  be  taken.  First,  the  loop  will  be  linearized  and  analysed  sta¬ 
tistically,  and,  secondly,  a  describing  function  approach  shall  be  taken.  The  lineariza¬ 
tion  approach  is  a  local  approximation  and  is  directly  analogous  to  an  extended  Kalman 
filter.  The  describing  function  approach  is  a  global  approach  and  is  directly  analogous 
to  some  of  the  approaches  which  shall  be  taken  to  the  nonlinear  filtering  problsm  in 
Section  2.0. 

To  begin  analysing  this  loop  consider  the  differential  equation  which  the  loop  error 
satisfies 


e(t)  •  -K  f(e)  ♦  n(t) 


(5) 


In  this  equation  f(e)  is  the  forward  loop  nonlinearity  shown  in  Figure  1,  K  is  the  forward 
loop  gain,  and  n(t)  is  the  white  noise  which  is  the  derivative  of  the  Brownian  motion 
input.  The  first  approach  to  analyzing  this  loop  is  to  assume  small  error  and  linearize 
f(e)  about  zero.  This  yields  the  equation 

^  e(t)  *  -Ke  +  n(t)  (6) 


This  linear  equation  is  easy  enough  to  analyze.  The  mean  square  tracking  error  is  defined 
by 


p(t)  =■  E{ez(t){  (7) 

and  if  it  is  assumed  that  E{e(o)}  =0  (that  is,  that  the  loop  is  initialized  without  any 
intentional  error),  then  [jazwinski  (1)]  p(t)  satisfies  the  differential  equation 

p(t)  »  -2K  p(t)  +  q  (8) 

The  steady  state  mean  squared  error  ps  satisfies  p  (t)  «  0  and  is  easily  found  to  be 
Ps  “  q/2K.  Large  input  dynamics  (large  q)  result  In  large  tracking  errors  and  large  for¬ 
ward  tracking  errors  and  large  forward  loop  gain  (large  K  means  a  fast  tracking  loop) 
results  in  small  tracking  error.  If  q  is  small  enough  and  K  large  enough  that  the  errors 
stay  small,  then  this  linearized  analysis  is  a  sufficient  characterization  of  the  loop's 
behavior.  The  problem  with  linearization  is  that  it  gives  no  indication  of  the  loop's 
behavior  as  the  errors  get  large.  As  the  one  sigma  error  predicted  by  linearized  analysis 
gets  large,  only  a  small  part  of  the  error  trajectories  stay  inside  the  range  where  the 
linearization  is  valid. 

A  second  approach  to  analyze  this  loop  is  to  use  a  describing  function.  The  describ¬ 
ing  function  procedure  in  this  instance  [Gelb  and  Vander  Velde  (2)]  is  to  assume  that  the 
error  is  Gaussian  and  to  replace  the  nonlinear  block  with  the  linear  block  whose  output 
best  matches  in  a  mean  square  error  sense  that  of  the  true  nonlinearity.  The  gain  of  the 
linear  block  is  called  the  describing  function  gain. 

Assume  that  the  error  has  a  Gaussian  density  with  variance  p,  then  the  describing 
function  gain  G(p)  is 


G(p) 


L 


ef(e)  exp 


Some  manipulation  yields  the  describing  function  gain  to  be 


(9) 
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(10) 


This  gain  depends  on  p,  the  mean  square  tracking  error  and  can  be  thought  of  as  the  gain 
an  average  error  trajectory  sees.  As  p,  the  error  variance,  gets  small,  c(p)  approaches 
one,  which  is  the  linearised  gain.  As  {  gets  large,  the  describing  function  gain  G(p) 
approaches  zero  -  a  reflection  of  the  fact  that  most  of  the  error  trajectories  are  outside 
the  range  to  whioh  the  forward  loop  nonlinearity  responds. 

The  differential  equation  for  the  tracking  error  when  written  using  the  describing 
function  gain  is 


e(t)  -  -KG(p)  e(t)  ♦  n(t)  (11) 

The  saute  approach  as  used  for  the  linearised  approach  yields  the  variance  equation 

p  <■  -2K0(p)  p  ♦  q  (12) 

Dependence  of  the  describing  function  gain  on  the  variance  complicates  the  variance  equa¬ 
tion.  Finding  the  steady  state  variance  requires  solving  the  algebraic  equation 


0(P)  P'"  jfjf 


Closed  fora  solution  of  this  equation  is  not  possible,  but  a  graphical  solution  demonstrates 
its  qualitative  properties.  The  function  G(p)  p  versus  p  is  graphed  in  Figure  3.  It  is 
clear  from  the  form  of  the  graph  that  If  q/2K  is  larger  then  about  .42,  then  no  steady 
state  solution  exists. 


The  describing  function  approach  has  provided  two  pieces  of  information  which  the 
linearization  approach  did  not.  First,  it  anticipates  some  gain  reduction  as  the  variance 
grows  and  correspondingly  predicts  worse  tracking  errors.  It  also  predicts  a  combination 
of  q  and  K  for  which  the  loop  will  be  unable  to  track.  The  reason  for  the  difference  in 
the  type  of  information  these  procedures  give  is  that  the  describing  function  approxima¬ 
tion  depends  on  the  global  character  of  the  nonlinearity.  That  is,  it  depends  on  the 
response  of  the  nonlinearity  to  all  different  input  magnitudes,  not  just  one.  The  linear¬ 
ization  approach,  on  the  other  hand,  depends  on  the  character  (derivative)  of  the  nonlin¬ 
earity  near  zero.  The  describing  function  approach  is  therefore  called  a  global  approxi¬ 
mation  and  the  linearization  approach  is  called  a  local  approximation.  In  Section  2 
these  same  ideas  will  be  seen  to  apply  to  the  nonlinear  filtering  problem.  The  extended 
Kalman  filter  will  be  shown  to  be  a  local  approximation,  and  global  approximations  will 
be  given.  The  difference  in  the  type  of  information  obtained  from  these  two  approaches 
will  be  of  the  same  character  as  in  the  example  just  studied. 

2.0  SOME  GLOBAL  APPROXIMATIONS  FOR  NONLINEAR  FILTERING  AND  THEIR  APPLICATION 

This  section  gives  some  global  approximations  which  have  proven  themselves  useful  in 
nonlinear  filtering  and  applies  some  of  these  to  some  radio  navigation  and  target  tracking 
problems.  The  intent  is  to  introduce  some  global  approximations  and  to  make  clear  how 
they  are  used  by  applying  them. 

The  reader  will  observe  that  these  global  approximations  are  more  burdensome  to 
derive  and  implement  than  the  extended  Kalman  filter.  The  global  approximations  will,  as 
a  reward,  give  filters  that  outperform  the  extended  Kalman  filter,  particularly  when  the 
errors  are  large  compared  to  the  range  for  which  linearization  is  valid.  Additionally, 
these  filters  will  offer  more  complete  insight  into  the  effect  of  the  system  nonlinearities 
on  the  filtering  process.  The  describing  function  example  in  Section  1.2  illustrates  the 
benefits  of  a  global  approximation.  In  that  example  the  global  approximation  gave  a  more 
accurate  description  of  the  error  covariance  when  the  errors  were  larger  than  tho  linear 
range  of  the  forward  loop  nonlinearity.  A*  the  error  grew,  the  describing  function  analy¬ 
sis  predicted  gain  reduction  not  predicted  by  the  small  signal  approximation.  In  addition, 
the  describing  function  approximation  predicts  a  level  of  input  that  the  tracking  loop  will 
no  longer  follow.  Global  filtering  approximations  will  give  improved  fidelity  and  an  ana¬ 
lytical  description  of  fundamentally  nonlinear  phenomena  just  as  the  describing  function 
approach  did. 

The  next  things  in  this  section  are  a  brief  discussion  of  nonlinear  filtering  and  a 
description  of  the  physical  problem  which  serves  as  an  example.  A  radar  tracking  problem 
is  ohosen  as  an  example  problem.  Several  different  approaches  are  taken  to  thin  problem. 
Each  approach  leads  to  a  different  mathematical  formulation  of  a  nonlinear  filtering  prob¬ 
lem.  Some  of  the  mathematical  problems  which  stem  from  this  radar  problem  are  quite  simi¬ 
lar  to  those  arising  in  other  physical  situations.  Among  these  others  are  optical  target 
tracking  and  laser  communications. 

The  extended  Kalman  filter  approach  is  demonstrated  to  bo  utterly  unsuited  for  appli¬ 
cation  to  some  of  the  formulations  and  inadequate  in  the  others.  The  failure  of  the 
extended  Kalman  filter  in  this  radar  tracking  problem  makes  it  a  good  example  problem  for 
heralding  the  benefits  of  a  global  approach. 

The  last  part  of  this  section  is  devoted  to  some  global  approximations  and  to  their 
application  to  the  radar  problem.  The  global  approaches  will  prove  to  be  quite  effective. 
They  will  give  filtering  algorithms  which  are  valid  over  a  wide  range  of  operating  condi¬ 
tions,  and  they  will  provide  an  analytic  description  of  the  effect  of  the  system  nonlinear¬ 
ities.  This  description  will  be  intuitively  right  and  useful  in  understanding  the  filter's 
operation. 

2.1  Background  for  Nonlinear  Filtering  Theory 

As  indicated  earlier,  the  problem  addressed  here  is  that  of  extracting  the  signal 
from  a  noisy  measurement,  it  is  supposed  that  the  signal  evolves  as  the  solution  to  a  dif¬ 
ferential  equation  driven  by  noise.  The  signal  will  be  denoted  by  the  n-vector  x(t)  and 

is  assumed  to  satisfy 
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x ( t )  *  f(x(t),t)  +  g(x(t),t)n(t) 


(13) 


The  functions  f  and  g  are  assumed  known  as  is  the  initial  probability  density  of  x(o). 
The  measurement  z(t)  is  assumed  to  satisfy 

z(t)  =  h(x(t),t)  +  v (t)  (14) 

where  h(x(t),  t)  is  a  known  function.  The  time  functions  n(t)  and  v(t)  are  white  noises. 
They  are  assumed  to  be  independent  and  to  have  spectral  density  matrices 

E{n(t)  nT(t+t)}  »  Q(t)«(t)  (15) 


E{v(t)  VT(t+t)}  =  R(t)«(t)  (16) 

The  approach  taken  here  to  extract  the  signal  from  the  noisy  measurement  of  Eq,  (14) 
is  to  compute  at  each  point  in  time  the  conditional  expectation  of  the  signal,  x,  given 
all  the  measurements  taken  up  to  that  time.  Throughout  what  follows  this  conditional 
expectation  shall  be  denoted  by  a  hat.  Thus,  x(t)  is  the  conditional  expectation  of  x(t) 
givan  all  measurements  collected  up  to  time  t. 

The  measurements  themselves  are  processed  to  yield  the  conditional  mean,  x(t).  The 
measurements  may  be  processed  in  other  ways  which  might  reasonably  estimate  the  state 
vector.  The  conditional  mean,  however,  is  the  estimator  which  minimizes  the  mean  square 
error  and  is  therefore  a  highly  desirable  one. 

In  this  section  the  conditional  mean  is  computed  by  propagating  the  solution  to  a 
differential  equation.  Kushner  [kushnerO)]  first  derived  differential  equations  for  the 
conditional  mean.  In  fact  he  gave  a  differential  equation  for  the  conditional  expectation 
of  any  twice  continuously  differentiable  function  of  the  state.  Suppose  ♦(»)  is  such  a 
function,  then 

^  4(x(t))  -  [♦*£  +"Htr(gQg%^)]  +  (&- ♦fi)T  R^UU)  -  h)  (17) 

where  f,  g,  and  h  are  the  functions  appearing  in  the  problem  description  given  in  Eqa.  (13) 
and  (14).  In  Eq.  (17)  4X  i»  the  partial  derivative  of  4  with  respect  to  x  and  4XX  is  the 
second  partial.  The  symbol  tr  stands  for  the  trace  of  a  matrix,  that  is,  the  sum  of  its 
diagonal  entries.  To  obtain  a  differential  equation  for  the  conditional  mean  i(x>  »  x  is 
substituted  into  Eq.  (17). 

In  general  Eq.  (17)  cannot  be  solved  in  closed  form.  The  reason  is  that  on  the  right- 
hand  side  appear  hats  (')  denoting  conditional  expectation.  Taking  the  conditional  expec¬ 
tation  requires  having  the  conditional  probability  density  function.  Propagating  the 
conditional  mean  using  Eq.  (17)  is  generally  not  enough  to  propagate  the  conditional  prob¬ 
ability  density  function. 

In  the  case  where  the  Kalman  filter  applies,  the  conditional  density  is  Gaussian  and 
can  be  characterised  by  its  mean  and  variance.  In  this  case  propagating 


x  and  (x  -  x)  (x  -  x)T 

la  all  that  is  required  to  propagate  the  conditional  probability  density.  Differential 
equations  for  the  foregoing  can  be  derived  from  Eq,  (17).  In  the  more  general  case,  the 
mean  and  covarianco  are  insufficient  to  characterize  the  conditional  density.  Generally, 
an  infinite  number  of  momenta  are  required.  It  is  not  possible  to  propagate  solutions 
an  infinite  (or  even  large)  number  of  differential  equations,  so  some  type  of  approximation 
is  required.  Such  approximations  are  the  subject  of  this  article. 

2.2  Description  cf  a  Radar  Tracking  Problem 

k  radar  tracking  problem  serves,  throughout  this  article,  as  an  example.  This  section 
introduces  the  radar  tracking  problem  to  be  used.  The  problem  is  described  and  posed  math¬ 
ematically.  The  basic  task  for  a  radar  is  to  determine  the  distance,  or  range,  from 
itself  to  some  maneuvering  target.  A  radar  accomplishes  this  by  transmitting  a  signal  and 
measuring  the  time  required  for  the  signal  to  travel  to  the  target,  be  reflected,  and 
return  to  the  radar  transmitter.  Often  the  tame  antenna  that  transmits  the  signal  is  used 
to  receive  the  return.  The  return  signal  ie  then  processed  to  estimate  the  two-way  transit 
time.  The  range  to  the  target  is  inferred  from  this  transit  time.  In  this  article  global 
approximation*  will  be  used  to  arrive  at  algorithms  for  processing  the  rsdsr  return. 

Many  different  types  of  signal  waveforms  find  us*  in  radars.  The  algorithms  to  be 
developed  here  will  apply  directly,  or  with  minor  modification,  to  many  of  the  different 
waveforms.  To  toe  specific,  however,  e  particular  waveform  is  chosen  for  this  example. 

The  basic  waveform  considered  here  ie  built  from  e  pseudorandom  number  (PAN)  code. 


The  pseudorandom  number  coda  la  a  piecewise  constant  waveform  built  from  e  sequent* 


of  pseudorandom  numbers.  A  pseudorandom  number  sequence  is  a  sequence  of  numbers  which 
can  be  generated  systematically  but  which  has  some  of  the  desired  properties  of  a  sequence 
of  random  numbers.  A  familiar  example  of  such  a  sequence  is  computer  generated  noise. 

The  method  for  building  a  PRN  code  from  a  PRN  sequence  is  as  follows.  Suppose  {aj.}j.>o 
is  a  sequence  of  pseudorandom  numbers.  Choose  a  length  of  time  T  and  define  a  PRN  code 
S(t)  by 


S(t)  =  aj_  iT  <  t  <  (i  +  1)T  (18) 

A  possible  PRN  code  S(t)  is  shown  in  Figure  4.  The  length  of  time  T  is  usually  called 
one  chip.  The  inverse  of  T  is  usually  called  the  chipping  rate.  For  example,  for  a  par¬ 
ticular  code  with  a  chipping  rate  of  10  MHz,  T  =  It”7  seconds. 
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Figure  4.  Graph  of  Possible  PRN  Code 

A  particularly  useful  class  of  the  PRN  codes  is  the  class  of  linear  feedback  shift 
register  codes.  In  practice  such  a  code  is  generated  by  a  pseudorandom  number  sequence 
in  which  the  numbers  are  binary.  These  sequences  look  like  coin  toss  sequences  except 
that  they  repeat  at  gome  low  frequency,  hinary  sequences  are  useful  in  practice  because 
they  can  be  generated  by  simple  digital  circuits. 

A  PRN  code  is  used  to  build  a  signal  waveform  by  multiplying  a  sinusoidal  carrier 
wave  by  the  PRN  code.  T Ke  result  is  then  transmitted.  If  the  PRN  code  is  a  *1  binary 
code,  then  multiplying  this  code  by  the  carrier  sinusoid  simply  introduces  a  180°  phase 
shift  when  the  PRN  code  equals  -1  or  leaves  tha  sinusoid  unchanged  when  the  code  is  +1. 

After  being  reflected  and  received  at  the  radar,  the  signal  first  has  its  carrier 
wave  removed,  what  is  left  is  a  PRN  code  which  is  delayed  by  comparison  to  the  code  on 
the  transmitted  signal.  The  object  is  to  determine  how  much  this  received  code  is  decayed. 
There  are  several  different  approaches  to  the  problem  of  determining  the  delay  between 
the  transmitted  and  received  wriveforms.  Suppose  that  S(t  +  ett)j  is  the  received  PRN 
code  and  5 ( t)  is  the  delay  in  that  code.  The  delay  «(t)  is  a  function  of  time  because 
tno  target  is  moving  hnd  tne  two-way  transit  time  changing.  Beside*  the  delayed  code 
Slt^Oft))  the  received  signal  contains  some  noise.  In  some  cases  this  noise  is  thermal. 

In  other  eases  the  noise  predominantly  due  to  other  radio  transmissions  close  in  fre- 
.qucncy  to  the  radar’s..  A  military  target,  example,  might  transmit  signals  to  coofuse 
the  radar.  These  signals  would  -look  uk-v  aoi'se  to  the  radar  receiver. 

The  signe*  to  be  processed  is  <*  combination  »S  the  delayed  PRN  code,  S ( t  *  9  ( t } ) ,  and 
noise.  If  -alt)  is  used,  to  denote  the  signal  to  be  processed,  then  mathematically 


s(t>  *  S  U  ♦  c(tf  >  ♦  v(t)  ug> 

The  sequence  of  events  leading  to  this  measurement  is  depicted  in  Figure. 5. 
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Figure  3.  Sketch  of  Radar  Tracking  Sy.sae 


The  measurement  z(t)  is  scalar,  and  the  measurement  noise  v(t)  is  white  with  spectral 
density  r(t). 

This  radar  tracking  problem  is  now  very  nearly  in  the  form  of  a  nonlinear  filtering 
problem-  A  state  state  variable  model  for  $(t)  is  all  that  is  required  to  complete  the 
mathematical  formulation  of  the  problem.  Suppose  that  the  radar  antenna  is  at  the  origin 
of  a  three-dimensional  Cartesian  coordinate  system.  The  target  has  coordinates 
(xi(t),  x2(t),  X3 { t) )  in  this  system.  The  target  coordinates  evolve  according  to  the 
equations  of  motion  for  the  target.  There  is  some  uncertainty  in  th.  evolution  of  the 
target  location,  since  the  control  inputs  to  the  target  (thrust  level  or  changes  in  aero¬ 
dynamic  control  surfaces)  are  unknown  to  the  radar.  The  coordinates  of  the  target  can  be 
aggregated  into  a  vector  x(t)  which  has  n- ^ laments,  the  first  three  of  which  are  the  coor¬ 
dinates  Xj,  x2,  X3.  Other  elements  in  the  vector  might  be,  for  example,  the  target  velocity 

or  pitch  angle.  The  target  equation  of  motion  can  be  put  in  the  form 

* 

x(t)  ®  f (x ( t ) )  +  g(x(t))n(t> 

The  difficulty  and  importance  of  developing  this  state  variable  model  is  not  to  be 
minimized.  The  state  model  is  the  starting  point  for  the  filter  designs  which  shall  be 
presented.  The  effectiveness  of  the  filter  will  depend  on  quality  of  the  state  variable 
model.  A  high  dimension  state  vector  leads  to  a  computationally  difficult  design.  It  is 
necessary  therefore  to  find  a  state  model  which  captures  the  important  features  of  the 
target  motion  in  as  short  a  state  vector  as  possible.  To  be  able  to  do  this,  the  designer 
needs  to  understand  both  the  physics  of  the  target  motion  and  the  requirements  of  th©  filter 
not  yet  designed.  Arriving  at  a  state  model  is  in  itself  a  difficult  problem. 

For  purposes  of  this  article  a  simple  target  motion  model  will  bt  used.  It  will  be 
assumed  that  the  delay  8(t)  is  a  Brownian  nation.  That  is, 

0  =  n(t)  (20) 


where  n(t)  is  a  white  noise  with  spectral  density  q.  This  target  motion  model  is  the 
simplest  that  still  has  some  randomness.  Even  though  it  is  a  gross  oversimplification  of 
any  real  target's  motion,  it  turns  out  to  yield  a  filter  that  ie  useful  for  some  problems. 
Part  of  the  justification  for  using  this  model  is  that  it  keeps  focus  on  the  filter  design 
problem  instead  of  shifting  it  to  tho  problem  of  modeling  a  particular  target. 

Mathematically,  the  radar  signal  processing  problem  can  be  posed  as 


3  »  S(t  ♦  eU>)  »  v(t) 


(22) 


The  extended  Kalman  filter  (EK?)  is  inappropriate  to  apply  to  this  problem.  Tho  KKK  pro¬ 
cedure  requires  differentiating  with  respect  to  ott).  since  5(t*e(t>)  is  piece- 

wise  constant,  the  derivative  with  respect  to  9  It)  is  either  aero  or  undefined  (infinity). 
One  ivculd  overcome  this  difficulty  by  using  a  finite  difference  Instead  of  a  derivative, 
but  there  is  really  no  justification  for  a  finite  difference.  The  global  approximations 
shown  later  apply  directly  tb  this  problem-  The  fact  that  SJt+ettj)  is  not  continuous  tor 
differentiable)  does  not  cause  them  any  difficulty. 

There  is  another  way  to  pose  this  radar  signal  processing  problem.  It  is  not  quite 
so  fundamental  as  the  problem  jufit  posed.  Some  structure  ie  imposed  on  the  processor  to 
arrive  at  a  different  formulation.  There  are  two  main  reasons  for  considering  this  other 
formulation.  Hrst,  this  second  formulation  is  mathematically  the  same  as  the  problem  of 
target  tracking  with  a  narrow  f ield-of-view  optical  device  (a  telescope  srhaps) .  This 
i\,  at,  interesting  and  useful  problec-  in  Sts  own  right  and  a  problem  vh ‘  h  appeals  directly 
to  most  people's  Intuitions.  Second,  this  sec -nd  formulation  is  traditionally  used  to 
design  radar  signal  processors.  Xt  is  enlightening  to  see  how  era Htional  solutions  com¬ 
pare  to  what  global  approximation  yield*  and  to  compare  the  solutions  obtained  for  these 
different  formulations. 

The  formulation  of  thin  radar  tracking  problem  assumes  that  the  received  sig¬ 

nal,  after  having  the  carrier  signal  removed,  is  multiplied  by  a  code  S(t*  i(t>).  This 
code  is  the  same  aw  tha  received  code  except  that  it  is  delayed  by  i(t>  instead  of  e(t). 

Tbs  delay  8  (t) con  be  thought  of  as  tho  delay  that  the  radar  expects  for  the  received  code 
to  have. 

The  expected  result  of  multiplying  the  received  signal  by  s(t  ♦  t  it) )  is  the  autocor¬ 
relation  of  the  FRN  code  ut  time  shift  ((t)  -  e(t)  corrupted  by  some  noise. 

The  code  autocorrelation  *^(*1  defined  by 

«•■*»>  “  u1**  K  /u  S(t>S(t  ♦  ’>dt 


(23) 


has  the  form 


RSS^T^  “  1  “  1^,1 


(24) 


=  e(t) 


for  |  t  -NTj  £  T 
for  N  any  integer 
otherwise 


where  T  is  the  code  repeat  time  and  e ( t )  is  a  function  which  is  small  compared  to  one. 
Figure  6  shows  how  Rss(r)  might  look.  It  will  be  assumed  in  what  follows  that  the  code 
autocorrelation  Rss(t)  has  the  form 

Kss^  It^  <  p 
=  0  otherwise 


This  assumption  is  justified,  as  far  as  the  consequent  estimation  procedure  is  concerned, 
if  the  a  •priori  probability  of  the  state  (range)  is  concentrated  on  an  interval  TR  wide. 


Figure  6.  Picture  of  PRN  Code  Autocorrelation 

In  many  practical  systems  the  repeat  time  TR  may  be  very  much  larger  than  a  priori 
timing  uncertainties.  For  example,  one  ranging  system  in  current  operation  has  a  repeat 
time  TR  =  200  days,  while  initial  timing  uncertainties  might  be  a  few  microseconds. 

Mathematically  the  result  of  multiplying  the  measurement  by  a  code  with  the  expected 
delay  is 


z  (t)  S  (t  +  8  (t) )  =  [S  (t  +  6  (t)  +  v(t)]  S  ( t  +  8  ( t. ) ) 

»  S(t+  8  (t) )  S(t+  8  (t) )  +  v  { t) 


(26) 


In  Eq.  (26)  v(t)  =  S ( t + 8 ( t) ) v ( t)  and  is  a  white  noise  with  spectral  density  r(t)  just  like 
v (t)  was.  The  first  term  on  che  right-hand  side  of  Eq.  (26)  displays  a  similaiity  to  the 
integrand  in  the  definition  of  the  autocorrelation  function.  It  seems  reasonable  to 
expect  (and  can  be  demonstrated)  that  the  RHS  of  Eq.  (26)  and  Rsg[e(t)  -  0(t)]+  n(t)  have 
equal  time  integrals,  or,  equivalently,  that  the  outputs  of  the  circuits  shown  in 
Figure  7(a)  and  (b)  have  equal  time  integrals.  If  these  twc  outputs  have  oqual  time  inte¬ 
grals,  then  a  measurement  processor  which  acts  as  a  low  pass  filter  will  have  the  same 
response  to  one  as  to  the  other.  The  processors  proposed  latar  will  act  as  low  pass  fil¬ 
ters,  so  modeling  the  physical  situation  in  Figure  7(a)  by  the  block  diagram  of  Figure  7(b) 
will  be  valid. 
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(a)  Physical  situation  (b)  Mathematical  model 

Figure  7.  Modol  for  Code  Correlator 

The  essence  of  the  arguments  above  is  that,  after  some  multiplications  by  known  sig¬ 
nals,  the  input  signal  may  be  taken  to  be  a  function  2(t)  satisfying 

2  (' )  “  R3S{.8(f)  -  6  (t)  3  +  v(t)  (27) 

The  input  signal  was  correlated  with  a  single  known  code  to  produce  the  measurement 
2(t).  Several  such  correlations  of  the  input  can  be  performed  against  several  shifts  of 
the  known  code  (i.e.,  against  codes  s[t  +  dj(t)]  ,  sft  -t  dj(t)J  ,  ...1.  This  produces  sev¬ 
eral  measurements  2^(t) 


Zi(t)  -  RBS|8(t)  -  di(t)  |  +$i(t) 


(28) 


where  (t)  is  the  time  shift  used  to  generate  the  ith  shift  of  the  known  code  and  the 
noise3  n^  are  independent  of  one  another. 

This  measurement  model  is  quite  similar  in  form  to  one  which  arises  in  optical  target 
tracking.  The  measurement  2i  in  Eq.  (28)  has  two  components.  First,  it  containn  noise. 
Second,  it  has  a  component  due  to  signal.  If  the  ith  delay,  d^,  is  equal  to  the  delay 
on  the  received  code,  then  the  signal  component  is  one.  If  the  delay  is  grossly  dif¬ 
ferent  than  the  received  delay,  then  the  signal  component  is  zero.  typical  optical 
sensor  has  the  image  plane  (television  screen,  for  example)  divided  into  a  matrix  of  small 
squares.  Each  small  square  has  a  component  of  its  output  due  to  noise.  It  also  has  a 
component  due  to  the  target.  The  target  component  is  one  if  the  target  is  in  the  small 
square  and  aero  if  it  is  outside.  The  code  tracking  problem  can  then  be  visualized  as  a 
one-dimensional  optical  tracking  problem. 

It  is  worthwhile  at  this  point  to  briefly  describe  how  the  delay  of  a  PRN  coda  has 
traditionally  been  tracked. 


Shift  Geneiator 


Figure  8.  Code  Loop  Discriminator 

Figure  8  shows  a  block  diagram  of  the  circuitry  typically  used  to  develop  an  error  signal. 
The  two  multipliers  shown  in  Figure  8  provide  two  correlations.  The  common  input  to  these 
two  multipliers  is  the  received  signal.  The  known  code  inputs  to  the  correlators  are  time 
shifted  by  a  fixed  amount  relative  to  one  another.  These  time  shifted  codes  are  obtained 
by  putting  the  code,  out  of  the  code  generator,  into  a  tapped  delay  line  (shift  register) . 

Suppose  that  the  code  in  the  center  position  of  the  shift  register  in  Figure  8  is 
believed  to  be  in  synchronism  with  the  input  code.  That  is,  the  coda  s[t  +  §(t>]  is  in 

the  center  of  the  shift  register.  The  code  one  shift  to  the  right  is  then  S(t  ♦  0  -  1) , 

and  that  one  shift  to  the  left  is  S(t  +  8  +  1).  The  results  of  multiplying  each  of  these- 

with  the  cods  (disregarding  input  noise)  on  the  input  signal  are  modeled,  according  to 

earlier  analysis,  by  RS8(0  -  §  +  1)  and  Rs8(9  -  5  -  1),  respectively.  The  difference 
between  the  two  correlations  with  the  input  signal  yields  a  measurement  z(t)  which 
satisfies 

s(t)  -  R(6-8-l)  -  R(«  -  I  +  1)  v(t>  (29) 

The  dovice  shown  in  Figure  8  is  called  an  early-late  detector.  Early-late  refers  to 
the  fact  that  the  correlations  are  with  local  codes  which  are  earlier  and  later  than  the 
expected  input.  Figure  9  shows  the  mathematical  model  of  the  device  in  Figure  8. 
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Figure  9.  Mathematical  Model  of  Code  Loop  Early-Late  Detector 

The  early-late  detector  then  gives  an  error  measurement  over  a  limited  range.  The  circuit 
can  be  enclosed  in  a  feedback  loop  and  used  to  continuously  track  the  input  code  delay. 

The  resulting  tracking  loop  is  called  a  delay  lock  loop.  In  optical  tracking  problems 
there  is  an  analogous  error  measuring  circuit  called  a  quadcell  detector.  This  circuit 
has  four  optical  sensors  located  in  a  square  array  -  one  sensor  at  each  corner  of  the 
square,  imagine  that  the  square  is  oriented  sc  that  its  sides  are  cither  vertical  or  hori¬ 
zontal.  Tne  difference  between  the  sum  of  the  outputs  of  the  two  right  cells  and  the  sum 
of  the  outputs  of  the  two  loft  cells  gives  an  azimuth  error  indication.  The  difference 
botween  the  top  and  bottom  gives  an  elevation  error  indication. 

N 

The  problems  of  using  a  non.,  inearity  of  this  sort  in  a  feedback  loop  are  fairly 
obvious.  Si  ice  the  forward  loop  nonlinearity  gives  no  output  for  errors  greater  than  two, 
it  cannot  au  itain  errors  of  that  size.  This  limits  the  input  dynamics  and  noise  level 
which  the  loop  will  track. 

In  cons*. acting  tho  early-late  detector,  only  two  multiplier  outputs  were  used.  If 
it  appears  to  offer  some  advantage,  many  multiplier  outputs  can  be  provided.  The  object 
of  the  work  which  inspired  tho  authors  interest  in  global  approximation  was  to  see  i* 
extra  correlator  outputs  could  be  used  to  extend  the  range  of  dolay  dynamics  and  noise 
through  which  the  delay  can  bo  tracked. 


One  approach  to  extending  the  range  of  operation  is  depicted  in  Figure  10. 

Figure  10(a)  shows,  on  a  single  graph,  the  outputs  of  several  early  and  several  late  mul¬ 
tipliers.  Adding  all  the  early  correlations  and  subtracting  all  the  late  correlations 
yields  a  broadened  forward  loop  nonlinearity  as  shown  in  Figure  10(b).  This  method  is 
suggested,  for  example,  in  [Spilker  (4)]  and  in  [Schiff  (5)]  .  The  difficulty  with  adding 
these  extra  multiplier  outputs  in  this  manner  is  that  each  additional  correlation  brings 
with  it  additional  noise.  The  measurement  resulting  from  adding  more  than  two  multiplier 
outputs  is  then  noisier  than  the  traditional  early-late  measurement.  This  means  that  in 
the  cases  where  the  traditional  scheme  is  able  to  track  with  small  errors,  the  extended 
range  scheme  will  experience  relatively  larger  errors,  because  of  this,  the  traditional 
wisdom  is  that  extra  multipliers  are  not  useful  for  tracking.  The  fallacy  with  this  is 
that  the  reasoning  only  applies  in  benign  conditions  where  the  traditional  scheme  can 
track  with  small  errors.  That  is.  the  reasoning  is  based  on  a  small  signal  analysis. 

Use  of  global  approximation  will  indicate  how  this  reasoning  should  be  modified.  If  con¬ 
ditions  are  severe  enough  to  cause  the  failure  of  the  traditional  scheme,  then  benefit 
can  be  derived  by  including  extra  multiplier  outputs.  This  will  be  demonstrated  in 
Section  2.3. 
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Figure  10.  One  Method  for  Employing  Multiple  Correlator  Outputs 

Two  starting  points  will  be  used  to  learn  how  to  benefit  from  extra  multiplier  out¬ 
puts.  One  approach  starts  by  modeling  the  output  of  each  multiplier  as  a  separate  measure¬ 
ment.  The  second  approach  starts  by  assuming  the  multiplier  outputs  are  weighted  and  added 
together  to  yield  a  single  nonlinear  error  measurement.  This  measurement  nonlinearity, 
might,  for  example,  look  like  the  one  graphed  in  Figure  10(b).  In  this  second  approach  the 
shape  of  the  nonlinearity  is  not  specified.  An  optimum  shape  is  determined  by  using  global 
approximation  techniques.  This  last  approach  is  analogous  to  feedback  communication 
wherein  the  moduation  or  measurement  nonlinearity  can  be  modified. 

Altogether,  three  starting  points  have  been  suggested.  The  different  approaches  use 
the  same  model  for  the  delay  dynamics,  but  differ  in  that  thoy  use  different  models  for 
the  measurement.  The  first  approach  is  to  call  tho  received  code  and  noise  in  Eq. 
the  measurement.  The  second  and  third  approaches  assume  the  recoived  eode  has  boon  multi¬ 
plied  by  various  shifts  of  a  code  generated  in  the  receiver.  The  second  approach  considers 
the  array  of  multiplier  outputs  to  be  the  measurement.  The  third  considers  a  single 
voightod  sum  of  multiplier  outputs  to  be  tho  measurement. 

None  of  these  approaches  is  amenable  to  oxtendcd  Kalman  filtering.  Tho  extended 
Kalman  filtering  uses  the  partial  derivative  of  the  measurement  nonlinearity  with  respect 
to  the  state  variablo  evaluated  at  the  expected  state.  As  mentioned  earlier,  the  first 
meieuromont  model  where  tho  received  code  is  tho  measurement  is  not  amenable  to  this 
approach,  because  the  code  either  has  aero  derivative  or  is  not  differentiable.  The  second 
approach  where  tho  multiplier  outputs  are  Considered  measurements  is  not  amonable,  because 
the  derivative  of  the  code  autocorrelation  is  sure  further  than  one  away  from  its  peak. 

This  cautws  the  extended  Kalman  filter  to  ignore  outputs  of  correlators  further  than  two 
away  from  the  expected  delay.  Tho  third  approach  wherein  tho  measurement  is  modeled  as  a 
single  adjustable  nonlinear  function  of  tho  state  is  not  amenable  extended  Kalman  filter¬ 
ing  either*  Since  the  extended  Kalman  filter  characterises  the  nonlinearity  by  its  slope 
evaluated  at  the  expected  value  of  the  state,  it  considers  the  nonlinearitios  in  Figures  9 
and  10(b)  to  fee  equivalent,  in  terms  of  small  signal  behavior  they  are  equivalent,  but 
in  terms  of  large  error  behavior  they  certainly  are  not.  These  approaches  «r*  all  amen¬ 
able  to  global  spproximation  as  will  be  seen  in  Section  £.3. 

2.J  Gaussian  Approximation 

The  radar  tracking  problem  described  in  Section  2, 2\ has  been  ma them*’ ti cully  described 
in  several  different  ways.  All.  of  the  mathematical  problems  which  have  arisen  from  the 
radar  problem  fit  within  the  general  framework  of  the  nonlinear  filtering  problem  intro¬ 
duced  earlier.  In  this  section  an  approximate  solution  for  the  nonlinear  filtering  prob¬ 
lem  will  be  used  to  arrive  at  solutions  for  the  radar  tracking  problem.  First,  the 
approximation  will  be  dosetibed  generally.  Next,  the  approximation  will  be  applied  to 
each  of  tho  mathematical  problems  which  have  arisen  from  the  radar  tracking  problem.  The 
inability  of  the  extended  Kalman  filter  to  accomplish  those  design  tasks  will  bo  pointed 
out,  and  the  designs  which  result  from  the  different  mathematical  formulations  of  tho 
same  radar  problem  will  be  compared. 

In  Section  2.1  Fajhner's  equations  for  propagating  tho  mean  and  variance  of  the  con¬ 
ditional  density  were  presented.  It  was  pointed  out  thst  the  problem  with  these  equations 
was  that  each  equation  required  on  its  right-hand  side  the  entire  conditional  density, 
while  propagating  the  equation’s  left-hand  side  only  yielded  one  moment  of  the  conditional 
density*  Since  an  infinite  number  of  moments  of  the  conditional  density  are  required  to 


reconstruct  it,  an  infinite  number  of  equations  must  be  solved  to  propagate  the  condi¬ 
tional  density.  To  circumvent  this  difficulty  an  approximation  will  be  used. 

Suppose  that  the  plant  equation  is 

x  =  f(x(t),t)  +  g (x(t) , t) n(t)  (30) 

and  that  an  observation  of  the  form 


z(t)  =  h(x(t),t)  +  v(t)  (31) 

is  made.  Differential  equations  for  the  conditional  mean  and  covariance  can  be  derived 
directly  from  Eq.  (17)  presented  in  Section  2.1.  The  differential  equation  for  the  mean 
is 

x  «  f(x,t)  +  (x^i)hT  r'1  (t)  (z(t)  -  h)  (32) 


Denote  the  covariance  matrix  by  P,  that  is 


P  *  (X  -  X)  (X  -  X)  ' 
The  differential  equation  for  the  covariance  is 


(33) 


Pij  “  <xi~*i)fj  +  (xj”*j)fi  +  i  j  “  (xi  "  xi)hTR“1(Xj  -  Xj)h 


+  [(xi  “  (xj  -  x.j)  (h  -  h)j  A  R~  1  (z(t) 


(34) 


h) 


In  this  equation  Pij  is  the  ijth  element  of  the  covariance  matrix  P.  The  initial  condi¬ 
tions  for  the  conditional  mean  and  covariance  equations,  Eqs.  (33)  and  (34),  come  from 
the  known  initial  density  of  the  state.  Specifically, 


x(o)  <*  E{x(o)  1 


(35) 


and 

P(o)  »  e|[x(o)  -  x(o)] [x(o)  -  x(o)]T}  (36) 

The  approximation  to  be  used  here  is  best  explained  by  an  intuitive  inspection  of  the 
right-hand  side  of  the  moan  and  covariance  equations.  Notice  that  the  conditional  density 
is  required  on  the  right-hand  side  of  these  equations  in  order  to  carry  out  the  expecta¬ 
tion  operations.  These  expectation  operations  are  the  integral  of  the  indicated  quanti¬ 
ties  against  the  conditional  density.  For  example,  the  quantity  £(x,  t)  appears  on  the 
right-hand  side  of  the  mean  equation.  This  quantity  car  be  expressed  in  terms  of  the 
conditional  probability  density.  Suppose  that  the  conditional  density  of  the  state  at 
time  t,  given  measurements  up  to  time  t,  is  Pt(x|z).  The  quantity  f(x,  t)  is  then  given 
by 


f(x(t),  t)  -  y*f( r,  t)pt(r|z)dr  (37) 

Since  the  conditional  density  appears  in  an  integral  like  this,  perhaps  its  precise 
shape  \s  not  critical  to  the  accurate  propagation  of  the  conditional  mean.  The  supposi¬ 
tion  is  made  that,  in  fact,  as  far  as  the  quantities  in  the  mean  and  variance  equations 
are  concerned,  only  the  mean  and  variance  of  the  conditional  density  are  significant.  If 
this  is  true,  then  the  expectations  on  the  right-hand  side  of  the  mean  and  variance  equa¬ 
tions  can  be  carried  cut  using  any  density  which  has  the  right  mean  and  variance. 

S  density  which  is  conveniently  characterized  by  its  mean  and  density  is  the  Gaussian 
density. and  that  is  the  one  that  will  be  used  here.  For  example,  using  Gaussian  approxi¬ 
mation  f(x»  t)  becomes 


i(x,  t>  *  ft  (r,  t)  Pt(r|Z)  dt 


(38) 


That  is,  all  the  conditional  expectations  on  the  right-hand  side  of  the  mean  and  var¬ 
iance  equations  are  carried  out  by  assuming  that  the  conditional  density  is  Gaussian  in 
fora. with  mean  value  x  and  covariance  P.  The  derivatives  of  x  and  P  appear  on  the  left- 
hand  side  of  the  mean  and  covarianoe  equations,  what  results  is  a  coupled  set  of  differ¬ 
ential  equations  which  can  be  solved  by  ordinary  numerical  methods.  The  effect  of  this 


approximation  is  then  to  truncate  the  number  of  equations  reuired  to  propagate  the  condi¬ 
tional  density.  Since  the  density  has  been  supposed  to  only  depend  on  its  mean  and  vari¬ 
ance,  only  the  mean  and  variance  equations  need  to  be  propagated.  The  procedure  leads  to 
equations  which  are  much  like  the  familiar  Kalman  filtering  equations  in  form,  but  which, 
as  will  be  seen,  depend  on  the  global  character  of  the  measurement  and  system 
nonlinearities. 

2.4  Use  of  Gaussian  Approximation  to  Determine  Optimum  Measurement  Nonlinearity 

The  first  mathematical  problem  on  which  Gaussian  approximation  shall  be  used  is  the 
deformable  detector  problem.  Suppose  that  the  input  delay  process  8 { t)  satisfies 

e (t)  =  n (t)  (39) 

and  that  the  nonlinear  measurement,  built  from  weighted  correlator  outputs,  satisfies 

z(t)  =  h ( 8  (t)  -  6  ( t) )  +  v(t)  (40) 

The  noise  processes  n(t)  and  v(t)  have  spectral  densities  r(t)  and  q(t),  respectively. 

The  shape  of  the  measurement  nonlinearity  h(e)  depends  on  what  weights  the  correlator  out¬ 
puts  are  multiplied  by  before  being  added  together.  Either  one  of  the  nonlinearities 
shown  in  Figures  9  and  10(b)  could  be  achieved  by  some  selection  of  weights.  More  gener¬ 
ally,  suppose  that  weight  wi  is  applied  to  the  correlator  shifted  by  i  increments  with 
respect  to  the  expected  on-time  code.  The  result  of  this  is  a  nonlinearity  composed  of 
straight  line  segments  connecting  the  points  (~(n+l),  0),  (-n,  w_n) ,  (-(n-1),  w-(n-i)), 

...,  (-1,  w_j),  (0,  wD) ,  ...,  (m,  Wm) (m+1,  0)  where  the  w^  are  the  arbitrary  weights. 

This  nonlinearity,  shown  in  Figure  11,  will  be  denoted  by  h(e).  Notice  that  for  an 
integer  i,  h(i)  =  w^.  The  detector  drawn  in  Figure  11  would  probably  not  be  a  useful  one. 
The  point  is,  however,  that  very  general  shapes  are  obtainable. 


Figure  11.  Hypothetical  Detector  Obtainable  by  Using  Weights  w^ 

Noise  is  an  essential  consideration  in  determining  the  optimum  detector  shape.  The 
noises  present  on  the  different  correlator  outputs  are  independent,  zero  mean,  and  white 
with  equal  covariances.  Let  n^(t)  be  the  noise  present  on  the  output  of  the  i^h  corre¬ 
lator  and  let  E(nj_(t)  ni(t)}  =  rs(t-t).  The  spectral  density  of  the  noise  on  the  weighted 
sum  of  the  correlator  outputs  is  then 
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The  postdetection  noise  variance  is  then  proportional  to 
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(41) 


The  wf  must  be  chosen  so  that  their  signal  detection  assets  outweigh  their  noise 
liabilities. 

It  will  be  assumed  that  the  function  h{*)  is  such  that 


h2(e)  de 


(42) 


For  the  nonlinearities  which  will  arise  in  this  design,  this  condition  will  be  satisfied. 

The  object  is  now  to  determine  the  measurement  nonlinearity  h(*)  (weights  w£)  which 
gives  the  best  performance.  The  nonlinearity  will  be  constrained  to  have /h2  (e)de  -  1 
and  to  be  antisymmetric.  Beyond  that  it  iB  unconstrained.  The  procedure  is  to  design  a 
filter  with  h(*)  unspecified  and  then  to  choose. h(*)  to  optimise  the  filter's  performance. 
The  conditional  moments  [Fujisaki  et  al  (6)]  and  [dark  (7)J  satisfy 


(43) 


§  »  (6  -  0)h  ±(z(t)  -  h) 


and 


2  i  +  q  +  (6  -  8) 2 (h -  h)  i-  (z(t)  -  h) 


(44) 


These  equations  result  from  specializing  Eqs.  (32)  and  (34)  to  the  present  situation.  A 
lower  case  p  is  used  to  represent  the  covariance  to  emphasize  the  fact  that  it  is  a 
scalar.  Applying  Gaussian  approximation  to  this  problem  yields 


d8 


5  =  GlEL 


dz 


_  _  G2  (P)  . 
1  ■ 


dt 


(45) 

(46) 


where 


G(p)  = 


hf 


eh(e)  exp  {- 


2p 


de 


(47) 


This  can  be  changed  into  a  more  recognizable  form  by  defining  H(p)  to  be 

H  (p)  = 


■  p  vte  £ ah,e’ I 


eh(e)  exp  { -  ^  }  de 


(48) 

(49) 


The  function  H(p)  is  then  the  describing  function  gain  for  the  nonlinearity  h(»)-  If 
pH (p)  is  substituted  for  G(p),  the  filter  equations  become 


de  =  ESM  d2 


(50) 


it 


_  P2H(p) 2 
r 


+  q 


(51) 


These  equations  can  be  recognized  as  the  Xalman  filter  linearized  with  the  describing 
function  gains.  Gaussian  approximation  can  generally  be  interpreted  as  a  Kalman  filter 
linearized  using  a  describing  function.  In  general,  there  will  usually  be  a  data  depen¬ 
dence  in  the  covariance  equation.  Two  elements  of  the  problem  under  consideration  combine 
to  remove  data  dependence  in  the  covariance  equation.  The  measurement  nonlinearity  is  a 
function  of  the  estimation  error,  not  of  the  state  alone,  and  it  is  an  antisymmetric 
function. 

The  mean  and  covariance  equations  represent  an  approximate  solution  to  the  filtering 
problem  for  an  arbitrary  nonlinearity  h.  The  complete  problem  will  be  solved  when  the 
nonlinearity  is  selected  to  yield  optimum  filter  performance.  Inspection  of  the  covari¬ 
ance  reveals  that  only  one  term  is  affected  by  the  choice  of  the  nonlinearity.  That  term 
corresponds  to  the  qudratic  term  in  the  usual  Kalman  filter.  To  minimize  the  covariance 
then,  the  best  strategy  is  to  maximize  h2(p).  Doing  this  makes  the  derivative  of  the 
covariance  as  small  as  possible.  The  optimum  detector  h*  then  satisfies 
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eh(e)  exp  -  ^  de|  =  |  /  eh*(e)  exp  -  ^  de 
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This  is  equivalent  to  solving  the  unconstrained  problem 
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r2(e)  de 
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and  setting 


h*  (e) 


r*(e)  [  /V2(e)  de]‘l/2 


Figure  12.  Optimum  Measurement  Nonlinearity 

It  is  more  or  less  linear  over  a  range  (-  %/p,  Wp).  An  interesting  feature  of  the  design 
is  that  the  nonlinearity  is  not  fixed  hut  changes  in  time.  The  nonlinearity  depends 
directly  on  the  traoking  error  variance  p.  since  p  changes  in  time>  then  so  does  the 
nonlinearity.  The  variance,  for  example,  might  start  out  large  when  the  radar  is  first 
turned  on.  As  the  signal  1b  processed,  the  radar  becomes  more  certain  of  the  target 
whereabouts  and  the  variance  decreases.  As  the  variance  decreases,  the  linear  range  of 
the  nonlinearity  shrinks.  The  linear  range  of  the  nonlinearity  is  always  botweon  plus  and 
minus  one  sigma  of  the  predicted  tracking  error. 

This  general  sequence  of  operations  is  more  or  less  like  ones  used  in  many  optical 
and  radio  signal  acquisition  systems.  A  narrow  field  of  view  is  required  for  final  track* 
ing  so  that  the  final  traoking  accuracy  is  good,  but  a  broad  field  is  required  during 
initialisation  because  the  initial  uncertainties  are  usually  large.  The  standard  proc* 
dure  then  is  to  begin  operation  with  a  broad  field  device  and,  after  the  errors  have  been 
reduced,  to  switch  over  to  a  narrow  field  device.  The  procedure  which  has  resulted  from 
this  nonlinear  filtering  approach  agrees  generally  with  standard  practice  and  with  intui¬ 
tion.  This  procedure  offers  the  additional  benefit  that  it  provides  an  analytical  frame¬ 
work  for  deciding  when  to  switoh  between  sensors. 

Using  the  optimal  detector  gives  mean  and  variance  equations 


where 


and 

h*  (e) 

=  (f)~  2  (*p)‘  4  6  ex?  {“  §?} 

Integrating  yields 

HMp)  -  f 1/4  P'3/4 

(62) 

Thus 

x  _-l/4  d1/4 

6(t)  =4  E  Z  t) 

l  r 

(63) 

Z'1/2  E1/2 
dt  ^  4  r 

(64) 

The  steady  state  covariance  is 

then  given  by  p  =  0  or 

pi/2  =  4  m2-/2  qr 

(65) 

=  7.0898  qr 

(66) 

Figure  13  shows  results  of  Monte  Carlo  simulation  of  this  tracking  loop.  One  hundred 
runs  were  made,  and  the  predicted  and  actual  mean  square  tracking  errors,  as  functions 
of  time,  are  plotted  in  Figure  13.  The  conditions  for  these  runs  are  as  follows.  The 
error  at  the  start  is  Gaussian  with  expected  value  zero  and  variance  10.0.  The  spectral 
density  of  the  noise  in  the  state  equation  (called  q)  is  1.0.  The  spectral  density  of  the 
measurement  noise  (called  r)  is  0.354.  This  spectral  level  is  chosen  so  that  the  pre¬ 
dicted  steady  state  one  sigma  error  is  0.25. 

Figure  13  shows  that  for  this  problem  Gaussian  approximation  does  not  work  as  well 
as  onto  might  hope.  The  covariance  does  not  behave  as  it  was  predicted  to  behave. 

Figure  13  is  a  little  misleading,  however.  The  tendency  is  to  believe  that  a  particular 
trajectory  will  behave  roughly  like  the  one  sigma  trajectory.  This  is  not  the  case,  how¬ 
ever.  A  sampling  of  the  error  traj octorios  indicates  that  for  about  80  percent  of  the 
runs  the  errors  behave  as  predicted.  For  the  remaining  20  percent  the  errors  are  much 
worse  than  predicted.  An  average  computed  on  the  basis  of  these  errors  falls  somewhere 
between  the  trajectories  which  are  not  captured  and  those  that  are.  A  given  error  tra¬ 
jectory,  then,  looks  either  somewhat  worse  or  much  better  than  the  experimental  one  sigma 
plot  which  is  shown. 
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Figure  13.  Acquisition  Performance  of  Gaussian  Approximation 


The  problem  with  the  system,  as  it  is  now  configured,  is  that  the  actual  errors  do 
not  affect  the  covariance.  Even  when  the  actual  errors  are  very  large,  the  predicted 
covariance  as  plotted  in  Figure  13  is  quite  small.  Somehow  the  actual  errors  must  factor 
into  the  covariance  computation.  The  reason  that  the  measurement  does  not  affect  the 
covariance  is  that  the  measurement  nonlinearity  is  antisymmetric.  In  the  covariance  equa¬ 
tion,  Eq.  (44)  for  example,  the  coefficient  on  the  measurement  is 


If  a  Gaussian  density  is  assumed,  then  any  antisymmetric  function  of  (0-6)  has  expected 
value  zero.  For  this  reason  the  data  dependence  drops  out  of  the  covariance  equation  and 
the  covariance  equation  runs  open  loop.  That  is,  the  covariance  is  not  responsive  to 
actual  errors. 

This  situation  can  perhaps  be  corrected  by  taking  another  measurement.  The  antisym¬ 
metric  nonlinearity  was  constructed  by  adding  weighted  correlations,  and  a  symmetric  non¬ 
linearity  may  be  constructed  by  the  same  technique.  In  Section  2.5  it  will  be  seen  that 
this  type  of  structure  will  follow  naturally  from  posing  the  radar  tracking  problem  dif¬ 
ferently.  If  multiplier  outputs  are  treated  as  measurements,  then  using  the  Gaussian 
approximation  procedure  will  result  in  a  mean  equation  just  like  the  one  here  but  a  sig¬ 
nificantly  different  covariance  equation.  The  difference  will  be  that  the  covariance 
equation  will  have  an  adaptive  term.  This  term  will  be  seen  to  drive  the  covariance,  so 
that  it  agrees  with  the  actual  tracking  error.  Thus  the  covariance  estimation  will  be 
closed  loop  instead  of  open  loop. 

2.5  Use  of  Gaussian  Approximation  for  Processing  Many  Correlator  Outputs 

The  next  problem  formulation  to  be  considered  is  the  one  wherein  the  output  of  each 
multiplier  is  itself  considered  a  measurement.  This  formulation  imposes  less  structure 
on  the  problem  that  the  last  one  did.  In  the  last  formulation  it  was  assumed  at  the  outset 
that  multiplier  outputs  were  to  be  weighted  and  summed  together.  The  filter  design  was 
thus  not  allowed  the  option  of  processing  each  multiplier  output  individually  and  then  com¬ 
bining  them  nonlinearly.  The  only  degree  of  freedom  the  filter  design  was  allowed  to 
resolve  was  the  shape  of  the  weighting  function.  The  approach  taken  there  can  be  justified 
since  it  is  parallel  to  the  approach  normally  taken  to  design  this  type  of  signal  process¬ 
ing  hardware,  and  since  the  problem  of  determining  an  optimal  measurement  nonlinearity  has 
application  elsewhere.  For  radar  signal  processing,  however,  it  will  be  seen  that  a  more 
complete  and  better  performing  design  can  be  achievdd  by  removing  some  of  the  constraints. 
In  Sections  2.6  and  2.7  less  constrained  formulations  shall  be  pursued. 

Recall  that  when  the  outputs  of  the  individual  multipliers  are  considered  as  measure¬ 
ments,  the  system  and  measurement  models  become 

0  =  n  (t)  (68) 


and 


zi(t)  »  h± (e  -  8)  +  v ( t) 


(69) 


where  the  measurement  zf  is  the  output  of  the  i^h  multiplier.  The  function  hf(x)  is  the 
code  autocorrelation  function  at  shift  x  -  i  as  graphed  in  Figure  14. 


Figure  14.  Output  of  ifc^  Measurement  Nonlinearity  Versus  Tracking  Error 

Kushner's  equation  can  be  used  to  find  the  mean  and  covariance  equation  for  this  formula¬ 
tion  just  as  it  was  for  the  last  one.  In  this  case  the  equations  become 


and 


(70) 


hA) 


(71) 


These  are  still  unsolvable,  but  Gaussian  approximation  can  be  used  to  arrive  at  approximate 
mean  and  covariance  equations  just  as  in  the  previous  section.  Applying  Gaussian  approxi¬ 
mation  to  this  problem  yields  the  mean  equation 


and  covariance  equation 


§  5>i*i 


(72) 


p  a  q  -  ?  [Lwi]2  +  [  £ °izi  -  c] 

In  these  equations  the  quantities  w^,  a^,  c  are  given  by  the  following: 


hl‘ vfeX”hil“>  “p{'^do 
"i  '  vfei""°Vu)  “p  {'  au 


z  V 


(73) 


(74) 


(75) 


(76) 


(77) 


The  mean  equation  in  this  case  is  somewhat  different  in  appearance  than  the  mean 
equation  derived  in  Section  2.4.  in  reality,  however,  the  two  equations  are  strikingly 
similar.  In  the  previous  section  the  individual  correlator  outputs  were  weighted  and 
summed  to  form  a  single  measurement  which  entered  the  mean  equation.  The  optimum  weight¬ 
ing  for  the  i^h  correlator  was  determined  to  within  a  multiplicative  function  of  p  to  be 
of  the  form  IT  exp  (-  (iT)J/2p).  The  mean  equation  in  this  case  is  also  driven  by  a 
weighted  sum  of  correlator  outputs.  If  p,  the  error  covariance,  is  large  compared  to  ?, 
then  the  weight  w^  is  approximately 

-■  ■  1  IT  OXP  j-  if- 


1 

The  filter  starting  with  a  loss  structured  formulation  has  chosen  to  estimate  the  con¬ 
ditional  mean  with  a  structure  quite  similar  to  the  one  imposed  during  the  formulation  of 
the  problem  in  the  last  section. 

The  covariance  equation  which  has  developed  from  the  less  structured  formulation  in 
this  section  is  different  than  the  covariance  equation  of  the  last  section.  The  important 
difference  is  that  measurements  enter  the  covariance  equation  in  this  case.  The  effect  of 
these  measurements  can  be  explained  intuitively  as  follows. 

The  measurements  are  weighted  and  summed,  and  the  result  used  to  drive  the  covariance 
equation.  Each  measurement  is  soma  nonlinear  function  of  the  error  iplua  noise)  and  the 
weighted  sum  is  also  a  nonlinear  function  of  the  error  (plus  noise).  Thus,  in  this  case 
the  error  affects  the  covariance  directly.  The  weight  applied  to  the  Ith  correlator 
output,  when  p  is  large,  is 


(UT}3 

yr*p 


p) 


exp 


{i  r>* 


This  function  is  graphed  below  in  Figure  16.  These  weights  are  applied  to  the  correlator 
outputs  and  the  weighted  outputs  summed.  Thus  some  function  of  the  tracking  error,  ha(e), 
enters  the  covariance  equation.  For  integer  argument  i  hstU  »  a,.  It  is  also  true  that 
when  the  error  covariance,  p,  la  large  compared  to  one,  the  function  hj(i)  is. the  same  as 
the  function  graphed  in  Pigure  15.  There  it,  then,  a  symmetric  function  of  the  tracking 
error  which  drives  the  covariance.  The  scalar  c  which  is  subtracted  on  the  right-hand 
aide  of  the  covariance  equation  (Eq.  73)  is  equal  to  the  expected  value  of  hjie).  Expected 
value  means  integration  againat  a  Gaussian  density  with  covariance  p.  The  predicted  covar¬ 
iance,  p*  then  influencea  the  value  of  c.  If  the  predicted  error  covariance  is  on  average 
equal  to  tha  actual  error  covariance,  then  hit*)  and  -c  cancel.  Mhen  these  terms  cancel, 
the  covariance  equation  ravorts  to  the  open  loop  equation  derived  in  the  laet  section. 

(then  these  two  terms  do  not  agree,  then  a  correction  term  is  introduced  to  the  covariance 
equation.  This  correction  drive*  the  covariance  so  es  to  bring  the  scalar  c,  the  expected 
value  of  hj(e),  and  tha  average  value  of  hjte)  into  coincidence.  The  coveriance  equation 
la  enclosed  in  s  feedback  loop  and  driven  to  match  the  actual  errors. 

Honte  Carlo  simulation  of  this  tracking  system  was  par formed.  The  conditions  for 
this  simulation  were  Identical  to  those  leading  to  the  results  of  the  last  section.  Two 
changes  have  occur rad  dus  to  tha  approach  used  in  this  section.  The  actual  errors  are 
smaller  and  the  predicted  errors  are  larger  than  those  shown  in  figure  13.  There  is  a 
good  match  between  the  actual  and  pradlctod  error*. 


The  approach  taken  in  this  section  has  been  successful.  It  has  yielded  the  qualita¬ 
tive  information  of  the  approach  in  the  last  section  and  has  added  a  feedback  term  to  the 
covariance  equation.  This  addition  has  resulted  in  improved  performance.  The  approach  is 
still  based  on  a  formulation  wherein  structure  is  imposed.  It  is  still  the  case  that 
correlation  of  the  received  code  with  codes  generated  in  the  receiver  is  assumed,  in  the 
next  section  this  structure  will  be  removed,  and  what  will  result  will  be  similar  to  the 
tracking  loop  derived  here  but  will  show  improved  performance. 


2.6  Application  of  Gaussian  Approximation  to  Estimation  of  PN-Code  Delay 

In  the  third  formulation  of  the  radar  tracking  problem  the  delayed  PN  code  waveform 
is  taken  to  be  the  measurement.  This  approach  is  the  least  structured  of  all  those  taken 
here.  In  addition  a  more  general  model  for  the  delay  process  (target  motion)  will  be 
assumed.  It  will  be  assumed  that  the  delay  is  the  first  element  in  a  vector  whose  ele¬ 
ments  are  necessary  to  describe  the  target's  motion. 


Let  Xj(t)  denote  the  varying  PN-code  delay  due  to  target  motion  and  x(t)  equal  the 
n-dimensional  state  vector  whose  first  component  equals  Xj(t).  We  interpret  the  above 
system  as  the  two  equations: 


x(t)  =  Fx  (t)  +  Gn  (t) 
r(t)  =  h(x(t) ,t)  +  v(t) 


(78) 


where 


F  *  the  state  feedback  matrix,  possibly  time  varying 
d  *  the  input  matrix,  possibly  time  varying 

n(t)  -  scalar  white  Gaussian  noise  (WGN)  of  spectral  height  Q  which  models  the  unknown 
target  dynamics 


h(x(t),t)  ■  S(t-xj(t))  -  the  received  code  (79) 

v(t)  a  tne  received  noise  (WGN)  of  spectral  height  r(t),  and 
S(t)  =  the  reference  PN-code  waveform  i* 1-valued) 


The  extended  Kalman  filter  cannot  be  applied  to  this  system  because  the  measurement  nonlin¬ 
earity  cannot  be  linearised  about  a  reference  trajectory.  Attempting  to  linearise 
h(x (t) , t)  around  a  reference  trajectory  x^tt)  fails  because 


»h(x(t),t) 
. . ix 
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m  -111 
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3*n,  x-Xj. 


but 
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S(t  -  xj) 


-S  '  ( t  *>  x  j ) 


(81) 


and  S*(t)  is  very  badly  behaved  because  of  the  switching  discontinuities  in  the  code. 
Extended  Kalman  filtering  is  hcuce  ruled  out  at  the  start  because  of  the  sharpness  of  the 
nonlinearities  in  the  observation  aquation.  We  shall  Have  to  resort  to  global  approxima¬ 
tion  of  the  exact  nonlinear  filtering  equations. 


Let  x It)  denote  the  conditional  mean  of  x(t)  given  the  measurements,  ar.d  we  have  by 
daawinakl  ill  formulas  for  the  evolution  of  the  first  and  second  moments  of  the  conditional 
probability  density  function  of  x(t)i 


x(t)  •  rx(t)  ♦  ~  j^ItTh  -  x(t)hj^*(t)  -bj 


<«) 


*  (mt)  *  GQC7]^  ‘  7  -pjulh*-  -  x^CUb] 

^ _ ^ («2) 

♦  |  jf*4  *  -  x^  -  t*i  -  XjHkj  *Th^}Jh  [*ttJ  *  **] 

these  equations  are  unsolo.eble,  but  a  practical  processing  algorithm  een  be  dt^aioped  using 
Gaussian  approximation.  After  applying  Gaussian  approximation,  caveral  simplifications 
can  be  made  to  the  resulting  equations, 

The  first  rimplif icaiioA'that  can  be  performed  on  the*6  equations  is  the  elimination 
of  the  subtractive  portion  ( -h)  of  tha  innovations  process  UCt)  -  h)  in  the  mesh  equation. 
ThU  follows  frattWo  hypotheses!  U)  the  eytstafcry  properties  of  tin*  Joint  Gaussian  dis¬ 
tribution  end  ( J)  the  assumption  oi  high  code  frequency  and  lowpass  processor.  Explicitly , 
wa  assume  "instantaneous  averaging*  of  tha  product  of  two  codes  by  the  loupes*  filter t 


S(t  -  x)S(t)  »  R(x) 


(83) 


From  Eg.  (81)  the  term  under  examination  is 

-  i  [xttTh  -  x(t)h]h 


(84) 


which  can  be  rewritten 


-  j  [(x(t)  -  x(t))h(x(t),t)J  h(x(t),t).  (85) 

Using  Gaussian  approximation  on  Eq.  (85)  and  merging  the  multiplied  expectations  yields 

_  means  is  non 

Eqa.  (7?)  and  (83)  yields  the  useful  identity 

h(X,  t)h(Y, t)  »  R(X j  -  Y2) 


:(t))h(X,t)h(Y,t)Nx(x(t) ,P(t))Ny(x(t),P(t) JdXdY 


(86) 


where  Njj(x,P)  means  X  is  normally  distributed  with  mean  &  and  covariance  P.  Employing 
».  (79! "  .  . 


Thus,  Eg.  (86)  becomes 

/•  0> 


7“  JLjL  (X-x(t)JR(Xi  -  Yi)Nx(x(t).P(t))Nv(x(t).P(t))didX 

Define  the  error  vector  e(t)  and  associated  dummy  variables  as  such 
e (t)  -  x(t)  -x(t)  i  V  «  x-x(t)  j  W  »  Y-x(t) 


(87) 


(88) 


(87) 


and  tranalate  Eq.  (88)  from  (X,  Y)  to  (V,  W)  coordinates  to  get  an  expression  for  the  ith 
component  of  Eq.  (88) 


(90) 


tLL'  jRtVj  -  MiJNytO.Pttmyo.MtndVdW 


This  expression  equals  xoro  for  all  1  because  the  function  Vj  ia  odd  in  the  argument 
(V,  W) ,  while  R,  Hy,  and  t»w  are  even  in  the  same  argument.  Donee,  the  product  is  odd,  and 
the  Integral  is  *«$>. 

The  second  simplification  occurs  in  the  covariance  equation.  First,  the  subtractive 
portion  of  the  third  term  of  Eq.  (82)  is  isolated. 


-  “  (***<*>  -  ^(t))  (x^tt)  -x^t))  -  (x^t)  -  i^ftil  tx^t)  *T^*(T) )  ]h(x  (t) ,  t)  hlx(t) ,  t)  (91) 

Employing  the  Gaussian. assumption  and  instantaneous  averaging,  we  get  that  Eq.  (91),  after 
translation,  equals 


’Pijtt>]R(V»  -  WllKv(0,P(t))Nv(O,P(tndydM 
Secondly,  the  second  term  of  Eq.  (82)  can  be  written 

-  -  [ ( x^Ttr-^TtmOxT^  ]  [ ( (u7h(7(t)7t )  j 


(92) 


(93) 


Again,  combining  integrals  under  the  Gaussian  assumption,  using  instantaneous  averaging, 
and  translating  the  integration  domain  yields  for  Eq.  (93) 


V^R ( Vl  -  W i )  Hy  (O, X t) )  N*  (0,  P ( t) ) dS<?M 


(94) 


The  contention  ie  that  terms  [fiqs.  (92)  and  (94)]  sum  to  t«ro.  The  calculation  of  these 
two  integrals  is  detailed  in  Appendix  1.  they  are 


E,.  (M,  .  .  1  ('„>„/>£)  fiZ/.far  -  i] 

».i  - ♦  i  (f11pjl/»'i) [«.p  (ri^)- 1] 

which  negate  each  other,  making  receiver  implementation  much  simpler. 

the  remaining  integral  terms  o£  Eqs.  <811  and  <32;  in  terms  of  the  translated 
variable  V  =  X  -  *(t>,  we  get  the  final  result  for  the  receiver  translated 


(95) 


(96) 


x  (t) 


Fx(t)  + 


i  r 

x1(t)  -  v1)dv1E(t) 

•'•CD  ** 


(97) 


dP  (t) 
3t - 


^FP(t)  +  P(t)FT  +  GOG^^  +  i  J 


vivj  "  pij]Nv(£'Pit))s<t-3«1-vl)c:vJa(t)  (98) 

The  receiver  has  the  following  notable  features’  m\  t-h*  f irct-  . 

arts  ss's^rsE?'  %  “• 

s(in-^ui'.T)''e(.fibT1'  B°ih  rplov 

bit  Xi(t)  V i )  #  (3)  because  of  the  use  of  the  centered  variAbli*  PfM  \r\  ». 

E<"  1981  <‘rlv"  1Uel<  “,1  *1“  dtl''“  *>•  »’)  ttaoS 


by 

by 


The  correspondence  between  this  processor  and  those  given  earlier  can  be  illumim‘f.1 
y  assuming  x  *  x,  and  that  ?,0  i  O  in  [fiqa.  (97)  and  (96) J.  The  receiver  is“^ifiS 


Xj  (t) 
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( r>j  iler)s( 


t-Xj  't)  -  Vl)dVis(t) 


\f  fi*P  /—--Hi  ■■■■)  «(t  -Xt(U  -  V|)dV|itt) 
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199) 


(100) 


These  equations  can  be  interpreted  in  two  ways.  (1)  Th®  Integrals  in  So-  f9«i  «<«*  (,nm 

•a.raiuKu*,M4lSii!u:i£r?r^b#  unw5  K^^cSS^ 

o,®  1,  *  (t)  ahould  be  sent  and  subsequently  multiplied  by  *(t)  The  covari- 

f£*  iniiirii  \£°  ^  t™4ted  48  0  •*<"**  varying  7llt«paraiaetir.  (2)  Moving  RtRInalde 

fir  ill  u  <HJufUon*  instruct  us  to  form  all  correlations.  3<t-x?(t)  MMslt)  ld 

apprx  Lti  jh* ,  Utl^loS  c^oS 

ssrs:  sniss!  *ss  Sis’ 

Approximating  &js.  (i>8)  and  (100)  by  discretisation  o?  V  ,  vo  get 
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Correlator  outputs 
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21C  - '  ^  4*d* i  5S»  ,i2*«i,2S5.2!#«iJ  phyi^i1  ia.1i  L- 

CVuoS*  ^  :?•• and  that  th* 

provided^th*  values  a-t  th4  wwo  4*  *h°8*  derived  by  [feowles  (8,  9)] 

derived  to  drive  the  detector  vidth^2VP^ftt)  tt°«akifto  ft  ‘"a  °*  W4i,,ht*  has  b*eft 

S(t)  should  produce  high  correlation *with  ndL  ihter«retabi|_ae  such ,  if 


Pj, it}  is  increased  to  guard  against  losing  lock.  This  data  dependency  helps  guard 
against  modeling  errors  and  the  inaccuracies  o£  the  Gaussian  assumption. 


Figure  15.  Weighting  Pattern  in  Figure  16.  Weighting  Pattern  in 

Kean  Equation  Covariance  Equation 

A  useful  quality  of  this  processor  is  that  it  is  easily  extensible  to  higher- 
dimension  state  variables.  The  two  weighting-patterns  derived  are  the  only  two  necessary 
regardless  of  the  dynamics  order.  This  is  seen  from  Eq.  (97)  and  (98).  The  ifch  component 
of  the  second  term  of  Eq.  (97 j  equals's 
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Vi  Ky  (0,P(t))S(t  -  x1(t)  -  V^)  dVt  a  (t) 


(101) 


Eliminating  all  integrations  except  those  with  respect  to  \\  and  V,,  and  factoring  the 
(ointly-normal  density  of  Vj  and  V;  into  a  conditional  times  a  marginal  density, 

Lq.  (103)  becomes; 


1  j  Ny  (0,Pu<tW  5  (t 


x^  (t) 


«vl  *{t) 

(104) 


The  bracketed  expression  equals  the  conditional  moan  of  given  V,,  which  is  pilvl/pn* 
hence,  Eq,  (104)  becomes;  11 
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This  tells  us  th..t  xt  (tt  is  to  be  driven  by  the  output  of  the  corresponding  expression  in 
the  Xjft)  equation,  scaled  by  (t)/?^  <t) . 

Likewise,  the  second  term  of  (93)  is  computable  as; 


l  K 


hM  \ 


(0,  Pa(t»S  (t  -  x4(t)  -  Vl)  dV\  aft) 


U9«) 


which  specifies  a  scaling  factor  of  P,,P*i/P*a  *0  be  applied  to  the  t>, ,  Ct.)  equation's 
output  to  drive  the  corresponding  term1©!  the  equation.  * 


2.6  Discrete-Time  Point-Mass  Approximation  Method  for  PD!  Representation 


s)  The  ^Point-Mass  Approximation  ihs  will  v.r'Vern  ourselves  with  an  approxi¬ 
mate  method  o T  solving  for  the  eWitT-re.ai  probabn  .  •  -  «kn»ity  function  (Pi)1')  of  discrete- 
time  systems.  Here,  a  useful  tcuhr. tque  is  to  pop*:".-'  the  POP  only  st  o  set  of  points 


(grid)  which  support  most  of  the  PDF.  To  begin  with,  we  specify  the  system  and  its 
exact  recursive  solution.  The  system  is: 


where: 


x(t+l)  =  i(x(t),t)  +  g(x  (t) ,  t)n (t)  (107; 

z(t)  =  h  (x(t) , t)  +  v(t)  (108) 

x(0!  =  C  (105) 

x(t)  is  an  n-dimensional  state-vector; 
z(t)  i3  a  p-dimensional  observation  vector; 

n(t)  is  a  zero-mean  m-dimensional  WGN  sequence  of  covariance  matrix  Q ( t ) ; 
v(t)  is  a  zero-mean  p-dimensional  WGN  sequence  of  covariance  matrix  R(t); 
g  is  a  function  which  is  matrix-valued  (n  x  m' 

£  and  ji  are  nonlinear  functions; 

£  is  a  zero-mean  Gaussian  random  vector  of  covariance  matrix  P ( 0 | — 1 >  and  mean 
5c(0|-l)  which  provide  the  initial  PDF  for  the  filter 

Letting  Pt/T  (X>  denote  the  probability  density  of  x(t)  given  the  observations 
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(-2(0)  ,  2(1)  ,  .  .  .  ,  2  (t)  }  , 


(110) 


it  is  necessary  for  the  filter  to  produce  two  functions: 

(1)  The  filtered  PDF  of  the  state  x(t); 

(2)  The  predicted  PDF  Pt+jyt <X^  of  the  state  x(t+l) . 

Once  the  PDF  has  been  constructed,  any  type  of  estimate  £(t/t)  or  A(t+l/t)  can  be 
generated,  such  as: 

(1)  fc(t/t)  =J  IPt/t(l)dI  “  conditional  mean,  for  a  minimum-variance  approach; 

(2)  &(t/t)  =  j  where  Pt/fc(l)  -  Pt//t(8)  for  x  /  8  for  a  maximum  likelihood  estimate 

The  filtering  equation  is  simply  an  application  of  Bayes1  law.  Assuming  that 
Pt^t_1<X)  is  available,  and  that  z(t)  has  just  been  observed,  then 


t/t 


(X) 


_pr[ 

z(t)  1  x ( t )  »  x] 

Pt/t-l^ 

r  Pr 

[s ( t)  | x { t )  «=  x 

]Pt/t-l(I)dl 

(111) 


But  by  the  given  normal  distribution  of  v (t) ,  which  wo  will  abbreviate  as 
N  (0,R(t) ) ,  we.  can  deduce  that 


Pr[z(t)  |x(t)  “  X  j  “  N(z(t)  -  h(x#t)  ,  R  ( t )) 
thus  the  filtered  update  is  recursively  given  by: 


t/t 


(X) 


N (z (t)  -  h(x>t),  R(t) (l> 


f  N(z(t)  -  h(x,t),  R (t) )  Pt/t_1(X>dX 

To  derive  the  predicted  PDF,  Pt+l/t^X)'  in  terras  of  Pt/t  (l>  >  one  must  weigh  a 
the  possible  ways  that  x(h+l)  can  equal  If  we  let  Tt+x^t  (ll8)  denote  the 

Pt+l/t  W  "  /  ,  Tttl/t  8->  Pt/t  <£> 


(112) 


(113) 


probability,  i.e.,  the  probability  that  x(t+l)  ■  x  given  tnat  xjt)  ■  6,  then 


and  sum  all 
transition 


(114) 


The  function  T  is  easily  ascertained  by  knowledge  of  the  distribution  of  n (t) .  Then  we 
have: 


Tt+i/t{i|£>  =  N(i“  '  2(£»t><Ht)aT(£ft)) 


(115) 


Hence  the  predicted  density  is  recursively  given  by: 


?t+l/t(I> 


f(6,t) 


g(8,t)Q<t)£T(8,t))  Pt/t(B)de. 


(116) 


Equations  (113)  and  (1)6)  allow  recursive  propagation  of  the  conditional  PDF.  However, 
for  general  f,  g,  and  ft,  it  is  rare  that  closed- forr.i  solutions  will  be  available.  Hence, 
Bucy  [16]  and  Bucy  and  Senne  [111  developed  the  point-mass  PDF  representation. 


The  point-mass  representation  approximates  Pt/t  and 


Pt+jyt  by  a  set  of  impulses: 


2N+1 

Pt/.  ^  =  p^/.  »•  •  •  »®n))  «(i  “  b (ii , . . . ,ia)j 

“"‘’'•••'Vi 


where  b(4j,...,in)  is  a  grid,  taking  values  in  #n.  The  simpler  scheme  is  to  keep  the 
grid  fixed  in  time.  A  more  advanced  scheme,  developed  by  Bucy  [10],  is  to  translate  the 
grid  to  maintain  its  center  on  the  conditional  mean  and  rotate  the  grid  to  align  its 
axes  with  the  principle  axes  of  the  error  ellipsoid.  The  simpler  version  turns  the 
filtering  and  prediction  equations  of  (113)  and  (116)  into  respectively: 

-  h(b(t, . In),t)  ,R(t>)  pt/t^1(bU in)) 

r  ..  -  "■  - - - - - - - - - — - - - - - - - -  / 

N  |jit)-h  (b(m, , . . .  ,ma)  ,t) ,  R(t)  )pt/t-i(£(“ . V) 

(117) 

and 

2N+1  . 

Pt+1/t(j>('u....in))  ■  y  N fb(i y  -  f(b(B» . n^>.t)  ,  (118) 

m,f  .rr^a^-l 

2(fe(ml,...fmn)  ,tK)(fc)2T(b(m1 . mn)  ,t>)  Pt/t(fe(*V ) 


pt/tl^Ul . 


N(sCt) 
^+1  — 1 ' 

mt  ,S,m  »1 


Equation  (U8)  may,  however,  require  a  slight  normalisation  to  compensate  for  numerical 
innaecurac)***  which  cause  the  total  probability  mass  to  doviate  from  1.  Alternatively, 
equations  (117)  and  (118)  may  be  combined  and  a  single  renormalization  performed  to 
account  for  the  denominator  of  (117)  and  numerical  innaccuracios . 

Thin  basic  point-mass  solution  is  not  immediately  useful  to  the  problem  of  spread- 
spectrum  ranging  because  that  problem  is  naturally  continuous-time.  However,  a  close 
variant  of  the  Bucy  point-mass  technique  can  be  used  to  solvo  this  system  by  approxi¬ 
mating  the  exact  continuous- time  solution  (Kushner's  Equation)  by  point  masses.  Further¬ 
more,  the  point-mass  solution  is  very  natural  for  PN-code  tracking  because  of  the 
discrote-tiire  nature  of  the  code. 


Tracking  the  conditional  mean  x(t)  requires  computing  the  pdf  of  x(t),  p  .  This 
further  requires  an  on-line  computation  of  KuBhner's  Equation,  from  which  Equations  (81) 
and  (82)  sten.  The  full  PDF-propagation  equation  is: 


»Pxt*,t|£) 


at 


-  MpxJ  ♦  p  Px(X,t|z£)  £h(X,t)  -  z(t)  -  hj  (119) 


where  L  is  the  Fokl ^r-Planck  operator  for  our  dynamics  equation,  (78),  whioh  is  given  by: 

: 


A  3P„  |  i  .a  A  _  »  Px 

^  riipx  +  Fi*  +  T  2^  ^  sxptj 


(120) 


For: 


F^  =  1th  row  of  Fj 
F^j  =  (i, j)^  element  of  F;  and 
=  i^1  element  of  g. 

Implementing  this  equation  directly  is  impossible  because  of  the  continuous-valued  domain 
over  which  Px  is  defined,  thus  requiring  infinite  computer  memory.  This  problem  can  be 
treated  by  Buoy’s  method  of  sampling  px  at  a  convenient  spacing,  and  propagating  px  only 
at  the  chosen  grid  values  of  X.  Furthermore,  this  is  natural  for  our  system  in  particu¬ 
lar  because  of  the  code  waveforms,  wherein  a  good  spacing  for  the  X^grid  is  1  (chip) . 

Due  to  computational  overhead  of  implementing  higher-order  dynamics-processes,  I 
shall  limit  the  discussion  to  the  first-order  process  specified  by  (t)  =  5(t).  Then 
the  discretized  PDF  is  supported  only  by  the  natural  unit-spaced  grid  selected  for  the 
xi-axis.  The  following  development  is  extensible  to  any  n-dimensional  process  with  its 
respective  n-dimensional  grid  support.  However,  two  important  considerations  must  be 
borne  in  mind: 


(1)  For  the  derivative  of  xs (t) ,  e.g.,  x2  =  Xi  =  velocity  (chips/sec.),  there  is 
no  natural  grid-spacing  as  for  x2. 

(2)  Implementing  an  order-n  process  estimator  will  generally  result  in  a  different 
detector  weighting-pattern  for  each  component  of  the  state-variable  update. 
This  results  from  the  fact  that  the  marginal  and  conditional  densities  of  an 
arbitrary  joint  density  are  not  necessarily  of  the  same  family,  as  was  with 
the  jointly  Gaussian  density. 


Kushner's  Equation  exhibits  the  same  dynamics/predictor,  measurement/update  type  of 
structure,  as  evinced  by  the  first  and  second  terms  of  Equation  (119).  Equation  (119) 
for  our  first-order  case  becomes: 


3pyi(X,,t|Z^) 

3t 


i 

3  p 
3X  i 


XI 

T* 


r  - 

A" 

|h(Xi,t)  -  Mxi  (t),t) 

z(t)  -  h 

(121) 


Discretization  of  the  dynamics  term  is  best  treated  by  approximating  the  second  deriva¬ 
tive  by  a  three-point  method.  Letting  denote  the  integer-separated  values  of  X(  that 
will  support  the  sampled  PDF,  we  may  substitute 


JQ[Px,(r:  +  1, 


(122) 


for  the  second  derivative  of  p  at  r  t . 

*  1 

The  observation  term  of  Eq.  (121)  is  best  handled  by  deriving  an  equivalent  term 
that  is  directly  obtained  from  discrete-time  and  discrete  PDF-space  ( r i )  probabilistic 
considerations.  Handling  this  term  directly  in  discrete  time  is  justified  by  practical 
considerations.  Given  the  continuous -time  formulation  of  Eq.  (121),  a  receiver  would 
normally  only  approximate  that  calculation  by  discretizing  all  functions  and  derivatives 
in  time.  (This  is  simply  because  analog  processing  of  the  signals  would  be  technically 
too  difficult.)  Hence,  the  associated  approximations  would  be  possible  sources  of 
inaccuracy.  A  rederivation  of  the  observation  term  of  Eq.  (121)  proved  to  be  simpler 
and  more  robust  to  point-mass  approximations  than  direct  discretization  of  the  same, 
under  aomputer  simulation. 

Letting  At  denote  the  sampling  period  of  the  receiver,  the  correlations,  S(t-ri)z(t), 
will  be  discretized  in  time  and  indexed  by  the  integer  n,  for  n«*l,  2,  3,...  Hence,  our 
observed  information  is  a  sequence  of  (2Ne  ■*  1) -dimensional  random  vectors  parameterized 
by  discrete-time  index  n: 


v(n) 


v_N  (n)' 


vi  ( n) 

V+N  ^n\ 

e 


for  n  =■  1,2,3, , 


(123) 


where: 


v,  (n)  *  f  *(t)s(t  -(&j  ( (n-1)  At)  +  i^di 

1  (n  l)At  '  * 


(124) 


N  is  the  number  of  local-code  phases  ahead  and  behind  the  on-time  code  that  are 
cSrrelated  with  z(t).  The  estimate  is  also  a  discrete-time  sequence;  At  is  assumed 
to  be  small  enough  so  that  quantization  error  is  acceptable.  Notice  that  we  will  let 
the  Ti-grid  move  rigidly  as  the  estimate  (nAt)  moves /  so  that  the  center  value  of  Fi 
equals  (nAt) : 


r,  =  (nAt)  +  i  ,  i  =  -Ne,-Ne+1,...,+Ne 


(125) 


This  vector  sequence  of  observations  allows  us  to  propagate  the  sampled  PDF  through 
Bayes'  I.aw.  Specifically,  assume  we  have  generated  pXJ(rj,  (n-1) At|z for 
F  i  =  ((n-1)  At)  -  N  to  ^((n-DAt)  +  N  in  unit  steps,  and  that: 


( (n-1) At) 


S^p  (r,,(n-l)At|z<n"I)At) 

Fi  Xl  0 


(126a) 


Then  we  must  generate  Pxi(Fi,  nAt|z”At 

(nAt)  = 


)  for  the  same  Flf 
(r !  ,nAt | Z 

F  i  1 


and  compute  the  new  estimate 
"At)  .  (126b) 


Lastly,  the  set  of  points,  px  (ri,nAt,  z£  ),  for  the  current  Tj-grid  (which  is  centered 
on  ((n-1) At))  must  be  interpolated  to  correspond  to  the  values  on  the  new  Fj-grid 
centered  on  the  new  estimate  xi (nAt) . 

To  begin  this  task,  we  must  first  apply  the  Fokker-Planck  operator  of  Eq.  (122)  to 
get  the  predicted  PDF  values; 


(r, 


nAt|  Z 


(n-1)  At) 
o 


JS 


PX1 (T;, (n-l)At|Z^n-1)At)  +  |  QAt 
-  2pXi(F1,(n-l)At|Z(Jn_1)At)  +  pXi 


Pxi(F1+l,(n-l)At|z^n‘1)At) 

1  (127) 

(Ti-1, (n-l)At|z*n_1)At* 

O  j 


for  all  Fi  in  the  current  grid.  The  incoming  vector  v(n)  then  allows  us  to  update  these 
values  to  pXj(Fi,  nAt|z£ht)  by  Bayes  Law: 


Pxi(rlfnAtiz”At) 


Pr 


|y<n)|x,(nAt)  =*  r  ,  At]  (r ,  ,nAt  |  Z^”"1*  At) 


^  ^[numerator] 


(128) 


Because  Xi(t)  and  n(t)  are  Markov  processes  and  z(t)  depends  in  a  memoryless  fashion  upon 
them,  we  have: 


Pr[v(n)  |x,(nAt)  -  r,,  Z(Jn_1)AtJ  =  Pr[v(n)  |  x,  (nAt)  ■  Tjj  (12?) 

This  likelihood  function  is  easily  computed  by  noting  that  the  components  of  v(n)  are 
distributed  as  such: 


v^  (n)  |  x j  =  Tj  v  N  (CAt ,  rAt)  for  i  =  T;  -  ( (n-1)  At)  ; 

(130) 


v^(n) | x i  =  r i  v  N(0,  m At)  for  all  other  1 


All  Vf(n)  are  independent  since  the  correlations  of  white  noise  with  the  orthogonal 
local-code  phases  produce  independent  random  variables.  The  single-chip  grid-spacing 
is  very  important  in  this  respect  for  computational  simplicity.  Thus,  the  likelihood 
function  is  given  by: 


(131) 


Pr[v(n) |xi (nAt)  =  I'll  *>  — ~=.exp 

t  3  s/zrAtr 


"(vr,-<1(n)  ~  At)  ] 


2atr 


x  n 

i=“Ne 


/2^6tr 


exp 


-(vi(n)) 

2Atr 


where  means  #i((n-l)At).  Eq.  (131)  equals: 

+N 

.  +1  I 
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1  \2N„+1 


^v/2ivAtr  / 


exp 


£. 


-vi  (n) 


2Atr 


exp 


2vrt-&i ^ At  ”  (At) 


2Atr 


(132) 


Using  Eq.  (132)  in  Eq.  (125)  yields  the  measurement-update  formula: 


PxI(r-nAtlZoAt'  =  N 


exP  [r  vr  (n)]  px,(r‘> 


nAtjz 


5^  exp|"r  vi  (n)  PXl(A> 


+  i,  nAtjz 


(n-1) At 


) 


i=-N 


(133) 


where  ki  means  $t((n-l)At)  and  Fi  =  #i  +  i.  A  simple  way  to  implement  Eq.  (133)  is  to 
scale  the  predicted  probabilities  px  (Ti,  nAt | Z^n-l) at)  by  the  factors  exp  (Vrs_£  (n)/r) 
for  each  1^.  The  scaled  samples  should  then  be  normalized  for  unit  sum,  and  Eq.  (126b) 
employed  to  generate  the  new  estimate.  Lastly,  the  grid  should  be  shifted  to  the  new 
values. 


i  »  r,  -  Si(nAt),  i  =  -Nq,  -Ne+1,  ....  +Nq  ,  (13*) 


An  initial  PDF  assumption  starts  this  recursive  estimator/tracker. 

Formulas  Eq.  (127)  and  (133),  which  specify  the  receiver  for  the  Brownian-motion 
case,  are  comparable  in  computer  overhead  to  the  DDAG  detector.  These  equations  dictate 
the  minimum-variance  estimator  for  our  state-space  system  with  only  the  approximations  of 
finite  PDF  grid  and  discrete  time.  Notice,  however,  that  the  grid-spacing  was  crucial 
to  getting  the  measurement  update  formula  for  the  Bayesian  Detector,  whereas  in  the  DDAG 
detector,  smaller  spacing  only  implies  sampling  the  weighting-patterns  at  more  points. 

CONCLUSION 

The  point-mass  approximation  is  probably  most  useful  where  computing  power  is 
available  and  where  the  extended  Kalman  Filter  is  known  to  fail  due  to  poor  linearizabil- 
ity  of  f  or  h.  other  global  approximations  are  available  which  require  less  computation 
but  which  are  not  quite  as  general  as  point-mass  and  have  their  own  associated  problems. 
The  most  popular  are: 

(1)  Gaussian  Sums:  derived  by  Alspach  and  Sorenson  [12],  this  method  is  good  for 
multimodal  PDF's.  It  consists  of  representing  the  PDF  as  a  stun  of  weighted 
Gaussian  distributions  (a  non-orthogonal  series)  and  propagating  the 
weights  essentially  as  a  Kalman  filter  would. 

(2)  Other  orthogonal  series  expansions:  these  include  Edgeworth  expansion  [13], 
Gram  Chalier  series  [14],  Gauss-Hermite  Polynomials  [15],  and  Least-Squares 
Polynomial  Approximations  [16].  Generally  two  problems  occur  with  these 
methods:  (1)  truncation  of  the  series  may  result  in  some  points  of  the  PDF 
being  negative, (2)  truncation  may  result  in  an  unnormalized  PDF. 

Due  to  the  nature  of  the  code-tracking  problem  (unimodality  and  the  discrete-time 
nature  of  codes) ,  the  point-mass  approximation  was  a  natural  choice  for  global  nonlinear 
estimation. 
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APPENDIX  I 


First,  the  computation  of 


OD  00 

J  J  ViW1R(V1“w1)  Nv  N  (0,Pu(t))  dvx  dwx 

-00  —  OO  A  1  W1  * 


(1.1) 


will  be  detailed.  Inserting  the  multivariate  Gaussian  density  function,  this  becomes: 

2  2 

L _ _  /•"  /•”  /-V'"  ’  “ 

-  /  /  V  Id  B/I7  -Id  \  oun  I - 

2itP 


-1  /•"  /•"  /-(V?  +  wl>  \ 

TT)  /  /  viwiR<vrwi}  6XP  (-2PnTtr )  dvidwi 


Consider  a  rotation  of  the  (V^  w^)  axes  by  45°,  specified  by: 


i  M  P:l  l;:h;  M  Pi 


Then,  we  can  write: 


(1.2) 


(1.3) 


V2i  +  W2  .  f2  +  g2  and  R(V1~W1)  =»  R(  \f2  f)  (1.4) 

and  ^  (g  +  f)  (g  -  f). 

The  function  R(  s/2£)  is  given  by  1  -  >/2|f|  for  jf|  <  1/  >/2  ,  and  0  otherwise.  Hence  the 
full  region  of  integration  is  a  vertical  strip  ^2  wide  in  the  fg-plano  (Fig.  1,1),  Sinco 
our  three  functions  in  Kqn.  (1.4)  are  even,  we  may  limit  the  integration  to  the  shaded 
area  indicated  in  Fig.  (1.1),  and  multiply  by  4.  Thus,  Eqn.  (1.2)  becomes: 


lipjTm  fAf  «*  la  *  'V* '  fl  nwa  u  aa,.  <«.« 


which  becomes 


'-!l  /’  ^  -i1  >-r>  ,h  *a,  c.5) 


o  o 


a  sum  of  four  Integrals  over  rectangular  supports,  and  separable.  Computation  of  the 
integrals  and  addition  of  the  results  yields  the  final  value  of: 


P11  (t> 


|  exp 


(1.7) 


as  the  value  of  (1.1). 
Next,  the  value  of 


f  j  (V1  -  *UU>}  R  (W  SV1<0'  J>li<t,)  NM1(0*  (1.8) 


will  be  shown  to  exactly  cancel  (1.1).  This  insures  that  a  receiver  with  a  or.e-dimenaional 
state  vector  can  safely  drop  the  middle  ten  in  the  variance  equation.  Expression 
(1.8)  equals: 


wgm  /*  f‘  (vi  *  pu(t,l  *  {vi‘l'i)  8*p(-^7trr)dvidHi 


(1.9) 


Again,  we  sh«rl  aliffl  our  nsw  coordinate  system  with  (1.3)  end  integrate  over  the  region 
indicated  in  Fig.  (1.1).  Using 


vl2  “  Pll(t>  =  y  (9  +  f)2  -  Pll^>  >  R<V1  -  wx)  =  1  -  n/2  f  s 


(I. 10) 


2  2  2  2 
and  =  r  +  gz 


yields  separable  integrals,  which  all  sum  out  to: 


(ip^Tty)  “  1  ' 


which  is  the  desired  result. 

To  generalize  this  computation  to  the  (i,  j)th  component  of  the  covariance  matrix 
requires  first  dealing  with  the  term: 


viwjR(vrwi)  Nv  p(t))  Nw  p(t))  dzd” 


(1.12) 


Eliminating  all  variables  of  integration  except  for  V, ,  V. ,  W, ,  and  W.  allows  us  to  write 
this  as:  J 


•  L  L  L  L  vi“jR(vi"wi| 

Vwj\ 


P11  pli 
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(1.13) 


P11  Plj 

Pil 


jdV1dVidW1dW^ 


Re-arranging  the  remaining  integrations: 


R(V^-W^) 


Wj  Nwrw 


X 


(X.14) 


The  bracketed  integrals  are  computed  using  the  properties  of  jointly  Gaussian  density 
function  and  recognizing  conditional  means  and  covariances.  The  two  integrals  are 
respectively  computed  to  be: 


T~vl  \  <0'  pll>  »  1^wl  \  <0'  pii^>' 


which  turns  (X.14)  into: 


J fm  r(v1-w1;nVi  <o,  fn(t))»Wi  (o,  *u(t>)  dVldWl 


This  is  the  same  an  expression  (1.1)  scaled  by 
A  similar  computation  shows  that 


(ViVj-Pij)  K  (Vj^-Wj)  Nv  (O.P(t)>  Nw  <0,P{t>)  dVdW 


(1.15) 


(1.16) 


(1.17) 


equals  (X.8)  scald  by  the  same  factor.  Thus,  as  (l.l)  and  (1.8)  add  to  zero.  so  do 
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SUMMARY 
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tion,  and  data  procesaing.- 

™»nf-ifieation  technology  it  particularly  useful  for  the  determination  of  nonlinear  aerodynamic 
Identification  tecnnoiogy  p  ,  of  -ttacv  The  method*  outlined  here  (equation  error, 

fighter  aircraft  high  angle  of  attack  aerodynamics  illuatratea  the  technology. 

NOMENCLATURE 


Symbol 

Symbol  Definition 

Units 

A* 

Axial  acceleration 

ft/ sec2 

Ay 

Lateral  acceleration 

ft/sec2 

A* 

Vertical  acceleration 

ft/sec2 

>>v 

Uing  »P*n 

ft 

CW 

Wing  mean  aerodynamic 

ft 

cD 

Drag  coefficient 

— 

Cl 

Lift  coefficient 

ci 

Rolling  moment  coeff. 

JC./st  ) 

— 

tt(  ) 

t 

C. 

Pitching  moment  eeeff. 

°m(  ) 

»v»<  > 

cn 

Yawing  moment  coeff. 

c«<  ) 

JC„/i(  ) 

•*#» 

Cy 

Side  force  coeff. 

t  (  ) 

Expected  value  of  <  ) 

Met  throat 

tbw 

f 

State  dynamic*  function 

-» 

8 

Gravity  (JJ.J) 

fe/sec 

*P 

Preeeur*  altitude 

ft 

J 

Matrix  of  partial  derivatives  of 
measurements  with  teepee'  *o 
pa rime ter a 

St 

Parameter  estimation  performanct 
index 

M 

Fiaher  information  matrix 

-- 

N 

Mach  number 

B 

Humbar  of  data  poiota 

-- 

Symbol 

P 

P 

P 

Q 

Q 

8 

8 

t 


Symbol  Definition 

*V* 

Roll  rste 

Number  of  parameter*  to  be  identified 
(Pbw)/(2VT) 

Pitch  rate 

(Qc)/(2V7) 

Dynamic  pressure 
Ysv  rate 

(Bb^/aVx) 

Timm 


Unit* 


deg/sec 

(r*d/«ec) 


deg/tec 
(rad/ sec) 


lbu/ft* 

deg/*ee 

(r«d/aec) 


u  Syttcm  input 

Vert  )  Covariance  of  (  ) 


»T 


uf 

s 

y 

a 

6- 

{AH» 

% 

** 

«S 

) 


True  airspeed 
Fuel  flow 
System  stmt* 

System  output  (measurement) 
Angle  of  ettevk 
Sideslip 

Ambient  pressure  ratio 
Aileron  deflection 
Rudder  deflection 
noritontel  teil  deflection 
Sami l  perturbation  in  (  ) 


ft/sec 

Ib/hr 


deg 

deg 

deg 

deg 

deg 


6 


Set  of  “?v-v«ra  peremeters 


CAKB 


Ambient  temperature  retio 


** 

Heaturement  error  of  measurement  < 

** 

u 

Keatuteoent  autocorrelation 

as. 

°K 

Root  m**n  square  mtcru r-smsne  error 
of  measurement  X 

-- 

w* 

Angular  rai»  noodimtneiooelited  by 
isagth/speed 

Super  and 

Subscripts 

(  ,'mEA* 

Flight  test  measuremsnt  of  (  3 

t\ 

<  > 

Estimate  of  {  )  neat  remans 

<  >»/« 

tloaaboom 

1.  MOTIVATION 

System  identification  is  e  technology  for  determining  •  mathematical  modal  of  a  dynamic  syaeea  from 
observations  of  its  response  to  its  inputs.  This  technology  has  found  app'icatioo  lo  a  number-  of  fields 
of  engineering:  aircraft  aerodynamics  11),  process  control  (2).  electric  power  production  sod  distribu¬ 
tion  13),  propulsion  [4],  medicine  15),  econometrics  (f|,  end  structural  dy.emica  {7). 

The  use  of  system  identification  methods  can  mak.y  unique  contributions  to  the  mathematical  modeling 
of  e  dynamic  system: 


•  Determination  of  parasMtera  which  model  the  dynamic,  as  opposed  to  the  static,  characteristics 
of  the  system.  Examples  of  such  parasmters  are  those  which  model  mechanical  damping  in  a 
structure  or  the  angular  rate  aerodynamic  coefficients  for  an  aircraft. 

a  Model  validation.  The  use  of  input/output  data  from  the  operational  system  may  be  the  only 
way  to  ensure  validity  of  the  mathematical  system  model. 

Although  system  identification  is  often  regarded  as  a  set  of  techniques  for  data  proc'essing,  the 
overall  technology  has  a  broader  scope  which  includes  test  planning,  instrumentation  specification,  and 
choice  of  matheswiticat  model  structure,  as  well  as  the  numerical  methods  of  parameter  estimation  and  the 
atatisticel  techniques  of  interpreting  results.  The  overall  scope  of  the  problem  can  be  illustrated 
with  the  eircreft  aerodynamic  modeling  example.  In  planning  the  effort,  the  following  queations  should 
first  be  addressed. 

e  What  unique  contributions  can  system  identification  stake  to  the  particular  modeling  objectives? 

e  What  is  the  purpose  of  the  model?  It  might  be  used  for  eetive  control  design  or  for  investi¬ 

gation  of  the  physiee  underlying  the  aerodynamic  phenomena.  The  ultistate  purpose  of  the  model 
will  affeet  the  choice  of  the  stucture  of  the  model  to  be  identified  end  the  estimation  accu¬ 
racy  required. 

e  What  test  inputs  are  acceptable  from  the  point  of  view  of  safety  of  operation  while  the  data 

are  being  collected?  Stull  control  surface  excursions  may  not  excite  the  dynamic  modes  of  in¬ 
terest  while  large  excursions  may  cause  spins  or  other  dangerous  maneuvers. 

*  What  aspects  of  the  input  and  output  of  the  system  must  be  ateetured?  Possible  measurements  on 
the  aircraft  include 

-  etatrol  surface  position, 

-  engine  KPM  and  fuel  flow, 

-  attitude  (roll,  pitch,  yaw), 

-  angulac  rate, 

■'  angular  acceleration, 

-  translations!  acceleration, 

-  velocity  with  respect  to  the  sir, 

-  angle  of  attack  and  sideslip. 


*  dynamic  pressure,  and 
-  position. 

*  How  accurately  must  these  quantities  be  measured? 

a  Whet  duration  of  experiment  is  most  cost-effective?  tong  time  record*  may  he  expensive  to  ob¬ 
tain,  yet  short  time  records  may  not  yield  accurate  eatirstes.  Mow  high  must  the  sampling 
rate  ha? 

•  Mow  tempi**  must  the  mathematical  modal  he?  Mow  many  parsmstera  need  to  be  estimated?  for 
example,  are  aerodynamic  forces  and  moment*  linear  functions  of  vehicle  ststee  or  do  signifi¬ 
cant  nonl insert tie*  exist?  The  estimation  of  a  much  larger  number  of  parameters  may  be  re¬ 
quired  in  the  latter  case. 

a  Can  the  quality  of  the  dal#  be  tested  it  the  experiment  te  being  tun?  Can  the  system  model  be 
determined  fn  real’ rim*  or  must  it  be  determined  by  batch  processing  method*  run  off-line 
sftee  the  dais  have  been  -collecte-d? 

#  If. the  data  from  the  expettaeot  must  be  processed  off-line  following  the  experiment,  whst  will 
be  the . expense  of  the  processing?  Given  the  nature  of  the  chosen  model,  what  type  of  proces¬ 
sing  method  is  "squired? 

e  Mow  « so  we  assess  rh«  validity  of  the  model  dtteemi  ed  by  processing  the  t-ist  data? 

The  technology  of  system  Identification  is  particularly  vell-rwitef  to  the  processing  of  data  from 
aircraft  flight  tests.  Identification  author*.  can  extract  information  on  mathematical  aerodynamic 
saodels. 

The  development  of  an  integrated  flight  tettiug  procedure  depends  on  the  ability  to  identify  non- 
Unset  aerodynamic  characteristics  and  propulsion  system  performance  from  flight  teat  data.  The  identi¬ 
fied  mo.-iel*  help  to  define  performance,  stability  and  control,  and  unaugmented  airframe  dynamic  char¬ 
acteristics  c:  the  aircraft  frying  evaluated,  by  identifying  the  nonlinear  aerodynamic  modal*  in  a  mul¬ 
tivariable,  tab!*- look-up  format,  direct  correlations  can  be  made  with  ptef light'  aerodynamic  predictions 
<#•(-,  wind  turn* 1  data)  and  simulation  model*. 

by  using  a  date  processing  technique  that  can  identify  aerodynamic  and  inatalled  propulsion  models 
from  many  large-amplitude,  dynamic  teat  conditions,  it  is  possible  to  enhance  the  test  productivity 


through  a  reduction  in  required  test  tine.  For  the  dynamic  maneuvers,  the  teat  time  ia  defined  in  term* 
of  seconds.  Other  motivating  factors  that  support  the  development  of  this  technology  include:  improve¬ 
ments  in  safety  of  flight,  and  a  general  expansion  of  requirements  for  higher  fidelity  aerodynamic 
models  of  the  aircraft. 

Safety  of  flight  can  be  enhanced  during  a  flight  envelope  expansion  test  program  by  using  system 
identification  techniques  to  validate  the  aircraft  simulation  model  for  flight  regimes  already  tested. 
The  updated  mathematical  mode:  can  then  be  used  to  make  preflight  predictions  for  flight  envelope  expan- 
aion  teat  conditions.  In  addition,  when  nonlinear  identification  models  and  identification  techniques 
are  used,  the  pilot's  task  is  greatly  simplified  since  he  ia  not  required  to  maintain  small  perturbation 
flight  about  a  trimmed  operating  points 

The  need  for  improved  modeling  of  aircraft  aerodynamic  characteristics  has  been,  and  continues  to 
be  apparent  in  numerous  areas  of  technical  and  operational  importance.  Four  such  arean  are:  1)  flying 
quality  nilitrry  specification  compliance  testing,  2)  training  simulations,  3)  design  methods  for  spe¬ 
cification  of  aircraft  characteristics,  and  4)  the  development  of  mission  profiles  that  make  optimum  use 
of  the  airplane's  capabilities.  In  general,  there  is  s  need  for  an  improved  understanding  of  an  air- 
planaa'a  aerodynamic  characteristics  which  supports  design  improvements  for  increased  cost  effective¬ 
ness,  expanded  mission  flexibility  and  enhanced  operational  safety. 

The  objectives,  of  this  paper  ere  to  (1)  outline,  the  aspects  of  system  identification  which  are 
relevant  to  the  aircraft  nonlinear  aerodynamic  identification  problem,  (2)  introduce  three  new  useful 
data  processing  techniques,  end  (3)  demonstrate  the  application  of  the  system  identification  technology 
to  the  Analysis  of  flight  test  data  from  the  halve  1  Air  Teat  Center  F-4S  aircraft. 

2.  OVEWMEti.  OP  SYSTBK  IDENTIFICATION  METHODOLOGY 

2.1  The  Integrated  System  Identification  Process  \ 

2  *  1 . 1  Iterative  Nature  of  the  Process 

Figure  1  indicates  an  integrated  identification  procedure  which  applies  to  a  wide  variety  of  dyna¬ 
mic  system  types  (e.g.,  structural  dynamics,  vehicle  dynamics,  etc.).  This  system  idsntif iest ion  pro¬ 
cess  consists  of  an  iterative  loop  of  test  planning,  actual  testing,  and  data  processing.  Two  feedback 
loops  can  be  vital  to  the  ctverell  success  of  the  program.  An  inner  loop  is  closed  during  the  data  eol- 
Uction  phase.  Thie  inner  loop  cheeke  the  quality  of  the  date  produced  by  the  testa.  The  checking  is 
done  in  real-time  or  nearly  in  real-time.  If  the  quality  of  the  data  is’  determined  to  be  poor,  then 
corrective  action  can  be  taken  while  the  teat  ersw  and  facilities  are  still  available.  For  example,  the 
teat  can  be  repeated  with  modified  inputs  or  sensor  configurations.  Foot  data  quality  may  be  due  to: 

e  failed  sensor  channels, 

•  excessively  noisy  sensor  channels,  or 

e  failure  to  excite  dynamic  modes  of  interest. 

Data  quality  evaluation  may  be  performed  by  a  variety  of  real-time  techniques  including: 

e  visual  inspection  of  data  record*  at  displayed  by  atrip  chart  recorder  or  C«T, 

e  fault  detection  tilor*  (31,  or 

e  actual  parameter  estimation  using  algorithm*  configured  for  high  completion  speed  |9J. 

An  outer  loop  of  test  planning  may  also  need  to  be  closed  shout  the  entire  identification  process.  The 
test  planning  is  a  "boot-strap"  process.  A  model  of  the  system  it  needed  in  order  to  chons#  test  input 
signals  and  to  specify  sensrr  requirements.  The  characteristics  »f  the  model  which  result  from  the 
identification  data  procaseifig  may  indicate  that  additional  input  sigmale  or  sensor*  srs  required  for 
complete  model  ideatif icstioo.  If  thie  is  the  case,  them  another  aet  of  data  collection  testa  will  b* 
required. 

2.1.2  Tf>»  Freteet  Manning  There 

Tha  pretest  planning  phase  includes  the  specif iesciom  of  an  instrument  system  and  test  inputs.  The 
overall  technology  of  spates*  identification  sow  extend*  to  include  analytic  method*  of  specifying  test 
input  signals  to  minimise  th  uncertainty  of  estimates  of  paremsters  {10)  and  of  specifying  sensor  »-.u- 
racy  requirements  (It). 

2.1.3  Ths  Pate  Froctaeine  Fhass 

The  actual  processing  of  th*  data  requires  four  major  steps.  These  *»** 
a  flight  data  processing  and  analysis, 
e  medal  ernictvte  determination, 
a  parameter  estimation,  and 
a  nodal  validation. 


2. 1.3.1  Flight  Beta  Preprocessing  and  Analysis 

The  preprocessing  and  analysis  of  flight  test  data  i*  a  raajor  element  of  the  integrated  system 
identification  procedure.  The  overall  objectives  of  this  task  are  to  reviev  measurement  excitation, 
remove  vildpointa,  reconstruct  unmeasured  quantities  (i.e.,  acceleration  of  the  aircraft  at  the  center 
of  gravity),  and  develop  a  set  of  kinematically  consistent  measurements.  Where  measurement  consistency 
canno'  be  established,  requirements  for  instrumentation  system  error  source  modeling  are  defined.  Meas¬ 
urement  consistency  has  a  significant  impact  on  parameter  identification  accuracy  since  unaccounted-for 
errors  will  bias  parameter  estimates. 

2. 1.3.2  Sodel  Structure  Determination 

The  model  structure  determination  phase  [12)  consists  of  processing  the  input/,  clput  date  to  deter¬ 
mine  the  significant  linear  and  nonlinear  equations  and  associated  parameters  that  are  necessary  to  rep¬ 
resent  an  observed  system  response.  Questions  addressed  here  include  the  determination  of  the  order  of 
the  siodel  (e.g.,  number  of  degrees  of  freedom)  and  a  mathematical  form  (e.g.,  polynomial)  to  represent 
any  nonlinear  character  in  the  dynamic  equations.  For  linear  dynamic  systems,  the  d  termination  of 
order  is  of  primary  issportance.  For  nonlinear  systems  the  determination  of  forms  to  reprenent  nonline¬ 
arities  has  equal  importar c. 

2. 1.3.3  Parameter  Estimation 

The  estimation  of  unknew  parameter  values  fellows  the  determination  of  a  suitable  model  struc¬ 
ture.  Numerical  values  of  unknown  parameter*  are  determined  by  choosing  them  to  optimise  some  perfor¬ 
mance  index  which  measures  how  veil  the  methemat irii  teuiel  represents  the  observed  data.  Possible  per¬ 
formance  criteria  include: 

a  minimixat ien  of  au»  «f  -alighted  squared  fit  errors,  and 

a  miniaisst iss  e.t  me  autocorrelation  of  the  fit  errors. 

Fit  error  is  the  difference  between  the  observed  response  of  the  dynamic  system  and  the  aimulated  re¬ 
sponse  of  the  system  model  to  (he  &sse»ved  inputs. 

The  det eraiaation  <it  model  structures  and  the  estimation  of  parameter  value*  are  often  done  in 
parallel.  Model  ettuctures  are  determined  by  fitting  several  competing  candidate  models  [specified  by 
the  user)  to  the  observed  system  response.  The  model  structure  which  gives  the  "best"  fit  of  the  data 
is  the  structure  chosen. 

Several  criteria  may  be  used  ty  evalutate  the  closeness  of  the  fit.  Simple  mean  square  tit  error 
is  not  a  suitable  criterion  for  thr  comparison  of  all  candidate  models.  The  use  of  this  criterion  will 
always  lead  to  the  choiee  of  highly  complex  models,  since  adding  degree?  of  freedom  to  the  model  always 
leads  to  reduced  mean  square  fit  error.  Hore  useful  criteria,  when  evaluating  candidate  models  having 
different  numbers  of  parameters  are  the  mean  square  prediction  error  and  the  F  statistic  ilJ).  These 
criteria  weight  fit  error  against  the  number  of  free  parameters  in  the  model. 

A  tww~*tage  process  has  been  found  to  be  effective  for  determining  the  structure  of  the  model  and 
estimates  of  the  parameters.  First,  candidate  m* dels  are  evaluated  by  choosing  their  parameters  to 
minimise  fit  error  or  mean  square  pred*<lio«  error.  This  evaluation  is  done  using  a  numerical  scheme 
which  accurately  evaluates  the  performance  index  b<  t  which  may  not  accurately  evaluate  the  parameter 
estimates  themselves.  Once  the  model  etrurtvre  Is  established  in  this  way,  the  pr.rameter  estirstea  are 
refined  using  a  scheme  which  gives  more  accurate  parameter  estinstes.  For  dynamic  systems,  a  computa¬ 
tionally  efficient  method  which  is  effective  for  model  structure  determination  is  the  equation  error 
minimi  rat  ion  method,  Parameter  estimates  may  subsequently  be  refined  using  output  error  min  miaat ion  or 
combined  state  and  parameter  estimation  methods.  All  three  of  these  estimation  aarthode  are  treated  in 
greater  detail  in  Sett  ion  2.3. 

2.1.).*  Model  Validation 

A  good  criterion  fot  the  validation  of  a  model  is  the  use  of  the  model  to  predict  new  data.  A 
typical  procedure  might  be  to  use,  say,  fOI  of  the  available  data  to  determine  the  model  structure  and 
parameter  values.  Then  the  resulting  model  would  he  used  to  predict  the  remaining  28-5  of  the  data.  The 
degree  of  validation  achieved  Can  then  be  interpreted  from  the  accuracy  of  the  prediction. 

Statistical  performance  criteria  include: 

•  mean  square  fit  error, 

•  mean  square  prediction  error,  and 

a  whiteness  (statistical  independence)  of  residuals. 

Whiteness  can  often  be  evaluated  effectively  by  visusl  inspection  of  plots  of  the  observed  date  super¬ 
imposed  on  plots  of  the  predicted  date.  Plots  of  the  residuals  themselves  may  also  be  used. 

Finally,  validation  should  include  comparison  of  the  modal  determined  from  system  i. Isnt itiesf ion 
with  models  svsilablt  a  priori.  Aerodynamic  models,  at  determined  by  identification,  should  be  compared 
to  those  determined  from  theoretical  predictions  or  wind  tunnel  tests. 

2.2  Tsst  Planning 


The  first  step  in  ths  integrated  system  {dent ificet ion  procedure  is  the  planning  o.  the  teats.  A 
[flt»_Illiem^  identification  test  plan  should  include  epecifications  for  both  the  input  signals  JlO| 


2. 1.1.1  Flight  Data  Preprocessing  and  Analysis 


The  preprocessing  and  analysis  of  flight  test  data  is  a  major  element  of  the  integrated  system 
identification  procedure.  The  overall  objectives  of  this  task  are  to  review  measurement  excitation, 
remove  vildpoints,  reconstruct  unmeasured  quantities  (l.e.,  acceleration  of  the  aircraft  at  the  center 
of  gravity),  and  develop  a  set  of  kinematically  consistent  measurements.  Where  measurement  consistency 
cannot  he  established,  requirements  for  instrumentation  system  error  source  modeling  are  defined.  Meas¬ 
urement  consistency  has  a  significant  impact  on  parameter  identification  accuracy  since  unaccounted-for 
errors  will  bias  parameter  estimates. 

2. 1.3.2  Model  Structure  Determination 

The  model  structure  determination  phase  [12]  consists  of  processing  the  input/output  data  to  deter¬ 
mine  the  significant  linear  and  nonlinear  equations  and  associated  parameters  that  are  necessary  to  rep¬ 
resent  an  observed  system  response.  Questions  addressed  here  include  the  determination  of  the  order  of 
the  model  (e.g.,  number  of  degrees  c.f  freedom)  and  a  mathematical  form  (e.g.,  polynomial)  to  represent 
sny  nonlinear  character  in  the  dynamic  equations.  For  linear  dynamic  systems,  the  determination  of 
order  is  of  primary  importance.  For  nonlinear  systems  the  determination  of  forma  to  represent  nonline¬ 
arities  has  equal  importance. 

2. 1.3.3  ParaaMter  Estimation 

The  estimation  of  unknown  parameter  values  follows  the  determination  of  a  suitable  model  struc¬ 
ture.  Numerical  values  of  unknow  parameters  are  determined  by  choosing  them  to  optimise  some  perfor¬ 
mance  index  which  measures  ho*<  well  the  mathematical  model  represents  the  observed  data.  Poaaible  per¬ 
formance  criteria  include: 

•  minimisation  of  sum  of  weighted  squared  fit  errors,  and 

•  minimisation  of  the  autocorrelation  of  the  fit  errors. 

Fit  error  i*  the  difference  between  the  observed  response  of  the  dynamic  system  and  the  simulated  re¬ 
sponse  of  the  sysCem  model  tv  the  observed  inputs. 

The  determination  of  model  structures  and  the  estimation  of  parameter  values  are  often  done  in 
parallel.  Model  structures  are  determined  by  fitting  several  competing  candidate  models  (specified  by 
the  user)  to  the  observed  systen  response.  Tha  model  structure  which  gives  the  ‘best"  fit  of  the  data 
is  the  structure  chosen. 

Severs!  criteria  may  be  used  to  evalutats  the  cloaeneas  of  the  fit.  Simple  mean  square  fit  error 
it  not  a  suitable  criterion  for  the  comparison  of  all  candidate  oriels.  The  uta  of  this  criterion  will 
always  lead  to  the  choice  of  highly  complex  models,  since  adding  agrees  of  freedom  to  the  smdel  always 
lead?  to  reduced  mean  square  fit  error.  Moro  useful  criteria,  .ten  vv. lusting  candidate  models  having 
different  numbers  of  parameters  are  the  mean  square  prediction  ror  and  the  F  statistic  (13).  These 
criteria  weight  fit  error  against  the  number  of  fret  parameters  ,  .  the  model. 

A  two-ttsge  process  has  been  found  to  be  effective  for  determining  the  structure  of  the  model  and 
estimates  of  the  parameters.  First,  candidate  model i  are  evaluated  by  choosing  their  parameters  to 
minimise  fit  error  or  mean  square  prediction  error.  This  evaluation  is  done  using  s  numerical  scheme 
which  accurately  evaluate*  the  performance  index  but  which  may  not  accurately  evaluate  the  parasieter 
estimates  themselves.  Once  the  model  structure  it  established  in  this  way,  the  parameter  estimates  are 
refined  using  a  scheme  which  gives  more  accurate  parameter  estimates.  For  dynamic  systems,  a  computa¬ 
tionally  efficient  method  which  is  effective  for  model  structure  determination  i*  the  equation  error 
minimUetton  method.  Parameter  estimates  may  subsequently  be  refined  using  output  error  minimisation  or 
combined  state  end  parameter  estimation  eethode.  Alt  three  of  these  estimation  methods  are  treated  in 
greater  detail  in  Section  2.3. 

2. 1.3.4  Model  Validation 

A  gond  criterion  for  the  validation  of  a  model  if  the  use  of  the  teodel  to  predict  new  data.  A 
typical  procedure  might  be  to  uae,  say,  SOX  of  the  available  data  to  determine  the  model  structure  and 
parameter  values.  Then  the  retutting  model  would  be  used  to  predict  the  remaining  20X  of  the  data.  The 
degree  of  validation  achieved  can  then  be  interpreted  fro*  the  accuracy  of  tha  prediction. 

j  Statistical  performance  criteria  include; 

|  e  mean  square  fit  arror, 

!  a  mean  square  prediction  error,  and 

| 

i  e  whiteness  (statistical  independence)  of  rciiduals. 

1 

;  whiteness  con  often  be  evaluated  effectively  by  visual  inspection  of  plots  of  the  observed  data  euper- 

I  imposed  on  plcte  of  the  predicted  data.  Plots  of  the  residuals  themselves  may  also  be  used, 

\ 

Ftnelly,  validation  should  include  comparison  of  the  model  determined  fro*  system  identification 
with  models  evsilsble  •  priori.  Aerodynamic  models,  at  determined  by  identification,  should  bs  compared 
to  thoae  determined  from  theoretical  predictions  or  wind  tunnel  testa. 

2.2  Test  Planning 

The  firat  step  in  the  integrated  system  identification  procedure  is  the  planning  of  the  teete.  A 
t  Holloa  identification  test  plan  should  include  specifications  for  both  the  input  signals  (10) 
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and  the  instrument  system  [11].  Analytic  methods  exist  for  specifying  these  two  critical  items  as  func¬ 
tions  of  system  identification  accuracy  requireaients. 

Design  of  the  system  excitation  requires  some  a  priori  knowledge  of  the  system  characteristics  as 
well  as  data  collection  constraints  such  as  sampling  rate  and  data  length  limits.  It  is  also  necessary 
to  specify  the  objectives  of  the  identification.  For  example,  some  parameters  may  be  known  accurately  a 
priori.  The  objective,  then,  say  be  to  identify  only  those  parameters  that  are  poorly  known. 

Figures  2  and  3  [1]  illustrate  the  importance  of  test  signal  design  in  reducing  the  uncertainty  in 
the  estimates  of  parameters.  Figure  2  shows  an  elevator  input  signal  for  identifying  five  parameters 
which  model  the  longitudinal  dynamics  of  the  C-8  transport  aircraft.  The  input  time  history  is  chosen 
to  minimise  the  sum  of  the  covariances  of  the  five  parameters.  Figure  3  compares  the  standard  deviation 
in  parameter  estimates  for  the  optimal  input  and  for  a  doublet  input  which  has  the  same  total  energy  as 
the  optimal  input.  As  can  be  seen,  a  doublet  elevator  input,  commonly  used  for  aircraft  flight  testing, 
is  not  as  effective  as  the  optimal  input.  In  executing  the  test,  the  suboptimal  input  shown  on  Figure  2 
results  in  performance  nearly  exactly  that  of  the  optimal,  but  is  ouch  easier  to  implement  by  the  pilot. 

The  other  important  factor  in  the  initial  teat  plan  is  specification  of  the  instrumentation.  Un¬ 
fortunately,  the  instrument  systems  employed  to  record  data  for  system  identification  processing  are 
often  not  designed  with  such  ends  in  mind.  For  example,  aircraft  autopilot  instruments,  which  are  often 
used  to  record  data  during  test  flights,  may  not  have  sufficient  accuracy  to  allow  consictent  aerodyna¬ 
mic  parameter  estimation.  This  is  becoming  less  of  a  problem  due  to  the  use  of  navigation  grade  sensors 
in  modern  digital  flight  control  system  designs.  The  ring-laser  gyro-baaed  strapdown  inertial  reference 
unit  is  a  good  example  of  this  change. 

Sensors  are  subject  to  a  variety  of  errors  that  degrade  both  state  and  parameter  estimation  accura¬ 
cies.  Reference  11  presents  an  analytical  technique  to  determine  the  effect  of  sensor  errors  on  estima¬ 
tion  accuracies.  Both  random  (e.g.,  additive  uncorrelated  noise  in  meisuremcnts)  and  systematic  (e.g., 
instrument  bias  or  scale  factor  errors)  are  treated.  One  important  conclusion  is  that  systematic  errors 
of  relatively  small  magnitude  in  comparison  with  random  errors  can  cause  siguificent  parameter  estima¬ 
tion  bias.  It  such  systematic  error*  are  unavoidable,  then  parameters  modeling  them  can  be  added  to  the 
set  of  totel  parameters  to  be  estimated.  This  technique  can  reduce  overall  parameter  estimation  uncer¬ 
tainty. 

2.3  Data  Processing  Algorithm 

A  large  number  of  amthoda  exist  for  performing  system  identif icstion  dots  processing.  The  beat 
algorithm  for  any  given  application  depends  strongly  on  the  type  of  model  end  on  the  nature  of  the 
available  data.  No  one  type  of  processing  algorithm  can  handle  ell  possible  application*. 

this  section  outlines  three  processing  mat hod*  (Table  1)  which  have  been  found  tc  be  effective  in  a 
variety  of  applications.  These  methods  are: 

•  equation  error  minimisation  methods, 

a  output  error  minimisation  methods,  and 

a  simultaneous  stags  and  parameter  esliaution  see  thesis. 

2.3.1  Equation  Error 

The  equation  error  minimitetion  method*  estimate  unknown  parameters  by  choosing  them  to  minimise  a 
performance  index.  A  contiruous  dynamic  system  mutt  bn  represented  ae: 

dx/dt  •  f(x,  u,  t,  6)  ♦  w 

where  0  is  e  eel  of  p  unknown  parameter*  and  w  is  s  tima-varying  unobservable  disturbance.  An 
analogout  formulation  exist e  for  a  discrete  dynamic  system.  The  performance  index  3ft(0)  to  be  mini¬ 
mised  ie: 


~TT~  ~  (1) 


The  equation  error  minimisation  method  it  often  railed  the  least  squares  method  because  of  rhe  form  of 
the  performance  index 


The  effective  use  of  the  equation  error  minimitetion  requires  the  a  priori  determination  of  system 
etstee  x,  controls  u,  end  etete  derivatives  di/dt  over  the  time  interval  of  the  test.  A  priori 
here  meant  that  these  quantities  must  be  determined  before  the  unknown  system  pa  matters  ere  estimated. 
This  determination  may  be  dona  using  direct  meaauremant*  or  using  system  characteristics  which  am  in¬ 
dependent  of  the  parameters,  for  example,  an  unmeasured  state  derivative  may  be  determined  by  (very 
carefully)  numerically  differentiatls^  a  matured  ateta  time  history. 

The  ram  u  it  a  at  echo*  tic  quantity  which  represents  unmeasurable  process  disturbance*  in  the 
system,  w  includes  wind  gusts  and  uomodelcd,  high -order  aerodynamic  effects. 

The  special  advantage  of  the  equetioa  error  ainimitatlon  method  ties  in  the  fact  that  many  non¬ 
linear  dynamic  system  functions  f(x,  u,  t,  #)  are  linear  in  the  parameters  6 .  In  other  words. 
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f(x,  u,  t,  e) 


p 

z  e.f.(x,  u, 
j-i  J  J 


c)  *  fP+i(x*  u*  l) 


(2) 


The  functions  fj,  j*l ,2 , . . .p+1  are  independent  of  the  p  unknown  parameters  6j.  The  parameter 
values  which  minimite  .S7*e(  8)  can  be  found-  explicitly  using  linear  algebraic  operations  C 14  J .  The  dis¬ 
advantages  of  the  equation  error  minimisation  method  arise  primarily  from  the  requirement  for  very  accu¬ 
rate  measurements  of  states  and  controls.  States  will  inevitably  be  measured  with  some  error.  No  meas¬ 
urement  at  all  may  be  available  for  other  states. 

2.3.2  Output  Error 

Output  error  minimization  methods,  like  equation  error  minimization  methods,  estimate  unknown 
parameters  by  choosing  them  to  minimize  a  performance  index.  The  dynamic  system  must  be  represented  as 

dx/dt  «  fix,  u,  t,  @),  xito)  ”  g(6)  (3) 

y  »  hix,  u,  t,  9)  +  v  (4) 

where  9  is  i  set  of  p  unknown  parameters  and  v  is  s  time-varying,  unobservable,  additive  measure¬ 
ment  error.  The  performance  index  12^(9)  to  be  minimized  is 

.<?  (9)  “  z  [y.-  y( t . ,  0)]2  (5) 

°  i“l  1 

Mere  yi  is  the  observed  system  output  at  time  t£.  y(t£,  0  )  is  the  system  output  y  predicted 
for  time  t£  by  solving  the  system  state  equations  and  measurement  equations  using  the  measured  sys¬ 
tem  inputs  uitjO  and  the  a  priori  parameter  values  6. 

The  effective  use  of  the  output  error  minimization  requires  the  very  accurate  measurement  of  system 
inputs  u  and  the  measurement  of  system  outputs  y.  The  method  will  tolerate  errors  in  the  measurement 
of  y. 


The  term  v  is  a  stochastic  quantity  which  represents  instrument  measurement  errors,  e.g.,  analog- 
to-digital  quantization  noise. 

The  special  advantage  of  the  output  error  minimization  method,  with  respect  to  the  equation  error 
method,  is  that  the  measurement  requirements  are  greatly  relaxed.  The  method  does  not  require  the  accu¬ 
rate  measurement  of  all  state  and  state  derivatives.  Rather,  it  is  effective  using  noisy  measurements 
of  the  limited  number  of  outputs  that  are  available. 

The  actual  determination  of  the  parameter  values  9  which  minimize  the  performance  index  &q(.Q) 
is  computationally  more  complex  than  the  minimization  ^Pe(9).  This  is  because  )  is  a  non¬ 

linear  function  of  the  parameter  set  0.  Finding  the  minimizing  parameter  set  requires  an  iterative 
numerical  scheme  [15,  16] .  The  application  of  such  numerical  methods  is  often  not  straightforward. 

The  principal  disadvantage  of  the  output  error  minimization  scheme  is  that  it  does  not  explicitly 
allow  for  the  presence  of  unmodeled  disturbances  in  the  state  dynamics.  Such  disturbances  are  represen¬ 
ted  in  the  equation  error  method  by  the  term  w.  "Process  noise"  is  Che  term  often  used  to  describe 
these  unmodeled  effects. 

It  should  be  noted  that  the  output  error  method  can  account  for  system  dynamics  disturbances  of 
unknown  magnitude  if  the  form  of  these  disturbances  is  accurately  represented.  The  disturbance  w  roust 
be  explicitly  represented  as 

w  »  w(t,  6 )  (6) 

The  unknown  elements  of  the  disturbance  are  represented  using  part  of  the  unknown  parameter  vector  6. 
One  might  estimate  the  horizontal  plane  components  of  a  steady  wind  present  during  a  flight  test,  for  ex¬ 
ample. 


2.3,3  Combined  State  and  Parameter  Estimation 


Methods  which  combine  state  and  parameter  estimation  are  required  if  significant  levels  of  both  un¬ 
known,  unmodeled  disturbances  and  measurement  errors  are  present  in  the  system  under  study.  The  perfor- 
mance  index  used  here  is  very  similar  to  the  output  error  index  (  6 ) .  However,  the  estimated 
outputs  y  are  now  direct  functions  of  the  observed  outputs  y.  The  performance  index  is 

M  , 

S’. (9)  -  t  [y.  -  yC t . ,  0,  y)r  (7) 

*  i-1  1  1 

The  estimated  outputs  are  determined  using  both  the  system  dynamic  equations  and  the  observed  values  of 
the  outputs  themselves. 

Methods  for  ths  determination  of  y  given  measurements  y  and  an  assumed  form  of  the  system  dyna¬ 
mics  have  bean  widely  studied  under  the  topics  of  state  estimation  [17]  and  linear  system  observation 
[18].  The  use  of  the  Kalman  filter  to  estimate  y  in  the  modified  output  error  performance  index 
leads  to  "maximum  likelihood"  parameter  estimates  [19].  The  procedure  requires  that  SPt  be 
evaluated  as  a  function  of  6  using  the  Kalman  filter  to  estimate  y.  The  parameter  values  are 


estimated  by  choosing  them  to  minimize  -5^(0)  using  a  Gauss-Newton  method  [20] .  The  use  of  this 
maximum  likelihood  estimation  procedure  often  allows  the  estimation  of  system  noise  levels  as  well  as  of 
parameters  describing  the  physical  plant. 

2.4  Some  Practical  Considerations 

This  section  briefly  discusses  a  number  of  practical  considerations  which  should  be  taken  into 
account  in  an  effective  system  identification  data  processing  scheme. 

2.4.1  Assumptions  Regarding  Measurement  Noise  Statistics 

A  common  problem  is  to  assume  that  the  measurement  error  should  be  modeled  as  a  Gaussian  white 
process  when  in  fact  systematic  errors  such  as  bias  and  scale  factor  exist.  Systematic  measurement 
errors  will  usually  cause  larger  parameter  estimation  errors  than  random  noise  errors  of  the  same  root- 
mean-square  level.  A  very  common  scale  factor  found  when  dealing  with  any  instrument  using  electronic 
pickoffs  is  -1.0.  This  is  due  to  simple  polarity  errors  made  when  installing  the  instrument.  Reference 
il  covers  methods  of  assessing  the  relative  significance  of  systematic  meaaui.ement  errors  and  random 
measurement  errors. 

2.4.2  Number  of  Independent  Parameters  In  The  Model 

Problems  can  arise  from  an  attempt  to  fit  too  complex  a  model  to  the  available  data.  The  chief 
symptom  of  this  is  that  a  large  scatter  of  estimated  parameter  values  will  be  seen  if  several  data  sets 
are  used  independently  to  estimate  values  for  the  same  parameter  set. 

2.4.3  Extrapolation  of  Results 

An  identified  model  should  not  be  used  to  predict  system  behavior  for  operating  regimes  far  beyond 
those  encountered  during  data  collection.  Operating  regime  predictions  should  be  limited  in  both  input 
bandwidth  and  amplitude  to  those  tested. 

2.4.4  Excitation  of  All  System  Modes 

This  problem  can  be  avoided  by  careful  choice  of  inputs  during  the  test  planning  stage.  A  second 
solution  is  to  process  multiple  maneuvers  simultaneously  which  contain  different  control  inputs.  By 
doing  this,  the  required  modal  information  is  extracted  from  a  set  of  simpler  maneuvers,  rather  than  one 
complicated  maneuver. 

2.4.5  Effective  Use  of  Sequential  Data  Processing  Schemes 

System  identification  data  processing  requires  the  computational  steps  of  model  structure  determin¬ 
ation,  parameter  estimation,  and  model  validation.  An  additional  preliminary  step  of  prefiltering  meas¬ 
urements  may  also  be  required  for  effective  use  of  an  equation  error  parameter  estimation  method.  An 
effective  overall  computational  scheme  may  require  that  the  operations  of  prefiltering,  modal  structure 
determination,  and  parameter  estimation  be  carried  out  in  a  sequential  rather  than  in  a  store  nearly 
simultaneous  manner.  Care  must  be  taken  to  ensure  that  the  algorithms  employed  at  any  given  stage  do 
not  remove  critical  information  from  the  data.  As  a  simple  example,  the  bandpass  of  a  noise  prefilter 
should  be  higher  than  that  of  the  modes  of  the  system  to  be  identified. 

2.4.6  Process  Noise 

The  term  "process  noise"  refers  to  unmodeled  factors  in  the  state  dynamics  of  the  system  being 
identified.  Sources  of  process  noise  include: 

(1)  unmeasured  environmental  disturbances  -  wind  gusts  acting  on  an  aircraft,  for  example, 

(2)  unmodeled  nonlinearitiea  or  degrees  of  freedom  in  the  state  dynemics,  and 

(3)  errors  in  measuring  input  signals. 

The  effect  of  process  noise  ie  usually,  but  not  always  ,  to  dagrade  estimation  accuraciea.  If  Mature- 
ments  of  system  states  are  highly  accurate,  then  the  process  noise  becomes  the  major  aourca  of  estiaa- 
tion  error.  Under  some  circumstances,  process  noise  in  the  form  of  unmeasured  environmental  disturban¬ 
ces  can  improve  estimation  accuracy.  The  environmental  disturbances  might  excite  modes  of  tha  syetam 
which  are  not  excited  by  the  known  input  test  signal. 

The  relative  significance  of  process  noise  in  an  identification  effort  depends  roughly  upon  tha 

ratio 


r  ■  RMSiprocets  noiee)/RHS(knovn  inputs)  (6) 

where  RMS(  )  refers  to  the  root-mean-equare  state  excursion  due  to  ths  indicated  source  of  excitation. 

If  r  ie  large,  then  the  process  noise  ie  significant.  If  r  it  small,  then  tha  process  noise  is  not 
significant.  It  is  difficult,  however,  to  specify  a  value  of  r  indicating  ths  boundary  between  signi¬ 
ficant  or  insignificant  process  noiss  level*  which  will  ba  valid  for  all  systems, 

Effactiv*  system  identification  methods  exist  for  uss  when  available  date  contain  procaaa  noise.  » 

The  equation  error  formulation  ia  preferred  if  all  system  states  can  b*  measured  or  estimated  accur¬ 
ately,  otherwise  the  formulation  combining  state  and  parameter  estimation  will  be  required. 


2. A. 7  Initialization 


Many  parameter  estimation  formulations  require  the  iterative,  numerical  solution  of  nonlinear  equa¬ 
tions.  The  output  error  and  the  combined  state  and  parameter  estimation  formulations  fall  into  this 
category.  Iterative  numerical  algorithms  require  initial  estimates  of  parameter  values  in  order  to 
begin  execution  of  the  first  iteration.  Inaccurate  initial  estimates  may  cause 

(1)  convergence  of  the  estimation  method  (which  usually  employs  some  form  of  performance  criterion 
minimisation  algorithm)  to  a  local  minimum,  or 

(2)  divergence  of  the  estimated  parameter  values  as  iterations  proceed.  Divergence  may  occur,  for 
example,  if  the  values  of  the  initial  parameter  estimates  cause  an  instability  in  the  dynamic 
system  model. 


An  effective  way  to  obtain  initial  parameter  estimates  for  starting  iterative  algorithms  is  often 
to  employ  the  equation  error  estimation  formulation.  As  noted  in  Section  2.3.1,  the  equation  error 
formulation  usually  requires  only  the  solution  of  a  linear  set  of  algebraic  equations  in  order  to  obtain 
parameter  estimates.  Such  equations  may  be  solved  without  a  priori  parameter  estimates.  An  effective 
two-step  parameter  estimation  procedure  is 

(1)  estimate  initial  parameter  values  using  the  equation  error  formulation,  then 

(2)  refine  these  estisiates  using  either  the  output  error  or  the  combined  state  and  parameter  esti¬ 
mation  formulations. 

The  values  of  parameters  estimated  using  the  equation  error  formulation  are  sensitive  to  errors  in  meas¬ 
uring  states  (measurement  noise).  However,  the  parameter  estimates  calculated  using  an  equation  error 
criterion  even  with  data  corrupted  by  measurement  noise  are  often  sufficiently  accurate  for  use  as 
start-up  values  for  iterative  algorithms. 

2,4.8  numerical  Methods 

System  identification  algorithms  engender  a  variety  of  numerical  mathematical  requirements.  Table 
2  lists  four  >f  these: 

a  solution  of  differential  equations, 

e  solution  of  linear  algebraic  systems  of  equations, 

•  solution  of  least  squares  problems,  and 

a  minimisation  of  general  nonlinear  multivariable  functions. 

Effective  methods  to  handle  these  problems  range  from  the  classical  systematic  elimination  method  of 
Causa  (23)  for  the  solution  of  systems  of  linear  algebraic  equations  to  more  recent  developments  in  the 
solution  of  linear  least  squares  problems  (26), 

An  important  consideration  in  using  any  of  the  methods  of  Table  2  is  that  of  numerical  condition¬ 
ing.  Numerical  conditioning  refers  to  the  sensitivity  of  the  output  of  a  numerical  algorithm  to  small 
changes  in  the  input  to  the  algorithm.  For  example,  assume  that  in  solving  a  system  of  n  linear  equa¬ 
tions 

Ax  •  b, 

tha  matrix  A  is  known  exactly,  but  the  vector  b  is  subject  to  uncertainty  6b.  The  norm  of  the  re¬ 
sulting  uncertainty  in  6x, 
x(  is  bounded  by  (25). 


I  6x1 


IN 


II  6b  it 


(9) 


T 

where  Uj  is  the  largest  eigenvalue  of  AA  and  un  is  the 


smallest  eigenvalua  of  AA  .  The  quantity 
cend(A)  * 


(10) 


is  callad  the  condition  number  of  A  and  ia  alwaya  greater  than  1.0.  Similar  bounds  for  solution  sen- 
•itivitiea  exist  for  least  squares  parameter  estimation  problem*  (26). 

The  condition  number  of  a  numerical  problem  can  give  insight  into  the  precision  required  to  obtain 
acceptable  accuracy  of  solution.  The  uncertainty  6b,  for  example,  might  be  due  to  rounding  due  to 
finite  precision  in  computer  word  length.  If  the  condition  number  of  e  problem  ie  10*,  then  eight 
significant  figure*  of  accuracy  would  be  required  to  maintain  IX  accuracy  in  the  solution. 
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3.  RECENT  TECHNIQUES  FOR  THE  NONLINEAR  AERODYNAMICS  PROBLEM 

Methods  for  the  maximum  likelihood  identification  of  linear  state  dynamic  models  are  well  estab¬ 
lished  [29].  Such  methods  have  been  applied  to  problems  of  linear  modeling  of  aircraft  aerodynamics 
using  flight  test  (22]  data.  These  methods  are  sometimes  applied  in  a  piecewise  manner  to  fundamentally 
nonlinear  systems. 

The  intrinsic  nonlinear  nature  of  aircraft  aerodynamic  models  may  inhibit  the  effective  use  of 
linear  identification  methods.  For  example,  if  a  limited  amount  of  data  is  available,  it  may  not  be 
possible  to  identify  many  linear  perturbation  models.  A  single  nonlinear  model  may  have  fewer  total 
free  parameters.  Also,  excursions  through  the  nonlinear  portion  of  the  model's  dynamic  range  may  be  so 
rapid  that  no  single  linearized  model  can  represent  a  significant  portion  of  the  trajectory. 

The  use  of  a  nonlinear  model  may  be  required  if  the  goal  of  the  analysis  is  to  determine  which  one 
of  several  competitive  phenomenological  mathematical  models  best  fits  available  data.  A  "phenomenologi¬ 
cal"  model  is  one  that  is  constructed  from  fundamental  physical  principles.  Such  a  model  may  have  a 
very  complex  form  mathematically  but  may  have  a  minimum  number  of  unknown  coefficients. 

There  are  certain  computational  difficulties  associated  with  the  use  of  a  nonlinear  dynamic  model 
in  a  maximum  likelihood  parameter  estimation  algorithm. 

(1)  Calculation  of  Sensitivities.  The  estimation  of  parameters  through  the  use  of  the  maximum 
likelihood  criterion  requires  the  maximization  of  the  likelihood  of  the  data  with  respect  to  the  unknown 
parameters.  The  determination  of  the  maximizing  parameter  values  requires  numerical  optimization  tech¬ 
niques.  The  most  efficient  of  these  [27]  are  descendents  of  the  Levenberg-Marquardt  nonlinear  least 
squares  method  [16,30].  These  algorithms  require  the  evaluation  of  the  partial  derivative  of  modeling 
residuals  with  respect  to  parameter  values.  This  partial  derivative  is  often  called  a  "sensitivity". 
The  calculation  of  these  sensitivities  is  not  difficult  in  principle.  They  satisfy  differential  equa¬ 
tions  which  are  closely  related  to  the  system  dynamic  equations,  but  which  contain  terms  based  on  the 
algebraic  partial  derivatives  of  the  dynamic  equations  (see  Section  3.1).  The  difficulty  is  one  of 
practice.  Any  time  that  the  structure  of  the  nonlinear  model  is  changed,  then  the  sensitivity  differen¬ 
tial  equations  must  be  changed  also.  This  requires  tedious  algebraic  differentiation  of  the  modified 
dynamic  equations. 

(2)  Evaluation  of  Covariance  of  Parameter  Estimates.  A  parameter  covariance  matrix  can  be  esti¬ 
mated  using  the  Cramer-Rao  bound  [31].  The  most  common  use  of  this  bound  assumes  that  the  errors  in 
predicting  the  response  of  the  system  are  due  to  an  additive,  white  (negligible  autocorrelation)  random 
process.  If  the  analyst  also  desires  confidence  intervals  for  parameter  estimates,  then  the  additional 
assumption  that  the  errors  have  a  normal  distribution  must  also  be  made.  These  assumptions  are  commonly 
violated  when  a  nonlinear  system  is  modeled.  In  particular,  the  whiteness  assumption  is  typically  vio¬ 
lated. 

(3)  Generic  Model  Structure.  There  is  a  need  to  represent  nonlinear  functions  of  several  vari¬ 
ables  in  the  model  used  in  the  identification  algorithm.  Ideally,  a  single  generic  form  should  repre¬ 
sent  multidimensional  surfaces  of  arbitrary  shape.  These  functions  represent  total  aerodynamic  force  or 
moment  coefficients  as  functions  of  angle  of  attack,  angle  of  sideslip,  and  angular  rates. 

3.1  Problem  Definition 

The  dynamic  system  is  modeled  as  nx  first  order  nonlinear  differential  equations. 

x  «  f(x,  u,  w,  t,  9)  (11) 

having  an  output  measured  at  discrete  times  tk 

y(t^)  *  h  (x,  u,  t^,  6)  +  v  (t^)  (12) 

Here 

x  »  nx  component  state 

u  ■  nu  component  inputs  measured  without  error 
6  *  nth  component  unknown  parameter  vector 

w  is  a  nw  component  random  input  (process  noise  source)  having  statistics 


E  (v)  •  0  (13) 

B  [w  (t^)  vT(tj)l  ■  Q  (t^)  (14) 

The  scalar  v  (tk)  ■  vk  is  a  random  measurement  error  having  the  stetistica 

B  (vk)  -0  (15) 

E  <v’)  -  r  (16) 


Note  that  the  assumption  of  scalar  measurements  does  not  cause  a  great  loee  of  generality.  This  formu¬ 
lation  can  accomodate  multiple  sensors  simply  by  assuming  that  the  interval  between  meeturemente  ie 


sometimes  very  small.  The  only  loss  of  generality  regards  the  representation  of  correlation  of  measure¬ 
ment  error  between  sensors. 

If  we  assume  that  the  stochastic  quantities  have  normal  distributions  then  the  joint  density  or 
likelihood  function  of  a  sample  of  nt  measurements  y  (tg)  *  yjt  is 


An  extended  Kalman  filter  [17]  can  generate  both  the  measurement  estimates  yg  and  the  measurement 
uncertainties  0g.  The  estimate  of  6  having  the  smallest  variance  is  the  one  which  maximize#  j?(y; 
0)  with  respect  to  8.  The  maximization  of  !£  is  equivalent  to  the  minimization  of  the  negative  log 
likelihood  function  given  by 

-log  Se(y_  ;  £>  •»  (18) 

1  nt  2  ,  . 

\  s  |[yk  *  y  <£>]  !\  <8>  +  2  log  ok(8.)[ 

Note  that  if  the  Og  are  known,  then  the  minimization  of  -logif'  can  be  treated  as  a  nonlinear  least 
square  problem. 

Aspects  of  this  problem  addressed  here  are  the  following. 

(1)  The  minimization  of  Eq.  (18)  with  respect  to  £  requires  the  use  of  an  iterative  numerical 

procedure  similar  to  a  Newton  or  quasi-Newton  method.  If  the  Og  are  known,  then  the  most  effective 

procedure  is  that  of  Levenberg  and  Marquardt  [16,30].  A  drawback  to  the  Levenbetg-Marquardt  method  is 
the  requirement  for  the  evaluation  of  3y/3  8.  Direct  analog  finite  difference  method*  [32]  avoid  this 
problen  by  approximating  the  partial  derivative  by  a  finite  difference.  Section  3.2  extends  the  finite 
difference  analog  of  the  Levenberg-Marquardt  procedure  to  the  more  general  form  of  Eq.  (18). 

(2)  Validation  of  an  identified  model  should  include  the  determination  of  confidence  intervals  or 

variances  for  estimated  parameter*.  These  may  be  estimated  using  the  Cramer-Rao  bound,  which  state*  that 

R[(6  -  6  )  (6  -  0  )  ]  >  M  1  (19) 

where  H  is  the  information  matrix,  given  by 

lMlij  *  -[a2(i<>8  ^<2  ;  oJ/aoj  aoj  do) 

and  £*  is  the  true  parameter  value. 

Experienced  analysts  in  the  system  identification  field  are  aware  chat  the  Cr*m«r-R*o  bound  is  usu¬ 
ally  optimistic  (33).  That  is,  it  tends  to  predict  parameter  variances  which  are  very  small  in  compari¬ 
son  to  variances  observed  among  estimates  derived  tree  multiple  data  set*.  The  moat  easily  implemented 
expressions  for  the  infomation  matrix  aatum*  that  the  measuretwnt  errors  are  not  autocorrelated.  If 
autocorrelation  is  accounted  for,  than  the  Cramer-Rao  bound  more  realistically  approximate*  the  true 
parameter  variance.  Section  3.3  outlines  methods  hssed  on  principle*  of  generalized  least  square*  which 
automatically  account  for  first  order  autocorrelation  of  ncaeurement*.  Thia  method  produce*  parameter 
estimates  which  have  a  lower  variance  than  those  which  do  not  account  for  autocorrelation. 

(3)  Modeling  nonlinear  aerodynamic#  require*  the  representation  of  nonlinear  function*  of  several 
varlablta.  For  example,  pitch  moment  of  en  aircraft  is  s  nonlinear  function  of  angle  of  attack  a, 
angle  of  tideslip  8,  pitch  rat*  Q,  and  elevator  angle 

Cm  “  Cm(  *•  6  •  Q*  V 

Much  work  in  the  identification  of  the**  function*  ha*  been  bated  on  their  representation  a*  multidimen¬ 
sional  polynomial*  (34).  Thia  approach  it  affactiva  for  local  modtla.  A  local  model  ia  on*  that  i* 
valid  ovar  a  restricted  region  of  tht  flight  tnvtlope,  say  for  act  u  interval  of  10°.  Such  modal* 
often  uat  txpanaion*  for  function*  like  Ca  in  tha  itatts  of  degree  no  higher  then  two.  The  repre¬ 
sentation  of  a  global  modal  using  polynomial  expansion*  may  require  very  high  order  polynomial*.  The 
representation  of  a  pitch  moment  curve  for  on*  particular  aircraft  through  a  40°  angle  of  etteck 
region  require*  a  ninth  degree  polynomial  [35l»  Section  3.4  indicates  how  the  uee  of  local,  low  degree 
polynomial  modal*  lead*  very  naturally  to  a  aplint  formulation  for  a  global  modal. 

3.2  Derivative  Free  Minimisation  of  th*  negative  Log  Likelihood  Function 


Finding  th*  parameter  value*  8  which  minimize  tq.  (It)  requires  th*  use  of  a  numerical  optimiza¬ 
tion  schema.  Typically,  each  iteration  of  tha  algorithm  update*  8  aa 
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Vl  '  2n  + 

(22) 

where  68.  aatiafiea 

M66_  -  -g 

(23) 

g.  -  aiog^Ty,  9)  / 30  . 

(24) 

and  M  i»  defined  in  Eq.  (20). 

Differentiating  Eq.  (18)  give*  (35] 


"iJ 


*i  ■  -  £l}tv*,n  Mi}  ■ }  vk(3a£  /3ei)/ok 


nt 

t 

k*l  | 


-  t  {  (3vk/30  .)Ovk/36  j)/o^  [-Ovk/3e.)(3o*  730^) 


where 


30^)00*  /38. )  ♦vk(3o|[  /38  . )Ook  /36  ^) 

♦°ko\  /ae^ej)  -vk  O2ok/3o.3o3) )  vk/ok 

-  i  (*>k  /m^Oo2  /aa^/oj  ♦  i  (j2  0J  /so*  ae3>  /  o[ 
\  •  yk  *  yk  (0i- 


(25) 


(26) 


(27) 


Following  the  *pirit  of  the  bevenberg-Marquerdt  Method,  we  ximplify  the  expreaeion  for  the  Heieian  by 
dropping  the  tern*  proportional  to  vk.  If  the  Model  fit*  the  data  well,  vk  ahould  approach  tero 
near  convergence.  With  the  exception  of  the  leat  tero  of  Eq.  (26),  it  ia  now  poaaible  to  evaluate  both 

gradient  and  Heaaian  if  only  the  firat-ordtr  aenaitivitie*  and  o*  to  changea  in  a  are  known. 


St  ia  poaaible  to  derive  analytically  ordinary  differential  equation*  for  3vk/30.  and  so^/so*. 

When  the  plant  dynawic  equation*  are  linear,  the**  differential  aanaitivity  equation*  (alao  called  "aan- 
•itiviey  aquation*1*)  have  a  particularly  tinpla  fora  (36).  When  th*  plant  dynanic*  are  nonlinear,  how¬ 
ever,  a  ouch  note  practical  Method  ia  to  approximate  the  partial  derivative*  with  finite  difference*. 

JV*8i  V’  •V  *  V-**  ^  S>il  m) 

or 

»vk  •  £*)  -  vk(0-tt)]  /  2  |  £t  [  (29) 

where  (jt  *)j  •  c<tj. 

Partial  derivative*  of  o2(g)  are  approximated  aiailarly. 

The  "direct  analog"  (321  type  of  optiaiaatio*  algorithaa  uae  th*  finite  difference  approximation* 
to  th*  partial  derivative*  a*  direct  tubatitute*  for  th*  partial  derivative*.  The**  algorithm*  have 
been  atwdied  carefully  for  th*  aolution  of  th*  nonlinear  leaat  a qua  re  problem  and  have  been  found  to 
have  convergence  propertied  nearly  identical  to  algorithaa  which  ucc  analytic  expreaeion*  for  the  aenai- 
tivitiea. 

Alternate  aethnda  for  ainiaitation  of  tha  negative  tog  likelihood  function  without  evaluating  de¬ 
rivative*  ua*  algorithm*  which  ate  not  direct  analog*  of  darivativa  aethodt.  w*  have  taatad  on*  of 
the**  method*  (37|  in  a  nonlinear  ayattm  identification  algorithm  and  found  it  to  ba  not  aa  affective  a* 
tha  direct  analog  mat hod. 

I*  practical  implementation*  of  tha  fiait*  difference  amthod  for  the  aolution  of  ayatea  identifica¬ 
tion  problaaa,  a*  typically  aiaultanaoualy  aolv*  a  eat  of  nx  nominal  atat*  aquation*  together  with  nth 
aeta  of  nx  perturbed  parameter  atat*  aquation*.  Thit  i*  required  in  otder  to  iap lament  tha  ona-aided 
appremimation  to  th*  aanaitivity  given  in  tq.  (2g). 

tha  laat  tan  of  Kq.  (2b)  cannot  be  eliminated  by  aaauming  that  vk  ia  email  near  tha  minimum, 
the  ten*  cannot  ha  com  true  (ad  froa  firat  order  partial  derivative*  of  v  and  o1,  It  can  ba 
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estimated,  however,  using  methods  similar  to  those  employed  in  solving  large  residual  nonlinear  least 
square  problems  [38].  This  topic  will  not  be  treated  in  further  detail  here. 

3.3  Autocorrelated  Measurement  Errors 

It  is  desireable  to  determine  the  expected  accuracy  of  the  parameters  estimated  from  flight  data. 
The  usual  method  for  doing  this  is  to  compute  a  parameter  covariance  matrix  as  the  inverse  of  the  Fisher 
information  matrix  M. 

Methods  of  generalized  least  squares  [39]  indicate  expressions  for  parameter  estimation  covariance 
using  the  assumption  that  measurement  errors  are  autocorrelated.  This  autocorrelated  process  is  the 
output  of  a  first  order  difference  equation  driven  by  white  noise.  Not  only  parameter  covariances  but 
also  parameter  estimates  themselves  are  altered  by  the  autocorrelation  assumption.  An  estimation  algo¬ 
rithm  which  does  not  explicitly  account  for  the  measurement  error  autocorrelation  will  still  produce  un¬ 
biased  parameter  estimates.  However  the  actual  variance  of  such  estimates,  as  opposed  to  the  Cramer-Rao 
predicted  variance,  will  be  higher  than  those  produced  by  an  algorithm  which  does  explicitly  account  for 
autocorrelation. 

We  consider  here  only  the  output  error  case  (no  process  noise)  of  the  maximum  likelihood  estimator 
for  dynamic  systems.  The  estimated  measurements  y  are  functions  of  the  unknown  parameter  set  6.  The 
information  matrix  M  is  given  by 

M  -  1_  .T  (30) 

o2 

where  J  is  the  matrix  of  sensitivities 

(Jlw“ "  3y(l*tk)/aea  (31) 

for  the  case  of  white  measurement  noise.  Iterations  of  the  identification  algorithm  solve 

HA0_  -  -&  (32) 


i  ”  K  JT  1  (33) 

o 

\  -  •  y(tk> 

The  covariance  of  the  measurement  error  is  a  diagonal  matrix 

E( v  vT)  *  o2  I  (34) 

where  v^  *  v(t^)  ,  the  measurement  error  at  t^.  (35) 

Now  suppose  that  the  measuresMnt  errors  have  a  nondiagonal  covariance  matrix  of 

B(v  vT)  *  o2  V  (36) 

For  a  purely  linear  estimation  problem,  (i.e.  y  *  J0  +  v),  the  "generalised  least  squares"  estimate 
of  £  satisfies  [39] 

UVXJ)  0  -  jV1*  (37) 

The  covariance  of  tha  parameter  estimates  is 

E  [(8  -  0*)(O  -  8*)T]  •  o2(jVXJ)*1  •  Var<8)  (38) 

Any  other  linear,  unbiased  estimator  has  a  covariance  matrix  which  exceeds  that  given  in  Eq.  (38). 

If  the  noise  vector  v  is  generated  by  a  first  order  autoregressive  process 

v<tfc)  -  w(tk_j)  ♦  *k  (39) 

where  ck  is  a  aero  mean,  constant  variance  process,  then  V  has  the  form 


1 

0 

PJ 

«  •  • 

n-1 

P 

p 

1 

0 

P2 

0 

1 

p"-j 

e 

e 

„n-l 

„0-2 

„n-3 

• 

P 

P 

P 
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The  inverse  of  V  is  V-1  ■  P  T  P  where 


The  generalized  least  square  estimator  for  6  can  be  easily  implemented  by  writing  Eq.  (37)  as 

{(P  J)T(P  J)]  6  -  (P  J)T  P  2  (42) 

The  multiplications  P  J  and  P  £  are  simple  because  P  is  sparce. 

For  application  to  the  output  error  system  identification  problem,  a  nonlinear  least  square  prob¬ 
lem,  Eq.  (42)  is  applied  to  parameter  variations  on  each  iteration  of  a  successive  approximation  al¬ 
gorithm. 

(P  J)  T  (P  J)A8_  »  (P  J)  T  P  -  £  (e)  1  (43) 

If  P  is  unknown,  then  it  c«n  be  estimated  using 


nt  -  1 

*  (th)  ,  , 

k“l  .  nt  -  1 


<v 


nt  -  nth 


Each  iteration  calculates  a  A6_  value  using  P  evaluated  by  Eq.  (44),  with  v  (t^)  calculated 
from  J  at  the  end  of  the  previous  iteration. 

If  the  measurement  noise,  o2,  is  unknown,  it  can  be  estimated  using 


o2  - - J- 


(Pv)  (Pv)  . 


nt  -  nth 

The  covariance  of  the  parameter  estimates  is 


V*r<8)  - - i - (Pv)t(Pv)(<PJ)T(?J)f 1 

nt  -  nth 


F  -  P/1  -  P2 


3.4  Spline  Model  Structure 


The  detensicetion  of  a  nonlinear,  quasistatic  aerodynamic  (or  hydrodynamic)  model  require*  defini¬ 
tion  of  a  coefficient  function  having  a  general  form 

C  •  C(o,  8,  J»\  «,  Ft,  M)  (4g) 


where  a  end  6  err  -eletive  flow  englee,  u'  it  e  diaentionlets  enguler  rete  vector,  £  it  e  con¬ 
trol  vector,  end  »,,,  Fr  ,  end  N  ere  the  dimensionless  numbers  of  Reynolds,  Froude,  end  Mech. 
The  model  structu. -  detensinetion  problem  for  identif icetion  of  eerodynemic  models  usuelly  refers  to  the 
problem  of  determining  e  methemeticel  form  for  this  mult ivsrieble  function. 

Spline  functions  ere  effective  weys  to  represent  these  coefficient  functions.  A  one  dimensionel 
spline  function  it  e  piecewise  polynomiel  function  heving  certein  continuity  conditions  between  pieces. 
Figure  4  illustrstes  e  one  dimensionel  cubic  spline.  C(a)  might  represent  pitch  moment  es  e  function 
of  tngle  of  attack.  C(o)  here  is  e  cubic  polynomiel  on  eech  of  the  three  regions  indiceted.  The  func¬ 
tion  is  everywhere  continuous  end  hes  continuous  first  end  second  derivetives.  The  points  Oj,  cij, 
03,  04  ere  celled  the  knots  of  the  spline. 

The  spline  function  hes  severel  properties  which  meke  it  en  effective  interpoleting  function  (40). 

(1)  The  spline  in  Figure  4,  for  extmple,  is  uniquely  determined  once  the  velues  of  the  function  et 
the  four  knots  ere  known  end  certein  end  conditions  ere  specified. 

(2)  The  shepe  of  the  interpoleting  function  is  not  overly  sensitive  to  the  function  velues  et  the 
knots.  Smell  chenges  in  these  velues  do  not  ceuse  overly  lerge  chsnges  in  interpoleted  function  velues 
between  knots. 

(3)  The  interpoleting  function  C(a)  hes  en  optimel  smoothness  property.  It  is  the  unique  func¬ 
tion  which  interpoletes  the  specific  velues  et  the  knots,  hes  the  continuity  conditions  listed  sbove, 

end  hes  the  minimum  meen  squere  curveture. 

Spline  function  representetion  of  nonlineer  eerodynemic  or  hydrodynsmic  coefficient  functions  msy 
be  resdily  identified  using  either  mexismim  likelihood,  equetion  error  or  output  error  techniques.  The 
enslyst  mutt  specify  the  number  of  knots  end  their  locetions.  The  peremeters  to  be  identified  ere  then 
the  function  veluei  et  the  knots.  The  identificetion  of  the  C(d)  curve  in  Figure  4  would  require  the 
estiststion  of  four  peremeters. 

In  Figure  4,  the  coefficients  c1  -  C4  ere  the  function  velues  st  the  knot  locetions  - 

These  coefficients  sre  the  psrsswtere  which  will  be  estimated  by  the  identificetion  elgorithm. 
The  piecewise  cubic  polynomials  Kg|  -  kgq  provide  cubic  interpolation  of  Cj  -  C4  tor  <s  in 

the  range  (uj,  04!  and  linear  extrapolation  for  a  outside  this  range- 

Each  of  ths  piecewise  cubic  polynomials  is  defined  over  the  entire  rang*  of  X»  ).  The 

function  C(«)  is  s  linear  combination  of  the  Kgj  bests  functions.  The  definitions  of  ere 

such  that 


‘V  *jo 


l*i 

ifl 


This  makes  the  coefficients  in  the  linear  combiner i»n  equal  to  the  Cj  values.  The  array  in  Figure  4 
defines  the  Kgj  functions  over  the  five  a  regions. 

The  identificetion  of  epline  functions  is  west  effective  when  used  with  derivative  free  methods  to 
minimise  the  negetive  log  likelihood  function.  Such  methods  do  not  require  the  explicit  calculation  of 
the  sensitivity  of  the  spline  function  to  changes  in  the  parameters  which  define  the  spline.  The  only 
requirement  it  for  the  evelustlen  of  the  spline  coefficients  given  function  values  st  the  knots,  snd  for 
the  evaluation  of  the  function  at  intermediate  points  given  the  spline  coefficients.  Eeeh  iteration  of 
the  "direct  analog"  method  requires  the  evaluation  of  the  innovations  (tg)  for  nominal  and  for  per¬ 
turbed  pareewter  valuas, 

Kathode  exist  for  the  u*e  of  mvltidtsmeioe.il  spline  functions  to  represent  smooth  surfaces  (411. 
Intermediate  methods  ate  also  useful.  An  i  »f**diet«  method  represents  variation  of  t  function  in  one 
dimension  with  s  spline  function  and  representation  in  other  dimension*  with  other  types  of  functions, 
such  as  low  order  polynomials. 


4.  EXAMPLE  AFPLlCnTlOk 


The  results  presented  in  this  paper  demonstrate  the  operational  status  of  nonlinear  system  identi¬ 
fication  data  processing  techniques.  Aerodynamic,  ••stilled  thrust,  end  flight  teat  instrumentation 
calibration  models  ere  identified  for  the  F'-iS  from  maneuvers  which  encompassed  s  large  range  in 
angle  of  attack,  sideslip,  airspeed,  control  inputs,  and  bony  rotation  rates.  The  capability  for  iden¬ 
tifying  nonlinear  aerodynamic  models  in  a  format  compatible  with  preflight  predictions  is  demonstrated. 
A  methodology  for  detetxtiftiag  the  accuracy  of  the  parameter*  estimates  ie  presented. 

4.1  F-iS  Flight  That  Program  Overview 

The  identification  results  presented  in  this  section  are  hexed  on  a  flight  test  program  planned 
specifically  for  the  collection  of  dele  f  »r  notification.  The  sola  objective  et  the  flight  test  pro- 
grim  v»«  to  gs nerate  test  data  which  toutl  he  used  to  develop  and  validate  nonlinear  system  identifica¬ 
tion  analysis  techniques.  The  Havel  Air  Yatt  Center  (KATC)  F-4S  aircraft  (to no  2*4),  so  F-4J  which  has 
be*4  modified  to  include  a  tmcwweriag  fl*p/*lst  system,  was  the  teat  aircraft. 

The  airborne  data  acquisition  systems  used  for  the  flight  text  program  included  a  2-axis  rate  gyro, 
a  vertical  gyro,  a  directional  gyro,  engine  tW  and  fuel  flow  a*storeme«ts,  a  Vasia  linear  accelerom¬ 
eter,  control  surface  deflection  msssuremsrta  Ju  a  teat  sirdata  system.  The  sirdar  a  system  uses  a 
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noaeboom  which  hat  a  pitot-static  head  for  the  measureawnt  of  impact  pressure,  static  pressure  and  ten* 
perature.  The  noaeboom  alto  haa  vanei-  for  measurements  of  angle-uf-attack  and  sideslip. 


The  flight  teat  prograa  waa  conducted  in  a  clean  configuration  with  the  throttles  fixed  for  each 
maneuver  and  with  flight  near  and  beyond  stall.  Teat  conditions  were  generally  initiated  from  wing- 
level,  constant  altitude  flight  with  &  ■  10°  and  M  '•  .6.  For  some  test  conditions,  the  pilot  ap¬ 
plied  variable  aft  stick  to  naneuver  the  aircraft  through  the  desired  angle-of-attack  test  range.  For 
some  of  the  stall  entry  conditions  lateral  stick  and/or  pedal  doublets  were  combined  with  the  longitudi¬ 
nal  stick  cooaund.  Other  maneuvers  included  single  axis  and  multi-axis  sequenced  doublet  ;nput*.  These 
inputs  were  made  by  the  pilot  and  were  not  intended  to  be  repeatable  nor  precise  with  regard  to  their 
spectral  characteristics,  but  were  generally  effective  in  creating  large-amplitude  motions.  The  overall 
teat  goal  waa  to  force  the  aircraft  through  a  broad  range  of  test  conditions.  The  level  of  excitation 
of  primary  teat  variablea  achieved  during  the  F-4S  flight  teat  program  is  sumarized  as  follows: 


• 

Angle  of  attack: 

-1° 

<  a  <  40° 

• 

Sidcalip: 

lei 

<  18° 

• 

Mach  No: 

H 

<e  6 

• 

Rotational  Sates: 

1*1 

<  90°/S, 

• 

Full  amplitude  control 

inputs 

I Qt  *  20°/S, 


i Rj  <  25°/S 


4.2  Modeling  Approach 

The  approach  selected  for  modeling  nonlinear  aerodynamic  characteristics  produces  system  identifi¬ 
cation  results  that  can  be  used  to  validate  preflight  estimated  aerodynamic  models.  The  models  are  used 
for  flight  simulator*  and  for  making  prediction*  of  aircraft  performance,  stability  and  control  charac¬ 
teristics.  Theae  aerodynamic  model*  must  account  for  the  effect  of  a  nuwber  of  flight  condition  and  air¬ 
craft  configuration  variablea.  The  “art”  in  formulating  the  models  is  to  represent  the  total  aerodyna¬ 
mic  coefficient  by  an  incrcstental  buildup,  with  each  increment  described  by  one  or  two  independent  vari¬ 
ables*  This  process  is  illustrated  by  the  following  example  for  the  rolling  moment  coefficient  equation, 

(1)  Selact  the  independent  variablea t 

V  *.  *•  vv 


i.e.,  rolling  moment  coefficient  is  a  function  of  angle  of  attack,  sideslip,  roll  and  yaw  rate, 
rudder  sod  aileron  position. 


(!)  Partition  independent  variables  into  reasonable  groups: 


C,  *  dC, 


•  -4C 


4C. 


SIWE5UP  SrtMAMte  VfHJER 


t 

Atunott 

())  Select  functional  relationships  for  each  group: 

iC,  *  f(o  ,  S) 

sioesup 


•  Ct  Co)  •  A 


ie.  •  i(o,  r>  •  fie,  «) 

OTXAMIC 

*  e.  (*><n  n  v_)  ♦  c,  («hw>  n  v  > 

r  w  "  »  *  1 


fit,  •  1(0.  «,) 

HSCMI  * 


•  C,  («)  * 

*-  ■ 


4C.  -  f< o,  4  ) 

AtUUOM 


-  C,  <«)  *  *. 
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For  this  model  formulation,  each  of  the  stability  derivatives,  (i.e.,  Cj^,  )  is  modeled  as  a  non¬ 
linear  function  of  angle  of  attack.  By  using  a  cubic  interpolation  spline,  as  described  in  Section  3.4, 
the  parameter  identification  algorithm  solves  for  Cn  at  specific  values  of  angle  of  attack  (i.e.,  the 
knots  of  the  spline).  This  procedure  is  illustrated  In  Figure  5  which  shows  the  identified  variation  of 
C  with  angle  of  attack.  For  the  F-4S  parameter  identification  study,  C&,  was  identified  for  cx  » 
5°,  15°,  25°,  and  35°.  The  lower  four  parts  of  Figure  4  illustrate  the  variation  of  the  inter¬ 
polation  splines  with  angle  of  attack. 

Because  these  interpolation  splines  are  scaled  by  the  appropriate  value  of  C  (i.e.,  the  a  m  5° 
spline  is  scaled  by  the  value  of  for  a  •  5°),  the  summation  of  the  four  interpolation  splines  de¬ 

fines  the  value  of  Cgg  for  any  value  of  angle  of  attack.  It  should  also  be  noted  that  each  interpola¬ 
tion  spline  has  the  value  of  when  a  equals  its  knot  value,  and  it  is  zero  for  other  knot  values. 

The  spline  formulation  is  suitable  also  for  representing  installed  propulsion  system  performance 
models  and  test  instrumentation  calibration  factors.  As  demonstrated  in  the  next  section,  more  compli¬ 
cated  models  can  be  represented  by  a  bicubic  spline  formulation. 

4.3  System  Identification  Results 

The  results  presented  in  this  section  demonstrate  the  capability  for  identifying  flight  test  in¬ 
strumentation  calibration  factors,  performance  data  including  a  model  for  net  thrust,  and  stability  and 
control  characteristics  from  a  common  set  of  flight  test  data. 

The  aerodynamic  models  are  generally  represented  by  a  cubic  spline.  The  aerodynamic  parameters  are 
identified  at  four  specific  values  of  angle  of  attack  (i.e.,  a  “  5°,  15°,  25°,  35°).  The  aerody¬ 
namic  estimates  are  expected  to  be  most  accurate  in  the  10°  <  a  <  30°  range  due  to  the  density  of 
test  data  in  this  range  (see  Figure  6  which  crossplots  a  vs  g  for  every  quarter  of  a  second  for  the 
six  maneuvers  analyzed). 

A  common  format  is  used  to  present  the  identified  models.  The  estimates  are  defined  as  a  solid 
line.  When  cubic  splines  arc  used  to  represent  the  parameter,  the  solid  line  represents  the  cubic 
spline  interpolation  between  each  of  the  four  knots.  The  identified  parameters  are  noted  with  a  solid 
circle  symbol.  The  parameter  estimation  uncertainty  is  shown  by  dashed  lines  on  either  side  of  the  es¬ 
timate.  The  distance  between  the  solid  line  and  the  dashed  lines  represents  the  20  uncertainty  of  the 
estimate.  Preflight  predictions  are  illustrated  by  a  solid  triangle  symbol. 

The  validity  of  the  parameter  estimates  can  be  established  from  three  different  considerations. 

(1)  Engineering  judgement:  are  the  estimates  reasonable  from  the  point  of  view  of  general  agreement 

with  preflight  predictions? 

(2)  Estimation  uncertainty:  what  is  the  magnitude  of  the  +  20  bands  about  the  estimate? 

(3)  Prediction  accuracy:  How  well  does  the  identified  model  predict  flight  test  measurements  for  many 

test  maneuvers?  Does  the  identified  model  predict  these  measurements  better  than  a  model  based  on 

preflight  parameters? 

4.3.1  Flight  Test  Sensor  Calibration 

Figure  7  summarizes  the  types  of  error  terms  considered  for  the  accelerometers,  rate  gyros,  verti¬ 
cal  gyros,  a/0  vanes,  and  the  pitot-static  impact  pressure  measurements.  These  errors  represent  system 
level  errors  in  that  the  contributions  from  separate  sources  (i.e.,  PCM  cslibretion,  sensor  installation 
and  sensor  performance)  generally  cannot  be  identified.  For  example  an  identified  scale  factor  error 
for  an  alpha  vane  could  be  due  to  upwash  and/or  PCM  calibration  errors.  On  the  other  hand,  initial 
errors  for  the  vertical  gyro,  which  vary  with  test  condition,  «re  probably  due  to  vertirality  error* 
resulting  from  the  erection  circuit.  The  identified  instrumentation  calibration  models  for  the  NATC 
F-4S  are  presented  in  Figure  8.  All  instruments  required  some  correction  and  the  roll  rate  gyro  signal 
was  found  to  be  unusable.  As  a  result,  a  roll  rate  signal  had  to  be  reconstructed  from  vertical  and 
directional  gyro  measurements. 

4.3.2  Aerodynamic  and  Thrust  Models 

r igures  9  through  12  present  F-4S  parameter  identification  results  that  illustrate  the  extraction 
of  patameters  l'or  both  thrust  and  aerodynamic  model*  from  a  common  set  of  flight  test  data. 

Figure  9  illustrate-  the  identified  model  for  net  thrust  and  the  dreg  polar  which  has  been  recon¬ 
structed  from  models  of  Cq  end  Cl  as  a  function  of  angle  of  attack.  The  model  for  net  thrust  was 
defined  in  -erma  of  a  linear  relationship  betveeen  corrected  net  thrust  and  corrected  fuel  flow.  The 
difference  between  che  identified  nnd  preflight  model*  for  corrected  net  thruat  ia  a  bias  in  fuel  flow 
of  38  lbs/hra  baaed  on  a  pressure  altitude  of  hp  ■  30,000  ft.  The  accuracy  of  the  flight  teat  fuel 
flow  senaor  i*  ISO  lbs/hr.  The  identified  drag  polar  matches  the  preflight  polar  shape  in  term*  of 

CdM1N*  <Ct/cD>MAX  ,nd 

Figure  10  present*  stablicer,  rudder  end  lateral  control  power  estimates  as  function*  of  angle  of 
attack.  The  flight  derived  control  power  estimates  show  good  agreement  with  preflight  decs  in  terms  of 
trends  with  angle  of  attack  with  slight  differences  in  the  actual  value  of  the  derivative.  The  uncer¬ 
tainty  of  th*  lateral  convro'  derivative*  (C*  end  CL  )  a*  shown  by  the  fanning  of  the  +2o  curve*  it 

■  A 

due  to  the  leek  of  late, cl  control  excitation  at  lov  and  high  a.  In  ganeral,  when  the  uncertainty 
boundaries  are  larg*  for  acne  part*  of  tha  flight  regime,  this  information  can  be  used  for  planning 
additional  flight  test  conditions. 
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Figure  11  presents  the  identified  variation  of  -saving  moment  coefficient  (Cn)  with  angle  of 
attack  and  sideslip.  Values  for  C„  and  C„  are  obtained  from  the  C„  model  at  8  ■  0°.  Cn 
vaa  modeled  by  a  bicubic  spline  with  specific  values  identified  for  ■*  “  3°,  15°,  25°,  35°  and  8  * 
6°,  12°  and  18°  (Cn  "  0  for  *  0).  A  bicubic  spline  was  selected  for  the  Cn  acdel  due  to  the 
expected  nonlinear  variation  with  angle  of  attack  and  sideslip. 

Identified  values  for  roll  rate  and  yaw  rate  stability  derivatives  for  the  F-4S  are  presented  in 
Figure  12.  Good  agreement  is  evident  between  preflight  prediction  and  identified  values  for  Cn  snd 
Cjp  (for  a  <  30  ).  The  identified  value  for  Cn  is  much  greater  than  the  preflight  prediction, 
although  both  are  independent  of  angle  of  attack.  Significant  differences  exist  between  identified  and 
predicted  values  of  C£r  for  a  >  20°. 

4.3.3  Flight  Measurement  Predictions 

The  ability  of  the  identified  aerodynamic  and  net  thrust  models  to  reproduce  flight  Cest  measure¬ 
ments  for  the  six  test  maneuvers  is  tabulated  in  Figure  13.  Figure  13  summarises  the  root  mean  square 
reaiduale  between  the  flight  test  measurements  and  estimates  of  the  measurements.  The  predicted  meas¬ 
urements  sre  generated  from  a  simulation  of  the  F-4S  baaed  on  the  identified  aerodynamic,  thrust,  and 

instrumentation  models.  The  simulation  is  run  with  actual  flight  test  time  histories  of  the  control 

commands. 

To  gain  some  feeling  for  the  relative  worth  of  the  identified  model,  the  F-4S  simulation  was  con¬ 
figured  with  the  preflighr  leroJynamic  data  and  run  with  the  flight  teat  control  inputs.  The  resulting 

measurement  residuals  sre  also  presented  in  Figure  13.  In  ell  cates,  the  identified  model  provides  a 
better  explanation  of  the  flight  test  measurements.  Figure  14  compsrea  the  time  histories  of  the  flight 
test  measurements,  predictions  base:  on  the  identified  model  and  prediction*  baaed  on  the  preflighr 
laodel  for  one  of  the  flight  conditions  analyzed. 

5.  SUMMARY 

(1)  The  integrated  system  identification  technology  encompassea  teat  planning  and  on-line  data  consis¬ 
tency  testing,  as  well  as  data  processing. 

(2)  Effective  system  identification  data  processing  often  require*  independent  aiodel  structure  deter¬ 
mination,  parameter  ~  timation,  and  model  validation  steps. 

(3)  When  working  •  it  n  linear  systems,  the  sensitivity  functions  needed  in  order  to  apply  iterative 
estimation  algorithm*  can  aioat  effectively  be  calculated  using  finite  difference  approximations. 

(4)  Autocorrelation  of  measurement  errors  significantly  effects  parameter  estimation  covariance. 

(3)  Spline  functions  are  useful  for  the  representation  of  nonlinear  function*  during  the  identification 
process. 

(6)  Nonlinear  syatem  identification  data  processing  technique*  can  be  used  tn  identify  ecrodynaoic, 
propulsion  ayttem  and  intcruswntecion  calibration  model*  from  «  coaeson  set  of  flight  test  condition*. 

(7)  A  single  eerodynaaic/nat  thrust  math  swdel  i*  identified  for  the  F-4S  fro*  si*  different  flight 
conditions.  The  identified  aetheaiatical  model  predicts  the  sensor  Masovement*  fob  the  six  flight  con¬ 
ditions  which  encompass  a  large  range  in  angle  of  attack,  sideslip,  airspeed,  control  inputs  end  body 
rotation  rate*.  Aircraft  response  predictions  are  improved  with  the  identified  model  relative  to  pre¬ 
diction  based  on  a  preflight  model. 

(8)  Extraction  of  performance,  stability  end  control,  and  high  angle  of  attack' characteristics  from  a 
single  *w>d*l  het  been  illuatreted, 

(9)  The  capability  for  identifying  nonlinear  aerodynamic  mod*!*  in  a  format  compatible  with  prefligbt 
predictions  he*  been  demonstrated. 

(10)  Dynamic  teat  techniques,  which  require  nonlinear  system  identification  data  processing  techniques, 
car.  improve  test  productivity.  The  reeulte  presented  in  thie  paper  ate  based  on  less  than  300  second* 
of  flight  teat  tima. 
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Table  1:  System  Identification  Data  Processing  Methods 


HETIHO 

ADVANTAGES 

DISADVANTAGES 

PERFORMANCE  INDEX 

MINIMIZATION  ALGORITHM 
TYPICALLY  USEO 

EXAMPLES  OF 
APPLICATIONS 

Equation  Error 

*  Effective  In  presence 
of  process  noise 

•  Computational 
simplicity 

Sensitive  to 
measurement  errors 

i||f  -  f(x,u.e.t)j2 

linear  least  squares 
[U] 

aeronautics  [20] 

Output  Error 

•  Effective  In  presence 
of  measurement  errors 

Sensitive  to 
process  noise 

i(y-y(t.»)l2 

Gauss-Newton  [16] 

rail  vehicles  [21] 
qas  turbines  [4] 

Combined  State  and 
Parameter  Estimation 

•  Effective  In  presence 
of  both  measurement 
and  process  noise 

Computational 

conplexlty 

HOI 

Gauss-Newton  t'6] 
Kalman  filter  [17] 

aeronautics  [22] 

Table  2:  Numerical  Methods 
Used  in  System  Identifica¬ 
tion 


- - - - - - 

NUMERICAL  MATHEMATICAL  REQUIREMENT 

EFFECTIVE  METHOD  OF  APPROACH 

SOLUTION  OF  DIFFERENTIAL  EQUATIONS 

LINEAR 

TRANSITION  MATRIX  [23] 

NONLINEAR 

MULTISTEP  HETHOOS  (ADAMS- 
BAStlfORTH)  [  24  ] 

SOLUTION  0E  LINEAR  ALGEBRAIC  SYSTEMS 

OF  EQUATIONS 

POSITIVE  DEFINITE, 
SYMMETRIC 

CHOLESXY  [  25  1 

GENERAL 

GAUSSIAM  ELIMINATION  [  25] 

SOLUTION  OF  LEAST  SqUAHES  PROBLEMS 

LINEAR 

FACTORIZATION,  OR  SQUARE  ROOT 

HETHOOS  {  26] 

NONLINEAR 

GAUSS-NEWTON  [  V  ] 

MINIMIZATION  OF  GENENAL  NONLINEAR 

MULTIVARIABLE  TUNCTIONS 

QUASI -NEWTON  f2B  ] 

10 
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Figure  3i  Coeperieon  of  Standard  Deviation 
in  Faraaater  Eat lea tea  for  Optimal  Input 
and  Doublet  (Serna  Input  Energy) 


Figure  li  Die  Integrated  Syatem  Identification  Procedure 
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SUMMARY 

For  the  realization  of  an  optimal  filter  in  practice  quantitative  knowledge  of  the 
system  and  measurement  noise  is  necessary.  The  required  stochastic  parameters,  covariances 
power  spectra  and  in  the  case  of  coloured  noise  the  dynamic  model  of  the  shaping  filter 
usually  has  to  be  determined  from  measurements  which  precede  the  filter  design. 

This  paper  describes  a  technique  for  the  estimation  of  stationary  stochastic  noise 
data  from  real  measurements  which  contain  the  noise  as  well  as  non-observable  determini¬ 
stic  values.  The  separation  is  accomplished  by  a  specific  smoothing  procedure.  The  sto¬ 
chastic  behaviour  of  the  data  is  proven  by  a  Gaussian  probability  distribution  test. 

Then,  based  on  conventional  stochastic  relations  the  covariance  and  the  spectral  density 
are  evaluated. 

In  the  next  step  shaping  filters  are  determined  from  the  power  spectral  density  to 
model  coloured  noise.  The  filter  structure  is  selected  and  the  filter  parameters  are 
optimally  identified  by  a  least  squares  procedure. 

Finally  the  filter  state  is  augmented  by  the  degree  of  the  shaping  filter.  As  further 
evidence  examples  for  the  individual  steps  are  given. 

1 .  INTRODUCTION 

For  the  design  of  optimal  filters  the  dynamic  model  of  the  system  and  information 
about  the  expected  stochastic  system  disturbances  and  measurement  errors  are  required  [l]  . 

The  dynamic  system-model  is  generally  derived  without  major  problems  from  knowledge 
of  the  physical  background  of  the  system  or  from  measurement  of  the  system  transfer-func¬ 
tion. 


Much  more  difficult  is  the  determination  of  the  system  disturbances  and  measurement 
errors,  here  called  system-  and  measurement  noise.  They  have  to  be  provided  for  the  fil¬ 
ter  algorithms  in  the  form  of  stochastic  parameters!  covariances  and  power  spectral  densi¬ 
ty.  The  estimation  of  these  parameters  requires  theoretically  an  infinite  number  of 
measurements  of  a  stationary  stochastic  noise-process.  In  reality  only  time-limited 
measurements  exist  whioh  are  not  exactly  stationary  and  contain  additional  deterministic 
variables . 

The  results  of  the  parameter  estimation  are  approximations  which  are  usually  suffi¬ 
cient  if  some  limiting  conditions  (controllability  and  observability  of  the  system  and 
stability  of  the  designed  filter  [2j  )  are  guaranteed. 

Another  problem  is  the  requirement  of  Gaussian  distributed  and  uncorrelated  (white) 
noise  for  the  Kalman  filter  formulation.  In  reality  white  noise  does  not  exist-  Real 
noise  (for  example  gyro  drift  or  wind  speed)  is  correlated,  has  a  limited  frequency  band 
and  is  called  coloured  noise.  A  solution  of  this  problem  is  possiblo  by  the  so-called 
state  vector  augmentation  (3l  .  Coloured  Gaussian  noise  can  be  generated  by  a  linear 
shaping-filter  from  Gaussian  white  noise  (Fig.  1).  The  shaping  filter, usually  a  first 
or  second  order  linear  filter,  is  added  to  the  system  model  and  the  state  vector  of  the 


WHITE  NOISE 


LINEAR 

SHAPING 

FILTER 


COLOURED  NOISE 


Fig.  1  Shaping  Filter 


system  is  augmented  by  the  order  of  the  shaping-filter.  While  this  procedure  is  applic¬ 
able  to  the  system-noise  without  problems  the  treatment  of  the  measurement-noise  is,  at 
least  in  theory,  difficult.  If  the  measurement-noise  is  integrated  in  the  system  by  a 
shaping  filter  the  measurements  are  error-free  which  would  result  in  a  loss  of  stochastic 
observability  and  filter  stability. 

Exact  but  relatively  difficult  solutions  for  the  coloured  noise  are  given  in  [4], [2]. 
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In  reality  this  is  no  problem.  Coloured  measurement-noise  usually  contains  an  addi¬ 
tional  part  of  high  bandwidth  which  is  approximately  white  and  can  be  separated  from  the 
coloured  part.  While  the  coloured  part  is  modelled  by  a  shaping  filter  the  approximately 
white  part  serves  in  the  filter  algorithm  as  the  required  measurement-noise.  So  the 
application  of  coloured  system  and  measurement  noise  in  a  Kalman  filter  is  possible. 

In  the  following  sections  techniques  for  the  estimation  of  the  noise-parameters 
from  real  measurement  for  the  application  in  a  Kalman  filter  are  described.  In  particular 
the  evaluation  of  the  covariance,  power  spectrum  (for  continuous  filters)  and  of  shaping 
filters  are  treated.  These  methods  are  based  on  the  conventional  mathematical  relations 
for  linear  stochastic  processes  [2] ,  [sj  . 

2.  DETERMINATION  OF  STATIONARY  MEASUREMENTS 

The  large  number  of  measurements  which  is  required  for  the  determination  of  stochastic 
parameters  (e.g.  variance  or  power  spectral  density)  generally  is  not  derived  from  an 
ensemble  of  measurements  at  a  fixed  time.  Usually  the  time-variable  measurements  of  one 
ensemble  member  is  applied.  This  is  correct,  if  the  ergodic  hypothesis  is  valid  which  means 
that  this  member  is  representative  for  the  desired  stationary  stochastic  noise  process. 
Unfortunately,  such  representatives  measurements  are  rarely  available  in  practice.  In  most 
cases  non-observable  deterministic  signals  or  measurement  errors  (trends)  are  superimposed 
to  the  stochastic  noise.  For  example  the  Signal  of  a  doppler-radar  contains  the  determinis¬ 
tic  aircraft  velocity  as  well  as  the  stochastic  measurement  error  noise.  If  no  redundant 
measurements  are  available  a  separation  of  the  two  variables  is  only  possible  by  their 
different  frequency  behaviour.  The  resonance  frequencies  or  the  bandwidth  of  the  observed 
system  is  approximately  known  from  its  physical  background  (e.g.  phygoide  of  an  aircraft). 
Usually  this  frequency  lies  below  the  frequency  band  of  the  measurement  noise  so  that  a 
separation  by  a  filter  or  smoothing  procedure  is  possible.  Below  a  smoothing  procedure  is 
described  which  gives  an  excellent  frequency  separation. 

Smoothing  procedures  have  the  property  to  provide  off  line  a  smoothed  value  at  time 
t,  from  measurements  before  and  after  tj..  If  the  procedure  is  repeated  for  varying  t^ 
tne  result  is  a  smoothed  variable  which  consists  of  the  lower  frequency-parts  of  the 
measurements.  The  smoothing  procedure  is  characterized  by  the  following  steps: 

-  From  the  measurements  y (t.  +  y)  the  data  within  a  data-window  of  the  length 

2V_Jt„  2)  are  selected. 

max 

-  The  measurements  are  approximated  by  a  second  order  polynomial 

y  (t^m  -  •o(ti)  +a1(t1)P  +«2(ti)  tf2/2  (1) 

-  The  differences: 

ylyii)  -  yiyp)  'dyiyiri 

are  weighted  with  a  weighting-function  g(lVM)  which  it  symmetrical  to  t^Pig.  2) 


Pig.  2 

Measurements  y  (-v^-  l 
Approximation  y (-~  -  - ) 
Weighting  function  g ( |  y  j ) 


-  The  constants  a  (t.J,  a.(t.)  and  a_(t.)  are  estimated  by  the  method  of 

least  squares.  1  1  A 

-  As  shown  later,  the  cut-off  frequency  of  the  smoothing  procedure  is  defined  by 
the  length  of  the  data  window  2  y  and  its  transfer  function  by  the  weight¬ 
ing-function  g  (  y  ) . 

-  If  the  data  window  is  shifted  along  the  time  scale,  the  smoothed  data  ate  given 
by  «G with  variable  i. 

In  detail,  the  evaluation  of  tha  polynomial  coefficients  and  of  the  transfer  func¬ 
tion  runs  as  follows: 


The  differences,  Eq . ( 1 )  are  weighted,  squared  and  integrated  from  -  y W!1V  to  ♦ V 


J^g<V>2  “  **t»*w)2  ** 


With  Eq.  (1),  differentiation  of /and  setting  the  differential  to  0  for  the  mini  hub  of  4 
results  ini 
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k  »  0,  1,  2. 

With  Eq.  (1)  thla  yields: 

. a  Xa,#a  |  ^2+k 

|  g(p>2  yky<V‘,,d*'  -  J  g(U)2(s0(ti)Wk+«1{ti)y  ?  J** 

■•U*! 


Uneven  powers  of  V  disappear  on  the  right  side  of  Eq.  (3)  during  the  integration 
and  the  following  relation  remains: 
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(4) 


The  matrix  on  the  right  hand  cide  is  non-singular  and  can  be  inverted  for  the  estimation 
of  a^iti).  Usually  only  aQ(ti)  is  of  interest,  sometimes  the  first  (a^)  or  second  (a2) 
derivative  is  required  too. 


for  the  determination  of  the  cut-off  frequency  and  transfer  function  niu)  of  the 
smoothing  procedure,  Eq  (4),  is  Fourier-transformed.  The  convolution  of  the  left  hand 
sind  of  Eq.  (4)  changes  into  the  product  of  two  Fourier-transformed  integrals.  The  mat¬ 
rix  of  the  right  hand  side  consists  of  constants  which  remain  unchanged  during  the 
Fourier-tranaformation.  Tho  result  ia  equation  (5): 
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The  desired  transfer  functions:  '  “** 
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(7) 


are  determined  by  solution  of  Eq.  (5). 

The  transfer  functions  depend  exclusively  on  the  weighting  function. gi  p)2  has  to  be 
chosen  such  that  the  smoothing  procedure  approximates  an  Ideal  low  pass  as  closely  as 
possible.  Therefore  In  the  following  section  different  weighting  functions  ere  examined. 


3.  WEIOWfUJC  PSjSCTfONS 


The  sleplest  weighting  function  is  a  rectangular  function  g0(P>  •  3.  The 
meeeurements  within  a  data -window  ere  weighted  equally  (Fig.  3). 


Fig.  3  Weighting  functions 
g*(  P)2  end  g2t  p»2 


The  amount  of  the  corresponding  transfer- function  70  (i<*>  )  ae  Eq.  (7)  is  shown  in 
Fig.  (4).  The  function  decreases  down  to  a  frequency  of  0.5/tf  max  very  rapidly  which 
results  in  a  sharp  frequency  cut-off.  For  increasing  frequencies  the  function  oscillates 
which  means  that  the  smoothed  signal  still  contains  parts  of  higher  frequencies.  To  elimi¬ 
nate  this  disadvantage  two  different  weighting-functions  are  tested.  These  functions  are 
used  in  [SJ  for  the  smoothing  of  power  spectral  densities. 
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The  amount  of  the  corresponding  transfer-functions  ^  (i^)^  t  7^  (iu)aQ 


Eq.  (8),  (9) 
is  shown  in  Fig.  (4) : 


7  1*  displayed  dependent  on  the  relative  frequency  f  «.»  whore  Z  p  the 

width  of  the  smoothing-function.  The  decrease  of  <f  ,  and  is  slighte?“h»«  of  T, 

but  the  oscillations  are  much  smaller,  especially  for  the  weighting-function  g,(  P  ). 


In  the  logarithmic  diagrams  of  Fig.  (5)  these  properties  are  shown  more  distinctive. 
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Amount  of  transfer  functions 
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For  comparison,  also  low-pass  filters  of  first  and  second  order  are  displayed.  Only 
with  the  weighting  function  g->(  y>  the  smoothing  procedure  yields  a  much  better  frequency 
separation  than  a  second  order  '>'■ indpass-fi] ter .  In  this  case  it  is  a  good  approximation  of 
an  ideal  low-pass.  Therefore  cue  weighting  function  g2  (  )  is  recommended  for  application 

in  the  described  smoothing  procedure.  The  cut-off  frequency  is  /  =  1/i£max  (5)).  The 

smoothing  procedure  now  separates  deterministic  o t  stochv.tic  signals  v(t j)  =  aQ (ti) 
from  stochastic  noise  (Ay  (t^))  which  If  approximately  stationary.  4 

4.  TEST  OF  GAUSSIAN  DISTRIBUTION 

Prior  to  the  estimation  of  stochastic  parameters  from  the  stochastic  noise  a  Gaussian 
probability  distribution  test  is  performed  to  prove  the  stochastic  behaviour  of  the  noise. 

Fig.  6  shows  an  example  of  this  test,  (probability  distribution  of  stochastic  gyro- 
drift  data  which  result  from  in-flight  measurements) .  The  criterion  is  atf^-test  and  a 
straight  line  in  the  Gaussian  probability-distribution  paper. 


•ururiaKEITSYCRTl.ii.iWS  Y3N  i33  hesjw.RTE:;: 


K  L  *  S  $  E  tUCuriGKEIT  STRICHLISTE 

*iR.  YON  f>!S  Elrs.  SUtiKE  I 


0  •UNCNOUCH  -1.174**7328.;.  *1 

1  *1.17*8673282.  »i  -*.*u5:>4»6iS.  *9 

2  -8.8115349619.  »3  -5.97433664G9.  *0 

3  -$.874334*409.  *0  -2.S3?i683204.  .0 

4  *2.9371683204.  .0  1.1641S32183.-10 

9  1.14415321R3.-13  2.937168329*.  *0 

6  2.93714*3206.  .3  $.174336*412.  .0 

7  $.*743366412.  *0  *.*i;S0*9*17,  -c 

*  «.*11$349*17.  »0  1.174*67328.  <1 

*  1,17486732*2.  *1  .UNENOUCH 


I 

0  C  | 

»  «  I ////»//// 

1*  2$  t ////17///X////X7 

17  42  I / ///X////X////X// 

28  73  |  / ///X////4////X////JC////X/// 

14  *9  I////X////X////X//// 

19  10*  I////X////X////X//// 

is  i2S  i  n/ rut  mum 

♦  152  I////X//// 

l  133  |/ 

I 


OtrrERENZEN  241SCHCN  HCSSUKC  UNO  KORNIL  YERTEILUNS 
3,2 

3.9 

-2.8 

2.7 

-4.3 


c . , 

3.2 


CHlOUlCUTTESi 
r.t,:au*0R4T  7.6514 


Fjg.  6  Probability  distribution  of  133 

gyro-drift  measurements.  X^~test 
and  Gaussian  probability  listribu' 
tion  paper-test. 


The  result  of  the  X2-test  (7.68  0.24)  as  well  as  the  approximately  straight  line  shows 
that  the  gyro  drift  is  Gauss-distributed. 

5.  ESTIMATION  OF  COVARIANCES  AND  POWER  SPECTRA 

In  the  preceding  three  sections  a  technique  has  been  derived  to  extract  stationary 
stochastic  system-  or  measurement  noise  data  from  measurements  which  contain  system  sig¬ 
nals,  deterministic  errors  or  trends.  low  the  stochastic  parameters,  covariances  and 
power  spectra]  densitites,  which  are  required  for  the  Kalman  filter  can  be  estimated.  As 
the  smoothing  procedure  runs  in  a  digital  computer  the  noise  output  is  a  discrete  time 
series.  Therefore  the  following  equations  are  given  in  discrete  formulation  too. 
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Correlation  function 


For  two  discrete  stochastic  series  y,(t.,)  and  y, (t.,  +  nr\AT)  with  j  =  o,  1/.., 
1-k,  m  «  0,1...  tc,  mean  value  o  and  constant  A  ■’T,  the  correlation  function  is: 

,  i  l-K 

oly1,y2(a&n  «  j— y^tj)  *  y2(tj  +  (10) 


y.  *  y~  :  autocorrelation 
y.|  /  '•  crosscorrelation 

For  negative  m: 

^yi,y2(_maT)  *  &  y2»y,  <«at> 

The  Covariance  is  given  by  ^  y1 ,  y?  (o) . 

Power  Spectral  Density 

In  the  frequency  domain  the  stochastic  functions  y. ,  y.  are  described  hy  the  power  spec¬ 
tral  density  (signal  power  per  Hertz  at  a  frequency  of^nAf  .  The  power  spectracl  den¬ 
sity  is  determined  by  the  discrete  Fourier- transformation  of  the  correlation  function. 


S  yt.y2<n&f) 


»y2(« at)|  - 

6T^fty1fy2(0)  +  (fty1*y2(BAT)  + /l*y2»y1  (i»T))' cosTS~~ 
+  i  (fcV,  »y2  (kAT)  +  Jl*y2*y1  (kAT)j  cos  IT  £  1 

-  ioT  (^y1#y2(nAT)  -  Ji y2»y1  (oat)  j-sin  7  —  f 
+  7  ^y^YjOtAT)  -  ^?*y2,y1  (kAT))-8in  T  — -  j 


"shavinon)0'!1-^  ”*  *  '  A'°  ^J^ftfrequency  resolution,  /g  =  1/2 AT, cut  off  frequency 

For  y1  i*  y2  5  has  an  imaginary  part  which  disappears  for  y,  “  y2* 

For  the  enhancement  of  the  statistic  certainty  of  the  power  spectral  density,  Eq,  (11). 
the  correlation  function.  Eg.  (10)  can  be  multiplied  by  a  weighting-function,  g(maT): 

^Yj.yjjtaAT)  -  g(mAT)  •  Ey1(y3(a6Ti  (12) 

The  weighting  function  has  a  length  of  2k AT.  Multiplication  in  the  time  domain  yields 
a  convolution  in  the  frequency  domain: 

{  (  t  ,y2{mAT)}  -  ^g(mAT)J#/[^y1ty2(mAT)| 

It  has  been  shown  [6]  that  the  weighting  function  g,  (nt^T)  «  0.54  +  0.46  cot  (T  «/k  ) 
smoothes  the  spectral  density  and  enhances  the  statistic  certainty  by  2.3.  The  convolution 
is  very  simple  because  ^  |g2(mAT)j  consists  only  of  3  values. 

The  result  is  given  by: 

/y^YjlO)  «  0,54  •  $\ ,fy2(0)  +  0.46  ^y},y2(Af) 

$ y^YjJnAf)  -  o.23  .  ^Ty^yjtta-l)  Af)  +0.54  /'yyy^oAmo.as/ywy.Un+UAf) 

Jy^yjlkAf)  «  0.46  ♦  ^y1,y2((k-n  of)  +  0.54  ^^.yjOtAf) 
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For  the  gyro  drift  data  of  Fig.  6  the  correlation  function,  the  power  spectral  den¬ 
sity  and  the  weighted  spectral  density  are  evaluated  and  displayed  in  Fig.  7  as  an  example. 
The  frequency  steps  are  0.0159/sec  and  the  validity  of  the  relation  f  f  d£  *  $L{o)is 
checked.  -  ^  ' 
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Fig.  7  Evaluation  of  correlation  function,  power  spectral 
density  and  convoluted  power  spectral  density. 
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The  table  shows  a  much  smoother  weighted  spectral  density  than  the  unweighted  J  . 

6.  ESTIMATION  OF  LINEAR  SHAPING  FILTERS 

One  goal  of  the  analysis  of  correlated  stochastic  noise  data  is  the  determination  of 
linear  shaping  filters  which  generate  coloured  noise  from  white  noise.  If  the  spectral 
density,  Eq.  (13)  is  known,  the  following  relation  is  valid[2]  : 

-  ITucj)!2^ 

rt  So  Power  spectral  density  of  white  noise  (const.) 

MyjtYjtw)  Power  spectral  density  of  coloured  noise 

7  (iuJ)  Transfer  function  of  the  shaping  filter. 

From  this  the  amount  of  the  shaping  filter  transfer-function  follows; 

|  J(UJ){  »  y,  y,  (w)/  So  (15) 

From  the  empiric  spectral  density  $ Y^'Y^W)  only  the  amount  of  the  shaping  filter 
transfer-function  can  be  determined  but  not  the  phase.  Therefore,  all  shaping  filters 
theoretically  can  be  used  for  the  generation  of  the  coloured  noise  which  fulfil  Eq.  (15) 
approximately.  In  reality  always  the  simplest  of  all  possible  shaping  filters  is  used 
because  the  phase  is  of  no  interest  for  the  noise. 

A  linear  shaping  filter  is  defined  by  its  structure  (differential  equation)  and  its 
parameters  (time  constants,  resonance  frequencies  and  damping  ratios) .  The  selection  of 
the  structure  and  the  estimation  of  the  parameters  is  feasible  by  the  following  method? 
The  amount  of  the  transfer  function,  Eq.  (15!  is  drawn  in  double-logarithmic  scale.  Two 
examples?  gyro  drift  and  velocity  measurement  error  are  shown  in  Fig.  8. 


Fig.  8  Square-root  of  power^ 
spectral  density  of  (7) 
gyro  drift,  (2)  velocity 
mea suremeo t-efxor , 
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For  a  skilled  person  it  is  relatively  easy  to  determine  the  structure  of  a  linear  filter 
which  is  a  good  approximation  of  the  empiric  value.  For  the  examples  of  Fig.  8  the  appro¬ 
priate  shaping- filter  structures  are  given  by  the  transfer  functions  (16)  and  (17)  in  the 
frequency  domain  (  u  resonance  frequencies/  f  damping  constants/  T  time  constants) 


7{iu) 


7(w,  _ 

_wr  +  4.aS4|L+i 

O7* 

Now  the  parameters  oc  of  the  shaping  filter  (for  the  examples  Tvi  '  ^  '  Wi  and  r  ° 

have  to  be  identified  so  that  the  empiric  value  \j~fZ  ~  ~7j  is  optimally  approximated. 
This  is  obtained  by  minimizing  the  equation:  r'1  ‘ 


7(ot) 


yj.yjtcj)  -  |T(iw/«-)| 


trj 


This  equation  is  non-linear  in  reference  to  .  One  possible  solution  is  the  development 
of  J(<t)  in  a  Taylor  series  [7J  .  The  Taylor  series  is  developed  for  ot  =  oc°  (first 
approximation)  and  has  three  terms.  For  the  scalar  cased  parameter)  this  yields: 


'J(cl)  -  7«tj|  +  .  «*-*•)  +  |!2 

i.a*  3*  *.4*  3 


(<x-ot )* 


—  Min 


The  minimum  of  oc  is  given  by: 

ur  ~  0  +H-L,  no, 

T 

For  several  parameters  a  vector  =  (*.,*_  .....  )  is  introduced.  Instead  of  Eq.  (20) 

the  following  equation  is  valid: 


3*4** 


12-  -4-  all-  £1 

3  ct  *  3^3^  9*i 


OC/*-** 


o  (21) 


<C  -  ft 


The  first  step  is  the  determination  of  the  differentials  £  and  H.  This  is  not  diffi¬ 
cult  but  tedious  and  is  therefore  omitted  here.  Then  roughly  estimated  parameters  «  are 
introduced  in  Eq.  (21)  and,  as  H  is  usually  non-singular,  improved  parameters  ^  can  be 
determined  by: 


*  -  at* 


These  parameters  are  inserted  as  in  Eq.  (22)  in  a  second  step  and  an  iteration 
can  be  started,  provided  that  the  evaluated  parameters  converge  towards  the  optimal  para¬ 
meters.  In  this  case  7/.cJ  (Eq.  (19))  is  reduced  from  one  iteration  step  to  the  next  itera¬ 
tion  step.  Q 

Unfortunately  practical  applications  have  shown  that  this  method  very  often  does  not 
converge,  because  of  too  large  steps  of  &  which  overshoot  the  optimal  parameters.  This 
difficulty  oan  be  removed  by  e  combination  of  Eq.  (22)  with  a  direct  search  method.  After 
the  determination  of  *  >  0  (Bq.  (22))  ^  (rfo  ♦  «/* o(  «c  -  *) )  ,  n  ■  1,2,... 10  is  evaluated. 
The  paratoeter  increment  ^//o(d  -  <*')  which  gives  the  smallest  value  of  7  i8  used  for  the 
next  iteration  step,  with  this  combined  technique  fast  convergence  oan  ba  obtained  in 
most  oases. 

The  example,  Table  1,  shows  the  iteration  results  of  the  velocity  measurement-errors, 

Fig.  8  and  Bq.  (17).  The  tour  parameters Tv. , u. ,  4,4  are  identified  in  four  itera¬ 
tion  steps.  Remarkable  in  the  reduction  or°  2  down  to  2  t  of  the  initial  value. 
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Parameter 

S. 

Tv* 

gj 

? 

Ausgangswerte 

1.  Iteration 

2.  Iteration 

3.  Iteration 

4  Iteration 

0.060 

0.054 

0.051 

0.052 

0.052 

12.0 

10.516 

9.839 

9.767 

9.767 

0.35 

0.317 

0.301 

0.295 

0.295 

0.5 

0.518 

0.555 

0.596 

0.596 

0.0732“ 

0.0103 

0.0019 

0.0015 

0.0015 

Table  1  Evaluation  of  the  shaping  filter-parameters 
for  the  velocity  measurement  errors. 


In  Fig.  9  the  transfer  functions  of  the  models  with  the  optimal  identified  parameters  are 
compared  with  the  empiric  values  of  Fig.  8.  The  fitting  of  the  shaping 


Fig.  9  Amount  of  transfer  function  of 
real  coloured  noise  (empiric 

values)  ( - )  and  its  model  with 

optimally  identified  parameters 
( - ) 

(?)  gyro  drift  (5)  velocity 
measurement  errors. 


filter  is  relative  good  in  the  lower  frequency  range.  A  refinement  of  the  model  for  higher 
frequencies  is  not  worthwhile  because  of  the  small  amounts  in  that  region. 

If  the  curve  fitting  does  not  improve  by  an  increasing  number  of  iterations  the  structure 
of  the  shaping  filter  should  be  altered. 

The  determination  of  the  shaping  filter-structure  and -parameters  concludes  the  analysis  of 
the  stochastic  system  and  measurement  noise.  Now  the  transformation  of  the  shaping  filter 
from  the  frequency-  into  the  time-domain  by  Laplace  or  Fourier-tables  is  required.  The 
resulting  differential  equation  is  included  in  the  Kalman  filter  model  and  the  filter  state 
vector  1b  augmented  by  the  order  of  the  shaping  filter. 

6.  CONCLUSION 

In  the  preceding  sections  methods  and  techniques  for  the  estimation  of  stochastic  para¬ 
meters,  which  are  necessary  for  the  design  of  optimal  filters  were  described.  These 
methods  have  been  derived  from  conventional  stochastic  operations  and  have  proven 
remarkable  effectiveness  in  a  lot  of  applications  [8]  .  The  parameters  are  evaluated  off 
line  with  a  digital  computer  from  measurements  which  should  be  stationary  in  respect  to 
the  system-  and  measurement  noise.  If  considerable  changes  of  the  noise-parameters  in 
dependence  of  deterministic  parameters  (e.g.  aircraft  velocity  or  weather  conditions) 
exist,  this  effect  can  be  modelled  too  by  Introduction  of  time-variable  noise-  and 
shaping-filter  parameters. 
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SUMMARY 

The  design  of  an  effective  operational  Kalman  filter  entails  an  iterative  process  of  proposing  ’lter- 
native  designs  of  different  levels  of  complexity,  "tuning"  each  for  best  attainable  precision,  and  trading 
off  performance  capabilities  and  computer  loading.  The  crux  of  such  a  design  effort  is  establishing  an 
adequate  model  upon  which  to  base  the  filter  -  a  model  describing  system  dynamics,  measurement  device 
characteristics,  and  statistical  properties  of  uncertainties  associated  with  this  structure,  all  of  which 
becomes  embodied  in  the  filter  algorithm.  Using  physical  insights,  order  reduction  and  model  simplifi¬ 
cation  techniques,  numerous  prospective  filters  are  proposed  for  a  given  application.  Once  this  is  accom¬ 
plished,  it  is  critical  to  be  able  to  assess  the  true  estimation  performance  of  any  given  filter  configu¬ 
ration  when  subjected  to  the  real  world  environment.  A  systematic  design  procedure  is  described  that 
exploits  these  stochastic  modeling  and  performance  analysis  capabilities,  and  an  example  is  used  to  empha¬ 
size  some  important  aspects  of  the  design  approach. 


1.  INTRODUCTION 

A  Kalman  filter  is  an  optimal  recursive  data  processing  algorithm  that  accepts  incomplete  noise- 
corrupted  measurements  from  sensors  to  provide  an  estimate  of  the  state  variables  that  describe  the  be¬ 
havior  of  a  dynamic  system.  It  combines  this  real-time  data  with  the  results  of  stochastic  modeling 
efforts,  namely,  (1)  mathematical  models  of  system  dynamics  and  measurement  device  characteristics,  (2) 
the  statistical  description  of  the  system  noises  and  disturbances,  measurement  errors,  and  uncertainties 
and/or  inadequacies  in  the  mathematical  models  themselves,  and  (3)  any  available  a  priori  statistical 
information  about  the  system  states,  to  generate  the  desired  state  estimate.  Under  the  assumptions  that  an 
adequate  system  model  can  be  expressed  in  the  form  of  a  linear  system  driven  by  white  Gaussian  noise,  its 
estimate  can  be  shown  to  be  optimal  with  respect  to  essentially  any  useful  criterion  of  optimal ity:  it  is 
the  minimum  mean  square  error  estimate,  the  generalized  least  squares  estimate,  the  minimizer  of  any 
symmetric  cost  criterion,  the  maximum  a  posteriori  estimate,  the  orthogonal  projection  of  the  true  state 
onto  the  span  of  the  measurements,  the  maximum  likelihood  estimate  if  there  is  no  a  priori  state  informa¬ 
tion  (and  superior  to  the  maximum  likelihood  estimate  if  a  priori  statistics  are  available!),  and,  perhaps 
most  importantly,  its  output  totally  defines  the  entire  Gaussian  conditional  density  function  for  the 
system  state  vector  conditioned  on  the  entire  history  of  measurements  that  have  been  processed  [1]. 

These  optimality  claims  are  impressive,  but  they  are  totally  dependent  upon  the  modeling  assumptions. 
Mathematical  models  of  both  the  system  structure  (state  dynamics  and  measurement  relations)  and  uncertain¬ 
ties  are  inherently  eidbodied  In  the  Kalman  filter  structure,  and  the  fidelity  of  these  models  dictates  the 
performance  of  the  filter  in  actual  application.  Attaining  an  adequate  mathematical  model  upon  which  to 
base  the  filter  is  the  crux  of  the  design  problem.  Thus ,  oespi te  "the  matfiematl cal  fcrmaTT sm  of  the  Kalman 
filter  approach,  a  substantial  amount  or  engineering  insight,  fundamental  modeling  capability,  and  experi¬ 
ence  is  required  to  develop  an  effective  operational  filter  algorithm. 

Moreover,  the  designer  typically  does  not  have  the  luxury  of  implementing  the  filter  based  upon  the 
best  descriptive  and  most  conplete  and  complex  model,  often  termed  the  "truth  model."  The  final  filter 
algorithm  must  meet  the  constraints  of  online  computer  time,  memory,  and  wordlcngth,  and  these  considera¬ 
tions  dictate  using  as  simple  a  filter  as  possible  that  meets  performance  specifications.  Consequently, 
the  designer  must  be  able  to  exploit  basic  modeling  alternatives  to  achieve  a  simple  but  adequate  filter, 
adding  or  deleting  model  conplexity  as  the  performance  neods  and  practical  constraints  require.  The  result 
is  that  he  often  generates  not  one,  but  several  proposed  filters  of  various  degrees  of  sophistication  and 
performance  potential,  and  a  tradeoff  analysis  is  conducted. 

Evaluation  of  the  true  performance  capabilities  of  simplified,  reduced  order  filters  is  thus  of  critical 
Importance  in  the  design  procedure.  Although  a  Kalman  filter  coaputes  an  error  covariance  matrix  internally, 
this  is  r  valid  depiction  of  the  true  errors  committed  by  the  filter  only  to  the  extent  that  the  filter's 
own  system  model  adequately  portrays  true  system  behavior.  It  is  very  possible  for  the  filter  not  to  per¬ 
form  as  well  as  It  "thinks"  It  does.  If  the  computed  error  covariance  Is  inappropriately  "small"  (in  norm, 
magnitude  of  individual  eigenvalues,  etc;),  so  is  the  coaputed  filter  gain:  the  filter  discounts  the  data 
from  the  "real  world"  too  much  and  weights  its  internal  system  model  too  heavily.  Such  a  condition  leads 
to  filter  state  estimates  not  corresponding  to  true  system  behavior,  with  a  simultaneous  Indication  by  the 
filter-computed  covariance  that  the  estimates  are  precise:  filter  divergence  is  exhibited.  One  significant 
task  in  the  overall  design  process  is  the  tuning  of  each  proposed  filter,  iteratively  choosino  the  design 
parameters  (covariance  matrix  entries  describing  the  statistics  of  uncertainties  associated  with  the 
filter's  dynamics  and  measurement  models)  that  yield  the  best  true  estimation  performance  possible  from 
that  particular  filter  structure.  This,  in  fact,  is  accomplished  by  choosing  the  design  parareters  so 
that  the  filter-conputed  error  covariance  i^  a  good  representation  of  the  true  error  covariance. 

The  design  of  an  effective  operational  Kalman  filter  entails  an  Iterative  process  of  proposing  alterna¬ 
tive  designs  through  physical  insights,  tuning  each,  and  trading  off  performance  capabilities  and  computer 
loading.  Section  z  discusses  the  development  of  numerous  proposed  filters  for  a  given  application. 

Section  3  then  develops  the  ability  to  analyze  the  performance  capability  of  any  Kalman  filter  configuration 
operating  in  the  real  world  environment.  With  such  performance  analysis  available,  Section  4  presents  a 
systematic  design  procedure  and  Section  5  provides  an  example  of  exploiting  these  results. 
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2.  PROSPECTIVE  FILTER  DESIGNS 

Any  prospective  Kalman  filter  Is  based  upon  a  design  model  of  state  dynamics  and  measurement  character¬ 
istics.  The  design  model  dynamics  equation  Is  a  linear  stochastic  differential  equation  for  the  n- 
dimenslonal  state  vector  x(t): 

dx(t)  =  F(t)x(t)dt  +  B(t)u(t)dt  +  G(t)d§(t)  (1) 

where  u(t)  Is  an  r-vector  of  deterministic  control  Inputs  and  $(t)  Is  s-dlmensional  Brownian  motion  of 
diffusion  g(t)  for  all  time  t  of  Interest,  with  statistical  description  given  by 

E{s(t) }  =  0  (2a) 


E{[B(t)  -  B(t')][|l(t)  -  £(t')]Tl  -  f  g(t)dr 


where  E{  }  denotes  expectation.  The  a  priori  Information  about  the  Initial  state  x(t  )  Is  provided 
in  the  form  of  a  Gaussian  density  specified  by  mean  ^  and  covariance 

E{x(tQ)}  •  Xq  (3a) 

-  Xo][x(t0)  -  *  P^  (3b) 

Often  Eq.  (1)  Is  written  less  rigorously  as 

x(t)  =  F(t)x(t)  +  B(t)u(t)  +  G(t)w(t)  (4) 

by  heurlstically  dividing  through  by  dt,  where  w(t)  is  zero-mean  white  Gaussian  noise  (the  hypothetical 
derivative  of  Brownian  motion  J3(t))  of  strength~"Q(t) : 

E{w(t))  -  0  (5a) 

E{w(t)wT(t+x)} « (5b) 

where  6(t)  is  the  Dirac  delta  function. 

At  each  sample  time  t. ,  an  m-dlmenslonal  vector  of  measurements  z(t,)  becomes  available,  modeled 
as  a  linear  combination  or  the  states  plus  additive  noise: 

i(tj)  ■  Hftj)  xftj)  +  v(tj)  (6) 

where  v(t|)  is  zero-mean  white  Gaussian  discrete-time  noise  with  covariance  R{tt ) : 

E(v(t,))-0  (7a) 

ElvU^ttj))  ■  R(t,)4tj  (7b) 

where  4^  Is  the  Kronecker  delta  (4^  ■  1  If  1  •  J;  4^  ■  0  If  1  i  j).  The  measurement  corruption 

noise  v(ti)  Is  usually  assumed  independent  of  the  dynamics  driving  noise  w(t)  for  all  times  t  and  t4 
(although ’this  is  readily  generalized  [1]).  ~  1 

Once  the  system  model  has  been  defined  by  the  structural  parameters,  l.e.,  the  time  histories  of 
•  and  the  statistics  of  uncertainties,  l.e.,  the  (x-.Pd  values  and  t1*»  histories,  the 
Kalman  filter  algorithm  can  be  specified.  Namely,  the  state°esclmate  and  error  covariance  are  propagated 


from  saaple 


to  the  next  sample  tine  t<  by  means  of  Integrating 


x(t/tM)  *  fUJxU/t^,)  ♦  6(t)u(t)  (B) 

'  SOtft/t^,)  ♦  £(t/tM)FT(t)  ♦  G(t)a(t)GT(t)  (9) 

where  the  notation  x(t/t,  ,)  corresponds  to  the  estimate  (conditional  mean)  of  x  at  time  t,  conditioned 
on  measurements  taken  through  saaple  time  t.  .,  and  P(t/t,  . )  Is  the  corresponding  conditional  state 
(and  error)  covariance.  These  are  propagated  from  tKe  Initial  conditions 

ittt-l'W  "  *UM*)  .  P(t1.,/t1,1)  •  P(tM*)  (10) 

using  the  results  of  the  measurement  update  at  time  t,  , ,  where  the  superscript  *  denotes  'after  measure¬ 
ment  Incorporation.'  Integration  of  (8)  and  (9)  yield?  the  best  prediction  of  x[tJ  before  the  measurement 
at  t{  Is  incorporated,  denoted  as  x(t{‘),  and  the  associated  error  covariance  £(tj*): 

i(t,‘)  -  xfyt,.,)  ,  P(t,‘)  •  P^/t,.,)  (11) 

In  fact,  the  solution  to  Eqs.  (8)  and  (9)  can  be  written  explicitly  as 

£(t <’)  •  ♦(t1.t1.1)x{t1./)  ♦  |  *  ilVt^rluft)*  (12) 

*1-1 

P(t,*)  •  ♦  |  i  ♦(t,,t)G{T)fl(T)GT(t}eT(t<.t)dt  (13) 

'Vi 

In  term  of  the  state  transition  matrix  *  associated  with  f(t)  In  Eq.  (I)  or  (4),  l.e.,  the  solution  to 
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*{t,t  )  =  £(t)*(t,t  )  and  *(t  ,t  )  =  L  This  form  of  propagation  relation  Is  especially  useful  for  filter 
applications  involving  time-invariant  system  models,  stationary  noises,  and  fixed  sampling  period,  as  a 
replacement  for  direct  numerical  Integration  of  (8)  and  (9).  At  sample  time  t,,  the  measurement  zjtj  Is 
Incorporated  Into  the  estimate  according  to  the  update  relations 


K(tt)  =  P(ti~)HT(ti)[H(t1  )P(t1")HT(ti )  +  R^)]-1 

(14) 

x(ti+)  =  J<( ti  ~)  +  K(ti  )[z^(ti )  -  H(ti  )x.(t1-)] 

(15) 

P(tf+)  =  P{tp  -  K(t1)H(t1)P(ti“) 

(16) 

Starting  from  the  initial  conditions  of  x.  and  given  by  Eq.  (3),  this  algorithm  recursively  pro¬ 
cesses  time  propagations  and  measurement  updates.  The  propagations  between  measurement  sanple  times 
inherently  use  the  system  dynamics  model,  Eqs  (1)  -  (5),  to  provide  the  predicted  state  and  error  covariance. 
Through  use  of  the  measurement  model,  Eqs.  (6)  and  (7),  it  also  generates  the  best  estimate  of  what  the 
next  measurement  will  be  before  It  actually  arrives.  Then  the  measuring  devices  are  sampled,  the  residual 
Is  computed  as  the  difference  between  these  measurements  and  their  predicted  values,  and  finally  the  filter 
gain  (itself  dependent  on  the  structural  and  statistical  models  of  (1)  to  (7))  optimally  weights  this 
residual  to  produce  the  updated  state  estimate.  Because  these  models  are  embedded  in  the  structure  of  the 
filter  algorithm,  the  performance  potential  of  any  Kalman  filter  Is  directly  a  function  of  the  adequacy 
of  Its  assured  models. 

A  systematic  design  procedure  will  encompass  the  generation  of  alternative  filter  designs,  each  based 
on  a  particular  set  of  models,  and  an  evaluation  of  realistic  perfornance  capabilities  and  conputer  loading 
for  each  one.  It  Is  possible  to  devise  filters  based  on  very  extensive  models,  and  in  fact  It  Is  useful  to 
investigate  the  performance  of  the  filter  based  on  the  most  complete  model,  known  as  a  “truth  model",  to 
establish  a  baseline  of  performance  to  which  to  compare  others.  However,  such  a  filter  Is  typically  more 
sophisticated  than  required  to  meet  performance  specifications,  and  it  is  prohibitive  computationally. 

The  deslgier  must  seek  simplified  filter  design  models  that  retain  the  dominant  features  of  the  original 
system  characteristics  and  provide  adequate  estimate  precision.  This  Is  probably  the  most  difficult  aspect 
of  designing  a  filter,  and  It  requires  substantial  understanding  of  the  real  world  system  and  of  stochastic 
modeling,  as  well  as  conpetence  in  filtering  theory. 

Suppose  a  large-dimensioned,  complex  system  model  existed  upon  which  a  filter  could  be  based  that  far 
exceeded  performance  requirements.  Since  the  number  of  multiplications  (time-consuaing  on  a  computer)  and 
additions  required  by  the  filter  algorithm  Is  proportional  to  nJ  and  the  number  of  storage  locations  is 
proportional  to  nJ,  where  n  is  the  state  dimension,  one  significant  means  of  decreasing  the  conputer  burden 
Is  t0  reduce  the  order  by  deleting  and/or  combining  (or  “aggregating")  states  [1-10],  There  is  often 
considerable" physical  ihsI’ghTlnto  the  re TaiVve~slgn> Seance  ofTarious  states  upon  overall  estimation 
precision,  that  suggests  which  states  might  be  removed.  States  with  consistently  small  rets  value,  such  as 
those  corresponding  to  higher  order  and/or  higher  frequency  system  modes  which  typically  have  lower  energy 
associated  with  them,  especially  warrant  inspection  for  possible  removal.  Other  noncriti cal  system  modes 
might  also  be  discarded,  especially  if  they  are  only  weakly  observable  or  controllable  [11,1?].  An  error 
budget  performance  analysis  of  the  most  conplete  filter,  to  be  discussed  in  the  next  section,  is  an  invalu¬ 
able  aid  to  this  state  dimension  reduction. 

In  many  applications,  system  models  arc  in  the  fora  of  the  fundamental  descriptive  equations  of  sow 
physical  system,  driven  by  time-correlated  stochastic  processes  whose  characteristics  match  those  of 
Physical  phenomena  such  as  noises,  disturbances,  and  sn  forth.  These,  in  turn,  are  modelled  as  the  outputs 
of  linear  'shaping  filters"  [1]  driven  by  white  Gaussian  noise,  with  dimension  and  defining  parameters 
chosen  so  that  these  model  outputs  have  statistical  properties  that  replicate  or  closely  approximate  empiri¬ 
cally  observed  weans,  autocorrelation  functions,  power  spectral  densities,  etc.  of  the  actual  physical 
phenomena.  Particularly  in  these  shaping  filters  are  substantial  order  reduction  efforts  usually  made. 

Often  a  high  dimensioned  shaping  filter  in  the  overall  "truth  model’  is  replaced  by  a  very  low  order 
shaping  filter,  such  as  a  first  order  lag  driven  by  aero-mean  white  Gaussian  noise: 

*(t)  •  -  [l/T]x(t)  ♦  w(t)  (17) 

where  the  strength  0  of  the  white  noise  w  is  ?e?/T  .  so  that  the  *  process  in  steady  state  has  wan  of 
aero,  wan  squared  value  and  variance  of  «} ,  autocorrelation  fimetion  of  E(x{t)x{t«t)!  *  <re*p(-|r  1/T)  so 
that  T  It  the  correlation  tire,  and  power  spectral  density  function  [?o'/T}/[w5  ♦  (1/T)*’]  so  that  the  band¬ 
width  or  x  is  (1/T).  The  values  of  <?*  and  T  are  treated  as  design  parameters  to  match  the  **m>irically 

s<W?r*<,1v4,ue  (or  low-frequency  power  spectra)  density  value  with  2o!T)  and  bandwidth 

with  (1/T)  ignoring  less  predominant  characteristics.  In  fact,  If  the  bandwidth  is  wide  coapared  to  the 
bandpass  of  the  system  driven  by  this  noise,  a  zero-state  trivial  shaping  filter  might  be  proposed:  whit# 
Gaussian  noise  of  strength  to  match  the  low-frequency  power  spectral  density  characteristics. 

It  wist  be  emphasized  that  deleting  states  and  coining  many  states  Into  fewer  'equivalent*  states 
must  be  evaluated  in  terms  of  resulting  filter  performance,  as  described  In  the  next  section.  Experience 
has  shew*  that  reductions  motivated  even  by  the  best  of  physical  Insight  can  sometimes  degrade  estimation 
accuracy  unacceptably.  Furthermore,  an  inappropriately  reduced  filter  of  state  dimension  n  can  often  be 
outperformed  by  a  filter  involving  fewer  than  n,  differently  chosen,  states.  The  extreme  case  of  this  would 
be  an  unstable  higher-dimensioned  filter  being  based  upon  an  irobservable  system  model  in  which,  for  In¬ 
stance,  two  states  correspond  to  different  physical  variables  but  are  Indistinguishable  in  their  effect  on 
the  model  outputs,  while  the  lcwer-dlmensloned  filter  model  combined  states  to  achieve  observability. 

Simplification  of  system  model  matrices  for  a  given  dimension  state  model  is  also  possible.  Oominated 

tenwjn  a  single  matrix  element  can  ^neglected,  as  in  replacing  the  state  transition  matrix  «  By - "~ 

U  ♦  l  6tJ  ,n  •  time-invariant  or  slowly  varying  model,  ignoring  higher  order  terms  in  each  elerent.  *>ro- 
entire  weak  coupling  terms  can  be  removed,  producing  matrix  elements  of  zero  end  thus  fewer  required 
iiYttpHcitlcnSv  S ewe t >•*$  suen  rej*>vil  aTISi?  decoupling  the  filter:  a  decentralized  design  achieved 
through  monsingular  perturbations  [13].  In  numerous  applications,  the  terms  that  c«  be  Ignored  co^irise 
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the  time-varying  nature  of  the  model  description,  or  at  least  the  most  rapid  variations,  so  that  a  time- 
invariant  model  (or  at  least  one  that  adults  quasi-static  coefficients)  can  be  used  for  the  basis  of  a 
computationally  advantageous  filter.  Furthermore,  a  given  problem  can  often  be  decomposed  via  singular 
perturbations  [13-19]  Into  a  number  of  slnpler  nested  problems,  with  "inner  loops"  operating  at  fast  sanple 
rates  to  estimate  (and  control)  "high  frequency1'  dynamics  states  (assunlng  slower  states  are  random 
constants)  and  "outer  loops"  operating  with  longer  sample  periods  to  estimate  (and  control)  slower  dynamics 
(designed  assuming  that  "faster"  states  have  reached  steady  state  conditions). 

The  number  of  multiplications  and  additions  required  by  a  filter  algorithm  can  be  minimized  by  trans¬ 
forming  into  a  canonical  state  space  representation,  since  the  resulting  system  matrices  embody  a  hTgK 
density  of  zeroes.  Obviously,  one  might  also  attempt  to  reduce  the  computational  burden  by  increasing  the 
sanple  period  of  the  filter.  If  performance  allows  this  option. 

The  methods  discussed  up  to  this  point  have  involved  the  generation  of  a  simplified  model,  with  subse¬ 
quent  filter  construction.  It  Is  also  advantageous  to  consider  approximating  the  filter  structure  itself. 
Because  of  the  separability  of  the  conditional  mean  and  covariance  equations  in  the  filter,  it  Is  possible 
to  precompute  and  store  the  filter  gains  rather  than  calculate  them  online.  This  precomputed  filter  gain 
history  can  often  be  approxliMted  closely  by  curye-fltted  simple  functions,  such  as  piecewise  constant 
functions,  piecewise  linear  functions,  and  weighted  exponentials.  Thus,  the  filter  covariance  and  gain 
calculations,  which  conprlse  the  majority  of  the  computer  burden,  are  replaced  by  a  minimal  amount  of 
required  computation  and  storage.  Online  practicality  can  be  enhanced  still  further  in  the  case  of  a 
filter  based  on  a  time-invariant  system  model  driven  by  stationary  noises.  In  many  such  applications,  a 
short  initial  transient  is  followed  by  a  long  period  of  essentially  steady  state  constant  gain,  and  the 
approximation  of  using  these  constant  gains  for  all  time  may  be  entirely  adequate  for  desired  performance. 

Of  course,  there  are  some  drawbacks  to  using  stored  gain  profiles.  Future  gains  do  not  change  appropriately 
when  scheduled  measurements  are  not  made,  due  to  data  gaps  or  measurement  rejection  by  reasonableness  tests 
on  the  residuals  [1],  Nor  can  the  prestored  gains  adapt  online  to  conpensate  for  filter  divergence  or  a 
system  envi ronment  that  Is  different  than  anticipated  during  the  design  phase  [20].  Finally,  lengthy 
simulations  are  usually  required  to  determine  a  single  gain  history  that  will  perform  adequately  under  all 
possible  conditions  for  an  actual  application. 

3.  PERFORMANCE  ANALYSIS 

Throughout  the  previous  section,  the  critical  significance  of  an  "adequate"  system  model  within  the 
filter  structure  was  stressed.  To  assess  the  capabilities  of  various  7T 1  ter  designs  re latlve  to  each' "other 
and  lo  a  set’ of  performance  specifications,  one  must  have  at  his  disposal  a  means  of  producing  an  accurate 
statistical  portrayal  of  estimation  errors  committed  by  each  filter  in  the  "real  world"  environment,  with¬ 
out  actuaHy  building 'aruTlestlng  eacfTYn  the  “real  world".  A  performance  analysis  as  depicted  In  Fig.  1 


Fig.  1  Performance  Evaluation  of  a  Kalman  Filter  design 
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fulfills  this  objective  by  replacing  the  "real  world"  measurement  generation  with  the  output  of  the  best, 
most  complete  mathematical  model  that  can  be  developed,  called  a  "truth  model"  or  reference  model  [1,21-24]. 
Extensive  data  analysis,  and  shaping  filter  design  and  validation  are  expended  to  ensure  that  this  provides 
a  very  accurate  representation  of  the  "real  world",  since  the  ensuing  performance  evaluations  and  systematic 
design  procedure  are  totally  dependent  upon  this  fidelity.  For  example,  a  very  good  generic  model  of  the 
errors  In  an  inertial  navigation  system  (INS)  has  been  constructed  In  the  form  of  a  linear  model  of  about 
70  states  driven  by  white  Gaussian  noise  [1,6,21,25-29];  thorough  laboratory  and  flight  testing  of  a 
particular  INS  allows  complete  specification  of  the  model  parameters  to  yield  the  "truth  model"  for  that 
particular  system.  On  the  other  hand,  the  design  models  used  for  the  basis  of  operational  aided-INS  Kalman 
filters  often  represent  these  same  characteristics  with  15  or  fewer  states,  with  less  but  acceptable 
fidelity. 


The  truth  model  Is  composed  of  a  stochastic  differential  equation  for  the  n. -dimensional  state  process 
x*(t),  where  the  subscript  t  denotes  "truth  model",  and  an  associated  output  equation  to  generate  the  "true" 
sampled-data  measurements  z^t,)  in  Fig.  1.  If  the  truth  model  itself  is  a  linear  model,  then  it  can  be 
expressed  by  equations  of  the  form  of  (l)-(7),  but  with  subscript  t  added  to  all  variables: 


Xj.Ct)  =  F^tjx^t)  +  Bt(t)u(t)  +  G^tjw^t) 

+  VtUi) 

where  w*(t)  is  zero-mean  white  Gaussian  noise  of  strength  Q* (t )  and 


(18) 

(19) 


where  w,(t)  is  zero-mean  white  Gaussian  noise  of  strength  (L.(t)  and  v* ( t ^ )  is  discrete-time  zero-mean  white 
Gaussian  noise  of  covariance  MtJ  and  Independent  of  wt ( tj ;  x*(t  )  ts  described  as  Gaussian,  of  mean 
x.  and  covariance  P*  .  These;  relations  do  not  yet  account  for  the  feedback  from  the  Kalman  filter  as 
depicted  in  Fig.  1;  tRe  required  modifications  will  be  described  to  simulate  such  feedback  of  filter  out¬ 
puts  to  the  "real  world"  for  control  purposes.  It  is  also  possible  to  consider  a  nonlinear  truth  model,  as 


i^(t)  °  l^U),  u(t),t]  +  <^(0^(0  (20) 

1^)  =  !lt[xt(t1  >,t1]  +  v^tj)  (21) 

with  the  statistical  description  of  uncertainties  given  by  (x.  ,  P.  ,  (L,  R  }  as  before.  In  fact,  even 
more  general  I  to  state  stochastic  differential  equations  can  Be  used,  aS  allowing  G.  to  be  a  function  of 
as  well  as  t,  to  produce  useful  Markov  state  processes  [1,20,30,31]. 

It  Is  desired  to  achieve  a  meaningful  comparison  of  filters  that  may  differ  substantially  in  state 
dimension  and  internal  model  specification,  when  each  is  subjected  to  the  realistic  measurement  environment 
generated  by  the  truth  model.  However,  for  any  given  application,  there  are  certain  variables  of  critical 
interest  no  matter  which  filter  Is  under  consideration,  and  every  proposed  filter  must  be  able  to  provide 
estimates  of  these  quantities  or  variables  functionally  related  to  them.  For  instance,  in  aided-INS  appli¬ 
cations,  position,  velocity,  and  often  attitude  variables  are  paramount,  and  any  additional  states  in  a 
particular  design  are  of  secondary  importance.  These  p  critical  variables,  denoted  here  as  the  stochastic 
process  y(t),  will  serve  as  the  basis  of  the  performance  analysis.  They  are  assumed  to  bo  related  to  the 
filter  states  through  a  linear  transformation,  so  that 

i(t/tM)  •  C(t)x(t/tM)  (22a) 

is  the  estimate  of  y(t)  for  any  tr(tj_j,tj),  using  Eq.  (8),  and 

'  £(t,)i(t ,“)  (22b) 

lit*)  *  C(tj)x(tj+)  (22c) 

are  the  estimates  of  y(t,)  before  and  after  Incorporation  of  the  measurement  z^t,).  Often  C  Is  time- 
invariant,  and  if  its  structure  is  [^  t  0],  then  y  is  simply  the  first  p  of  tM  n  filter  stales. 


As  shown  in  Fig.  1,  the  truth  model  state  process  x.(t)  can  also  be  used  to  generate  something  general¬ 
ly  unavailable  from  the  “real  world":  the  true  value  oTche  quantities  of  interest  at  any  time  t  as 

yt(t)  ■  Ct(t)xt(t)  (23) 

again  In  terms  of  a  linear  transformation  represented  by  a  p-by-n,  matrix  Ct(t).  Of  course,  Eqs.  (22)  and 
(23)  can  be  extended  to  nonlinear  functions  Instead  of  linear,  but  the  linear  function  case  Is  being 
emphasized  here.  Having  access  to  the  true  values  y,(t),  we  can  represent  the  true  error  committed  by  the 
particular  Kalman  filter  in  attempting  to  estimate  tne  quantities  of  interest,  when  subjected  to  reansTTc 
measurements,  as 


e^t)  •  y(t/tM)  -  ^(t) 

(24a) 

^(t,*)  »  y(tj‘)  -  y^t,) 

(2«b) 

£*(*/)  •  y(tj*)  -  y^tj) 

(2*0 

over  the  sarnie  period  before  sample  time  t,,  and  Just  before  and  after  wasurement  incorporation  at  time 
t,,  respectively.  If  tamulsive  feedback  control  from  the  filter  into  the  "real  world"  system  is  admitted 
[1,6,21-23,27],  then  another  error,  corresponding  to  after  both  measurement  updating  and  impulsive  control 
application,  is  also  of  Interest: 

•  y(t,K)  -  y^t,')  (24d) 

where  the  superscript  c  denotes  efter  control  is  applied.  The  objective  of  «  performance  analysis  is  to 
characterize  the  true  error  process,  Eq.  (24).  statistically.  !n  a  TEjjj  te  Carlo  analysis  tV,24j.  many 
samples  of  the  error  stochastic  process  are  proceed  ly  simulation,  and  (Ken  the  satple  statistics  are 


9-6 


computed  directly.  If  the  truth  model  Is  Itself  totally  linear  as  In  Eqs.  (18)  and  (19)  and  If  strictly 
linear  output  relations  (22)  and  (23)  and  only  linear  feedback  are  used,  time  histories  of  the  statistics 
themselves  can  be  computed  directly  In  a  mean  and  covariance  analysis  [1,21,23].  The  relations  for  con¬ 
ducting  either  analysis  will  be  developed  after  the  "feedback  gains"  block  of  Fig.  1  is  specified  more 
precisely. 


Ore  type  of  feedback  Is  Impulsive  control  or  dlscrete-tlme.reset,  In  which  quantities  in  the  “real 
world"  can  be  changed  instantaneously  based  upon  the  estimate  x(t,+).  For  example,  In  aided-INS  Kalman 
filters,  estimates  of  the  errors  In  position  and  velocity  Indications  of  the  INS  are  fed  back  to  the  INS 
for  correction  by  resetting  contents  of  conputer  memory  locations..  Once  x(t,+)  Is  computed,  the  reset 
control  Is  calculated  as  a  function  (assumed  linear)  of  It,  D.(t, )x(t,+),  and  after  the  control  Is  applied, 
the  truth  model  state  becomes  -c 


x^0)  «  x^)  -  Ot(ti)x(ti+)  (25) 

The  filter  should  be  “told"  that  this  feedback  to  the  system  has  occurred,  so  Its  state  estimate  Is  modi¬ 
fied  as 


xU^)  “  X(ti+)  -  DUjJxft^)  =  H-fitVlxU/)  (26) 

where  the  n-by-n  D(t^)  models  the  effect  of  feedback  through  the  actual  nt-by-n  gains  D^t, )  Into  the 

system.  This  x(tj+c)  replaces  x(t*+)  as  the  initial  condition  for  the  next  time  propagation.  Some  true 
system  variables  are  controlled  over  the  entire  sample  period  rather  than  inpulslvely,  which  can  be 
expressed  by  modifying  Eq.  (18)  to 


x^t)  *  F(.(t)xt(t)  -  Xt(t)x(t/t1.1)  +  l^(t)u(t)  +  Gt(t)wt(t) 

Again,  the  filter  should  be  Informed  of  such  feedback,  so  (8)  Is  changed  to 


x(t/tM)  =  F(t)x(t/tM)  -  X(t)x(t/tM)  ♦  B(t)u(t) 
»[F(t)  -  X(t)3x(t/t1_1)  ♦  B(t)u(t) 


(27) 


(28) 


Now  consider  Fig.  1  again:  If  the  truth  model  Is  linear,  then  the  entire  system  enclosed  by  the  dashed 
lines  is  itself  a  linear  system  driven  by  white  Gaussian  noises.  To  characterize  the  output  process 
e.(t)  from  such  a  system  model,  one  first  characterizes  the  Gauss-Markov  state  process  for  the  overall 
system  -  the  augmented  state  process  *(t)  coaposed  of  both  truth  model  states  and  filter  states: 


*,(*>  A 


*t(t) 

*(t/tM) 


(29) 


From  Eqs.  (27)  and  (28),  the  augmented  state  process  time  propagation  relation  Is 
j,(t)  ■  ^(tjxjt)  ♦  B,(t)u(t)  ♦  §,(0*^(0 

where 


£t(0 

-Xt(t)' 

vtJ* 

[F(t)-X(t)] 

.  1,(1)  * 

B(t) 

m  * 

.  5,(t)  • 

0 

•  * 

Solved  forward  from  ti 


x  (t  *c' 

V  1-1  ' 


t.  ,  with  the  Initial  conditions 
‘^1-1°) 

&  1-1*1 


(30) 


(3D 


(32) 


Measure nent  update  relations  are  obtained  by  realising  that  the  truth  model  state  is  unaltered  by  a 
uasuntaent, 

StO  •  *t(t,*)  (33) 


and  that  the  filter  update  can  be  written  as 

*(t,*)  -  id,*)  ♦  Kd^^d,)  -H(t,  )id,‘)] 

■  ti-itvsyv&t,’)  ♦  xu^d,)*^,)  ♦  ku^^u,) 
Putting  these  into  auganted  fom  yields 

•  Wia^P  ♦  WW 

r  I  o  -]  ro  i 

^(t,)  *  ^(t^^tt,)  '  ^(t,}  ^(t4)j 


(3d) 

(35) 

(36) 
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Simlarly,  using  Eqs.  (25)  and  (26),  the  iopulsive  control  update  can  be  represented  by 

'  W*a(t1+) 


(37) 


l 

0 


-MV 

[i-o(t,n 


If  feedback  is  not  enployed,  (^(t^)  is  sloply  an  (nt*n)-by-(nt+n)  identity  matrix,  so  that 

2*^1 +c>  *  yV>- 


Finally,  the  true  error  comnitted  by  the  filter  can  be  expressed  in  terms  of  the  augmented  state 
vector  as 


(38) 


i^t)  *  C^tjx^t) 

for  any  time  t  of  interest,  where,  from  Eqs.  (22)  -  (24), 
yt)  •  t-Ct(t)  C(t)] 


(39) 


(40) 


Equations  (29)-(3 2)  and  (35)-(40)  are  the  basis  of  a  Honte  Carlo  analysis,  allowing  simulation  of  many 
individual  samples  of  the  true  error  stochastic  process.  By  taking  appropriate  expectations  of  these 
results,  mean  and  covariance  analysis  relations  can  also  be  produced.  For  simplicity,  assume  all  quantities 
are  zero  mean  and  concentrate  attention  on  the  covariance  analysis  results  (these  are  readily  generalized 
to  the  case  of  nonzero  ^  and  u).  The  time  history  of  the  error  covariance 


yt)  =  E(et(t)etT(t)l 
is  then  the  desired  output,  and 
yt) .  E<yt)  x>, 

is  first  computed  as  a  means  of  obtaining  this  result.  The  appropriate  initial  conditions  are 

0’ 


W 


>to 


(41) 


(42) 


(43) 


Propagating  from  sample  time  t,  f  to  t,  is  accoetplished  by  integrating 

*  yuyt)  ♦  yof/m  ♦  (44) 

forward  from  the  initial  condition  PJ(tt..i*<') ,  as  seen  from  Eqs.  (30)  -  (32).  The  measurement  update 
relation  derived  from  Eqs.  (35)  and~T36j  H 

£«(0  *  VW'PSa^V  *  *S*(t,)Rt(ti )KaT(t| )  (45) 

From  (37)  and  (38),  the  impulsive  control  update  is 

yVC)  *  yt^^U/jO^d,)  (46) 

As  the  time  history  of  P  is  generated  recusively  using  (44)  *  (46),  the  desired  true  error  covariance  can 
be  obtained  simultaneously  via 

yt)  *  yoyoc/m  < 

as  derived  from  (39)  and  (40).  Because  the  augmented  state  system  model  is  a  linear  system  driven  by 
*d»Ue  Gaussian  noise,  the  covariance  relations  are  not  coupled  to  the  actual  measurement  history  reali¬ 
zations,  so  it  is  possible  to  perform  this  covariance  analysis  a  priori,  without  resorting  to  explicit 
iinulatlw  o<  measurement  process  samples. 

Although  a  covariance  analysis  is  computationally  more  efficient  than  a  ftonte  Carlo  study  and  so  snould 
be  exploited,  especially  in  initial  design  phases,  there  are  advantages  to  using  the  Hpnte  Carlo  approach 
in  addition.  First,  a  Honte  Carlo  study  encompasses  a  system  simulation  in  which  the  actual,  entire 
filter  algorithm  is  embedded.  As  such,  portions  of  the  simulation  can  be  replaced  by  actual  data  or  hard¬ 
ware  as  It  becomes  available  in  the  system  evolution,  ftoroowr,  sip»  errors  In  the  filter  algorithm  that 
may  not  be  readily  apparent  in  a  covariance  analysis  due  to  squaring  effects  become  evident  from  Hants 
Carlo  and  covariance  analyses  based  on  the  same  models  disagreeing  with  each  other.  Finally,  effects  of 
nonllnearlties  Such  as  device  saturation  or  neglected  tenet  in  attaining  linear  perturbation  equations 
cannot  be  evaluated  by  a  covariance  analysis  except  ir.  an  approximate  manner  based  on  describing  functions 
[22],  and  a  full  investigation  requires  a  Hunt*  Carlo  analysis. 

4.  USE  OF  PERFOftMNCE  ANALYSIS  IN  DESIGN 


Once  e  performance  analysis  capability  1$  established,  a  systematic  iterative  design  and  tradeoff  of 
proposed  filters  can  u*  conducted.  First  of  all,  performance  analysis  allows  proper  fljter  tyring 
{1 ,6, 22-24, 27, ». 32-35].  The  basic  objective  of  tuning  is  to  achieve  the  best  possible  estimation  accuracy 
from  a  proposed  filter  by  selection  of  filter  design  parameters  of  P^and  the  time  histories  of  £  and  R 
(see  Eqs.  (3),  (5),  and  (7)).  Basically,  P  is  the  determining  factor  ir.  the  initial  transient  perform¬ 
ance  of  the  filter,  whereas  the  £  and  R  histories  dictate  the  longer  tern  or  steady  state  perforvvwc*  end 
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tine  duration  of  transients.  These  covariances  not  only  account  for  actual  noises  and  disturbances  In  the 
physical  system,  but  also  are  a  means  of  declaring  how  adequately  the  filter-assumed  mode!  represents  the 
“real  world"  system.  Therefore,  the  simpler  and  less  accurate  the  model,  the  stronger  the  noise  strengths 
should  be  set  (through  addition  of  "pseudonoise"  to  the  noises  associated  with  true  physical  disturbance 
phenomena).  However,  it  is  difficult  If  not  Inposslble  to  declare  best  parameter  values  a  priori,  and  the 
specification  is  usually  the  result  of  an  iterative  search. 

When  tuning  a  filter.  It  Is  useful  to  compare  the  actual  estimation  error  statistics,  as  provided  by 
f>  (t),  to  the  filter's  own  representation  of  Its  error  statistics  through  Its  internally  computed  covari¬ 
ance  matrix  given  by  Eqs.  (9),  (13),  (14),  and  (16).  Superimposed  plots  of  “true"  and  fi  1  ter- coup uted 
root  mean  square  errors  in  estimating  Individual  quantities  of  interest,  as  depicted  in  Fig.  2,  are  an 


Fig.  2  Filter  Turning  Though  fcrforwMit*  Analysis,  (a)  Filter  wdemtimite*  its  tm 
errors:  divergence,  (b)  Filter  overestimates  Us  aw  errors:  tract  ioo  of 
messureawt  noise,  (c)  yell-tuned  filter. 
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inv.luable  aid  to  tuning  and  are  often  provided  interactively  by  performance  analysis  software  [23,24]. 

If,  due  to  mistuned  noise  parameters  such  as  attributing  inappropriately  snail  uncertainty  to  the  internal 
dynamics  model,  the  filter  underestimates  its  own  errors  as  in  Fig.  2a,  the  filter  "believes"  its  model 
output  too  much  and  does  not  weight  the  measurement  information  heavily  enough.  This  can  cause  filter 
divergence  [1,22,36-38]  if  the  discrepancy  is  significant  enough.  On  the  other  hand,  if  inappropriately 
snali  uncertainty  and  noise  corruption  is  associated  with  the  measurements,  the  filter  overestimates  its 
own  errors  and  weights  the  measurements  too  heavily,  expending  too  much  effort  tracking  the  noisy  data  and 
not  exploiting  the  benefits  of  its  internal  model  enough,  as  in  Fig.  2b.  By  choosing  the  noise  parameters 
so  .hat  the  overall  time  histories  of  actual  and  filter-computed  rms  errors  match  well,  the  actual  rms 
errors  are  effectively  reduced  at  the  same  time,  as  seen  in  Fig.  2c.  Allowing  the  filter  to  overestimate 
its  own  errors  slightly,  thereby  guarding  against  divergence,  is  a  commonly  adopted  means  of  generating  a 
'conservative1'  or  robust  filter  design,  able  to  withstand  modeling  inaccuracies  without  becoming  divergent 
[39].  Game  theoretic  minimax  approaches  [40]  have  also  been  used  to  design  filters  with  acceptable 
performance  over  .«■  entire  range  of  uncertain  parameter  values.  Filters  are  also  purposely  "robustifled" 
in  control  applications  in  order  to  maintain  closed  loop  system  stability  despite  large  variations  in 
system  parameters  or  operating  conditions  fror,i  those  assumed  during  the  design  phase  [41].  Another  more 
formal  result  than  iterative  pseudonoise  adjustment  is  provided  by  the  concept  of  a  minimum  variance 
reduced  order  estimator  [42,43],  but  again  the  performance  analysis  should  be  used  to  indicate  the  true 
capabilities  of  such  a  tuning  approach.  Furthermore,  both  offline  tuning  and  online  adaptation  can  be 
enhanced  by  exploiting  the  fact  that  the  residuals  of  a  well-tuned  Kalman  filter  based  on  an  adequate 
model  should  be  a  zero-mean,  white  Gaussian  discrete-time  process  with  covariance  of 
[H(t.j )P_( t - ” )H^ ( t^ )  +  R^.)]  as  computed  in  the  filter  in  Eq.  ( 14 ) [T ,20 ,22] . 

Once  a  particular  filter  has  been  tuned,  an  error  budget  [1,22,33-35]  can  be  established.  This  is  a 
depiction  of  the  contributions  of  individual  error  sources  to  overall  estimation  errors,  consisting  of 
repeated  performance  analyses  in  which  single  or  small  groups  of  error  sources  in  the  truth  model  are 
"turned  on"  individually.  If  the  filter  under  test  were  based  upon  the  full-scale  truth  model,  such  an 
error  budget  would  suggest  potential  choices  of  states  to  neglect  in  a  reduced-order  design  (increasing 
the  strength  of  appropriate  noises  to  account  for  such  deletion)  that  would  yield  the  least  perfotmance 
degradation.  On  the  other  hand,  if  the  filter  under  test  were  a  proposed  practical  design  and  tuned 
properly,  this  error  budget  would  indicate  dominant  sources  of  error,  which  may  warrant  either  better 
models  for  error  compensation  within  the  filter  or  better  system  hardware  (i.e.,  changes  in  the  truth 
model)  if  better  performance  is  sought.  Sensitivity  of  estimation  precision  to  parameter  variations 
in  the  filter  or  system  hardware  can  be  oBTaThed  in  a  straightforward  manner  by  repeated  covariance 
analyses  embodying  the  parameter  changes,  or  adjoint  methods  [44]  can  be  used  to  describe  local  sensitivity 
to  small  variations  in  many  parameters  simultaneously. 

A  systematic  desig-  procedure  would  first  entail  developing  a  "truth  model"  to  portray  actual  system 
behavior  very  accurately,  as  vaTTdated  with  laboratory  and  operational  test  data;  if  it  is  nonlinear,  it 
should  be  linearized  about  an  appropriate  nominal  for  later  covariance  analyses.  The  Kalman  filter  based 
upon  the  "truth  model"  is  generated  as  a  benchmark  of  performance;  for  this  filter,  there  is  no  "tuning" 
to  be  accomplished  and  the  fi lter-computed  covariance  is  the  desired  true  error  covariance.  Simplified, 
reduced  order  system  models  are  then  proposed  by  deleting  and  combining  states  associated  with  nondominant 
effects,  removing  weak  coupling  terms,  employing  approximations  such  as  constant  gains,  and  the  like  - 
this  part  of  the  design  effort  requires  substantial  physical  insights  into  the  problem  at  hand.  Then  a 
covariance  performance  analysis  of  each  proposed  Kalman  filter  is  conducted;  as  an  iteration  within  this 
step,  each  filter  is  tuned  to  provide  best  possible  performance  from  each.  A  Monte  Carlo  analysis  of  the 
most  promising  designs  is  generated,  as  Is  a  performance/computer  loading  tradeoff  analysis  to  select  a 
final  design.  This  chosen  design  is  then  Implenented  on  the  online  computer  to  be  used  in  the  actual 
system;  for  numerical  stability  and  numerical  precision  of  the  online  fi’ter  at  modest  wordlength,  this 
implementation  is  best  accomplished  in  square  root  or  U-D  covariance  factorization  form  [1  ,43,45-47]. 
Finally,  checkout,  any  required  final  tuning,  and  operational  test  of  the  filter  is  performed.  Even  in 
this  last  phase,  performance  analyses  can  be  used  to  investigate  and  extend  the  results  observed  in  online 
filter  operation  [48].  Through  such  a  design  procedure,  a  logical  decision  process  based  on  sufficient 
empirical  data  is  Incorporated  into  the  filter  Implementation. 

5.  EXAMPLE 


Currently  the  Air  Force  is  developing  tactical  weapon  systems  that  will  uf^o^d  precision  standoff 
delivery  of  ordnance.  One  such  system  is  a  glide  vehicle  with  midcourse  and  terminal  navigation  ano 
guidance  accomplished  through  use  of  a  strapdown  Inertial  navigation  system  (INS)  aided  by  a  radiometric 
area  correlator  (RAC). 

Two  different  low-cost  strapdown  inertial  systems  are  competing  for  implementation.  Although  both  use 
conventional  accelerometers  to  measure  specific  force,  one  INS  employs  laser  gyroscopes  to  measure  angular 
rates,  while  the  other  uses  conventional  dry  gyros.  This  difference  will  be  seen  to  have  a  significant 
impact  on  the  two  systems'  error  characteristics  and  on  Kalman  filter  performance  capabilities. 


As  the  glide  vehicle  flies  a  desired  trajec  ory,  the  RAC  provides  a  number  of  accurate  position  fixes 
by  correlating  a  radiometric  "picture"  of  the  terrain  Immediately  below  the  vehicle  with  a  prestored 
reference  map  of  that  region.  The  number  of  such  fixes  is  limited  by  the  amount  of  computer  memory 
allotted  for  the  reference  maps,  rive  or  six  being  a  practical  upper  bound. 


A  Kalman  filter  was  designed  [49]  to  combine  the  Information  received  from  the  INS  and  RAC,  to  estimate 
the  errors  being  committed  by  the  INS,  and  to  feed  back  corrective  signals  to  remove  these  estimated 
errors.  Because  of  the  restricted  amount  of  computer  memory  allocated  to  the  Kalman  filter  (less  than 
1000  words),  the  proposed  design  is  very  simple  -  two  decoupled  three-state  filters.  However,  the  adequacy 
of  such  a  simple  design  to  meet  performance  specifications  was  subject  to  significant  question,  so  a 
covariance  analysis  has  been  conducted  to  determine  estimation  precision  capabilities  in  a  realistic 
environment  [50 j. 

Certain  aspects  of  this  study,  such  as  RAC  performance  characteristics  and  a  detailed  portrayal  of  the 
glide  vehicle  trajectory  (basically  a  nonmaneuvering  glide  with  terminal  pitchover  and  descent),  are  not 
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available  for  public  release  at  this  time.  For  this  reason,  the  analysis  results  are  presented  in  the 
form  of  percentages  or  unsealed  graphs. 


The  proposed  design  is  actually  composed  of  two  decoupled,  three-state  Kalman  filters,  each  maintaining 
estimates  of  the  INS  position,  velocity,  and  attitude  angle  error  states  along  a  single  coordinate  direction 
(east  and  north  are  the  chosen  axes).  Thus,  the  six  state  variables  being  estimated  are  ix  ,  «Xe  -  east 
and  north  components  of  the  error  in  the  INS-indicated  position;  <$v  ,  6vn  -  errors  in  INS-indicated 
velocity;  *  ,  $  -  errors  in  INS-indicated  attitude  (tilts).  e 


The  dynamics  model  upon  which  the  filters  are  based  is 
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(48) 


and  a  similar  model  for  <5x  ,  ov  ,  and  a.  In  this  equation,  g  is  the  magnitude  of  gravity  and  R  is  the 
equatorial  radius  of  the  Earth.  The  driving  term  w  ,  Isa  white  Gaussian  noise  to  model  acceleration- 
level  errors,  and  w  «  is  an  independent  Gaussian  noise  to  model  errors  associated  with  attitude  error 
angular  rates,  This  equatlon  is  in  the  form  of  Eq.  (4)  with  constant  £  and  G  and  B  s  0,  i.e.,  a  linear, 
constant  coefficient,  stochastic  differential  equation  driven  by  stationary  zero-mean  white  Gaussian  noise 
with  constant  strength  £  in  (5b). 


The  sampled-data  measurements  made  available  to  the  filter  algorithms  are  generated  by  differencing 
INS-indicated  position  and  RAC-indicated  position.  For  instance,  INS-indicated  east  position  would  be  the 
true  position  plus  the  error  state  6xft : 

Xe-INS  3  Xe-TRUE  +  5xe  (49^ 


whereas  the  RAC-indicated  east  position  would  be  modeled  as  the  true  position  corrupted  by  white  Gaussian 
noise  v: 


VRAC 


Xe-TRUE  '  v 


(50) 


Differencing  Eqs.  (49)  and  (50)  at  sample  time  t^  yields 


xe- INS ( t1 ) 

'  Xe-RAC 

«xe(ti)  + 

v(ti ) 

"6xe<V 

D  o  o] 

4W 

+  v(t. 


(51) 


This  equation  and  the  corresponding  result  for  the  difference  of  north  position  indications  are  of  the 
form  of  Eq  (6)  with  constant  H  ana  v  being  a  scalar  discrete-time  zero-mean  white  Gaussian  noise  with 
time- varying  autocorrelation  as  In  (7). 

Formulation  of  a  viable  system  representation  as  a  pair  of  independent  three-state  models  depends  upon 
Insights  and  assumptions  closely  tied  to  the  structure  of  both  the  full-scale  INS  error  model  and  the 
measurement  error  modal.  First,  the  generally  accepted  nine-state  model  of  INS  error  characteristics,  in 
which  position,  velocity,  and  attitude  error  about  three  coordinate  directions  (east,  north,  up)  are 
totally  intercoupled  [25-29],  can  be  partitioned  according  to  the  state  subsets  («x  ,  4v  ,  ♦  ), 

(«V  «v  ,  O,  ♦  ,  and  (4x  ,  «v  ).  The  last  set  is  only  weakly  coupled  to  the  first  slvennstates ,  and 
in  fact  is  totally  decoupled  from  them  if  the  vehicle  were  at  rest  end  the  Earth  were  nonrotating.  More¬ 
over,  the  errors  in  this  vertical  channel  are  kept  suitably  small  with  altimeter  aiding  external  to  the 
filter,  so  these  two  states  are  ignored. 

Under  the  same  assunptlon  of  the  vehicle  at  rest  on  a  nonrotating  Earth,  the  first  three  sets  of  stetes 
also  decouple,  with  the  first  two  being  characterized  by  Schuler  oscillations  and  depicted  as  In  Eq.  (48). 
When  the  vehicle-centered  east-north-up  frame  moves  over  a  rotating  Earth,  response  mode  modification  and 
Intercoupling  occur.  But  the  error  oscillations  are  still  Schuler  dominated  since  the  Schuler  angular  rate 

\/g/RQ  is  significantly  greater  than  coupling  terms  on  the  order  of  Earth  rate  q  relative  to  inertial 
space  and  vehicle  position  angular  rate  relative  to  the  Earth  (I.e.,  velocity  component  divided  by  RJj  and 
Schular  rate  squared  dominates  vehicle  position  angular  accelerations.  Cross  coupling  occurs  among  the 
three  attitude  error  states  predominantly  due  to  nonzero  Earth  rate  and  vehicle  velocity,  and  the  rate  of 
change  of  velocity  error  states  couple  Into  those  attitude  errors  through  nonzero  specific  force.  For 
Instance,  terms  to  be  added  to  the  right-hand  side  of  Eq,  (48)  Include 


0 

'(VS) 


V 

♦n 

J 

s. 

where  f  and  f  are  specific  force  components,  and  u  and  u  are  components  of  angular  rate  of  the  east- 
north-ujj  frameVith  respect  to  inertial  space;  additional  cross-coupling  effects  also  develop  among  the 
nine  error  states.  The  uncertainties  w  ,  and  w '  ,  in  Eq.  (48)  partially  account  for  these  neglected  terms, 
though  in  a  rather  crude  manner.  Were  ft  not  for  the  severe  computer  memory  restriction,  more  explicit  in¬ 
corporation  of  these  effects  and  inclusion  of  the  azimuth  error  state  *  would  warrant  attention  as  a 
means  of  enhancing  performance.  Especially  for  the  application  envisioned  here,  in  which  flight  is  along 
a  rather  benign  glide  trajectory,  the  simplistic  model  given  in  Eq.  (48)  might  well  suffice. 

Decoupling  into  two  separate  filters  also  requires  that  the  measurements  introduce  no  nonnegligible 
cross  coupling.  In  fact,  the  measurement  gradient  matrix  H  associated  with  the  full-scale  INS  error  model 
does  not  intercouple  the  two  horizontal  channels  with  each  other  or  the  vertical  channel.  Moreover,  if 
v,(t.)  and  v^(tj)  are  the  measurement  noises  associated  with  east  and  north  position  differences,  as  in 
Eqs, '(51)  and  (7),  then  the  equi probability  ellipsoids  for  the  vector  [v, (t.)  v2(ts)]‘  must  either  have 

their  principal  axes  aligned  with  the  reference  coordinate  directions  or 'be' (nearly)  circular,  so  that 
there  is  no  cross  correlation  to  Intercouple  the  two  horizontal  channels.  This  poses  no  difficulty  for 
this  particular  problem. 

The  Kalman  filter  based  upon  a  model  described  by  Eqs.  ( 48)- (51 )  is  given  by  Eqs.  (8)-(16).  To  specify 
the  filter  algorithm  completely,  the  dynamic  noise  strength  £  in  Eqs,  (5),  (9),  and  (13),  the  measurement 
noise  strength  tine  history  R(tJ  in  Eqs.  (7)  and  (14),  and  the  initial  state  covariance  P  in  Eq.  (3)  must 
be  established  for  both  filters.  Finding  the  best  such  values  iteratively  through  a  process  of  filter 
tuning  will  be  discussed  subsequently. 

In  actual  operation,  the  filter  (the  two  3-state  filters  can  be  viewed  as  a  single  decoupled  6-state 
filter)  is  driven  by  measurements  z(t.)  from  INS  and  RAC  hardware,  and  provides  estimates  of  the  states, 
x(t/),  which  are  used  as  corrective  feedback  to  the  INS.  For  analysis  purposes,  the  "real  world"  environ¬ 
ment  is  replaced  by  a  “truth  model,"  as  in  Fig.  1,  and  the  quantities  of  Interest  are  the  six  states 
being  estimated. 


VI. 


The  truth  model  required  In  the  performance  analysis  can  be  described  by  (18)  and  (19)  with  Bt  =  0, 
I,  and  H.  constant.  Table  1  presents  the  46  states  of  the  truth  model  for  the  laser  gyro  system 


Table  1  Truth  Model  State  Description 


Slate 

Laser  gyro  INS  system 
f*a  term  Q,  term 

Conventional  gyro  INS  system 

P#  term  Q,  term 

Basic  INS: 

Position  errors  (3) 

(1500  ft)1 

0 

0500  ro1 

0 

Velocity  errors  (3) 

(2  ft/,)1 

0 

(2  ft/!)1 

0 

Attitude  errors  (3) 

(0.5  millirad)2 

7.6x10  n 
rad2/s 

(O.S  millirad)2 

0 

Accelerometer,. 
Accelerometer  bl*xs  (J) 
(d»y-to-d»y  nonrepe«l»blllty> 
Accelerometer  sole  factor 
errors  (3) 

Accelerometer  Input  Mis 
mlullinmentt  (6) 
Accelerometer  bUues  (3) 

(f,  -60  min) 

Accelerometer  bltsei  (3) 

(Tj  *•  IS  min) 


Ortvlty  knowledge: 
Grtvlty  deflection,  (2) 
(rf,  -lOn.  ml.) 
Qrsvlty  »nomily(l) 
(Jj  -60n  ml.) 


dyrosi 

Oyro  drift  t*te  blnses  (3) 


(250m)1 
(500  ppm)1 
(lOirc*!)' 
(40m)1 
(20m)! 


(26*»)l(e»U) 

<l7(i»)»<no«h) 

(35m)1 


(O.Wii**/),)1 


Gyro  sc*lef»ctoe  errors  (3)  (lOOppm)1 

Oyro  Input  Mis  mtulliMUMnti  (4)  (6  irc-ir 

Oyrodtiflt»te(3) 

(V, «  40  win) 

Oyro  drift  rite  (3) 

(7j»)5mln> 

I'lewltlve  drift  coefficients  (4) 
l * "setultltre  drift  coefficient!  (J) 


o  laxwi1 

0  (405.4  ppm)1 

0  (SOirc-s)1 
IP„!T,  (60m)1 
lP„n,  (30 M)1 


yv/ef,  (24m) '(till) 
(|7)k|) ‘(north) 

iPaV.d,  (35m)1 


1.47x10' 11  (1 .0  de«/h) 1  (roll  **ls) 
red1/!1  (l.SSdce/hl’lotheriwol 
0  (500  ppm)1 

0  (30  ire-!)1 

(0,4  deg/hl^oec) 
(O.adef/M’Uif) 
(O.ldej/h)1  (oeu) 

(0.3  def/h)1  (leu) 
(X-Odej/h/f)1 
(0.1  dej/h/*1)1 


KAC  blues  (2) 

Altimeter: 

Altimeter  bUs(l) 

(d«150n.  ml.) 

Alt  tret  er  Kile  feettx  error  (I) 


^(ttonc-tiK'k) 

^(croudrick) 


/*„(ilon*lt»c«) 
Pk  (crou-ltuk) 


(500  ft)1 
0  (0.03)1 


[29,50-52]  and  the  61  states  corresponding  to  the  conventional  gyro  system  [29,50].  Associated  with  each 
state  are  its  initial  variance  (appropriate  diagonal  term)  and  white  driving  noise  strength  ((L  diagonal 
term);  both  and  CL  are  assumed  to  be  diagonal.  For  states  that  are  modeled  as  random  bias  processes 
(the  outputsor  undrlten  Integrator  shaping  filters),  the  appropriate  FL  term  is  given  and  the  0,  term  Is 
given  as  zero.  For  states  that  are  modeled  as  first  order  Harkov  processes  (outputs  of  first-order  lags 
driven  by  white  noise),  the  £L  term  is  described  in  terms  of  the  term  and  correlation  time  T  in  such  a 
manner  as  to  yield  stationary'processes. 


The  first  nine  states  are  the  variables  used  to  describe  the  error  characteristics  of  an  INS,  Although 
each  IKS  under  consideration  Is  a  strapdown  system,  this  error  model  can  be  expressed  conveniently  with 
respect  to  an  east-north-up  coordinate  frame  125,29].  The  <L  tenas  associated  with  attitude  errors  are 
due  to  gyro  drift  and  will  be  discussed  subsequently. 


Accelerometer  errors  are  described  by  means  of  a  day-to-day  non  repeatability  bias,  scale  factor  error, 
two  input  axis  misalignments,  and  two  first-order  Markov  process  states  for  each  accelerometer.  Un¬ 
certainty  in  the  knowledge  of  gravity  also  enters  the  truth  model  state  equations  at  the  acceleration 
level.  The  errors  between  the  true  geoid  and  the  assumed  ellipsoid  for  INS  navigation  computations  have 
been  described  by  means  of  first-order  Markov  process  models  [29],  with  mean  square  values  and  correlation 
distances  as  described  in  Table  1.  If  a  correlation  distance  is  denoted  as  d  and  the  vehicle  velocity 
magnitude  as  v,  a  corresponding  correlation  time  is  generated  as  T  =  d/v,  thereby  yielding  the  (L  term 
expression  in  the  table. 

Gyro  errors  are  depicted  by  a  drift  rate  bias  state  (or  Brownian  motion  state  for  the  laser  gyro, 

i.e.,  the  output  of  an  integrator  driven  by  white  Gaussian  noise),  scale  factor  error,  two  input  axis  mis¬ 

alignments,  two  first-order  Markov  process  states,  two  g-sensitive  drift  coefficients  (spin  and  input 
axes),  and  one  g2-sensitive  drift  coefficient  (major  spin-input  coefficient)  for  each  gyro.  For  the  laser 
gyros,  only  the  first  four  of  these  nine  states  are  Included,  since  the  others  are  essentially  nonexistent. 
Another  marked  difference  from  conventional  gyros  is  embodied  in  the  drift  rate  model.  A  typical  gyro 
drift  rate  model  is  conposed  of  the  sum  of  first-order  Gauss-Markov  components  with  an  additive  white 

Gaussian  noise.  In  conventional  gyros,  the  time-correlated  contributions  dominate  the  very  wideband 

(white)  component,  and  the  latter  is  often  neglected.  However,  for  laser  gyros,  the  wideband  (modeled  as 
white)  component  predominates;  its  noise  strength  is  given  by  the  terms  driving  INS  attitude  errors  in 
Table  1.  A  final  difference  of  the  two  gyro  types  Is  the  set  of  multiple  table  entries  for  certain  con¬ 
ventional  gyro  states.  On  the  Markov  process  states,  oav  denotes  output  axis  vertical,  while  lav  means 
input  axis  vertical.  The  roll  axis  gyro  drift  rate  bias  entry  is  higher  than  the  others  because  a  different 
gyro  design  is  employed  to  withstand  and  Indicate  the  larger  range  of  rates  that  can  occur  about  this  axis. 
In  the  laser  gyro  INS,  the  gyro  sensitive  axes  are  canted  off  from  the  vehicle  body  axes  to  distribute 
high  roll  rates  among  three  identical  gyros. 

Although  Table  1  shows  accelerometer  errors  to  be  very  similar  in  the  two  inertial  systems,  the  gyro 
characteristics  are  significantly  worse  In  the  conventional  gyro  INS.  The  low-frequency  power  spectral 
density  value  of  the  Gauss-Markov  drift  rate  components  in  the  conventional  gyro  Is  three  orders  of  magni¬ 
tude  worse  than  the  laser  gyro  white  noise  component.  Moreover,  drift  rate  biases,  scale  factor  errors, 
and  misalignments  are  considerably  greater;  and  the  g  and  g2  errors  have  no  counterpart  in  the  laser  gyro 
system. 

The  errors  in  the  RAC  data  are  modeled  as  a  corruptive  white  Gaussian  noise  plus  a  bias.  This  is  a 
necessarily  unsophisticated  model  of  RAC  error  characteristics,  since  only  sparse  and  Incomplete  perform¬ 
ance  data  were  available  at  time  of  truth  model  development.  Nevertheless,  these  data  were  sufficient  to 
estimate  appropriate  noise  strengths  and  to  indicate  that  bias  effects  were  not  negligible.  The  strength 
of  the  two-dimensional  white  noise,  v^.  in  Eq.  (Vj),  was  found  to  be  well  modeled  as 

R^t,)  =  [e  •  h^)]2!.  (52) 

where  h(tj  is  the  vehicle  altitude  and  8  is  a  parameter  with  classified  numerical  value.  Each  bias  was 
modeled  as  a  random  constant  with  mean  zero  and  variance  as  shown  in  Table  1,  again  the  numerical  values 
being  classified.  Although  physical  reasoning  could  lead  to  altitude- depen  dent  variances  on  the  bias 
states  as  well,  the  available  data  were  not  consistent  or  complete  enough  to  warrant  this  formulation. 
Because  high  statistical  confidence  could  not  be  placed  in  this  model,  a  study  of  performance  sensitivity 
to  bias  model  parameter  variations  was  deemed  essential;  this  will  be  discussed  further  in  the  analysis 
presentation. 

Finally,  the  altimeter  errors  are  described  in  terms  of  a  first-order  Markov  process  noise  plus  a 
scale  factor  error.  The  altimeter  is  used  to  damp  out  the  Inherently  unstable  vertical  errors  in  the  INS, 
and  so  its  errors  drive  certain  INS  error  states  in  the  truth  model. 

The  covariance  analysis  technique  was  first  used  to  tune  the  proposed  filter  for  use  in  each  of  the 
two  I  NS/RAC  system  configurations  [50].  The  filters'  P_  and  time  histories  of  0  and  R  were  iteratively 
modified  to  yield  minimum  rms  values  of  the-  estimationerror  e,  components  for  all  time  of  Interest.  For 
this  application,  terminal  position  errors  ire  especially  Important,  but  the  entiie  history  of  all  errors 
must  be  considered  to  preclude  being  outsid:  the  bounds  of  a  prestored  RAC  map  at  an  update  time  and  to  ' 
Insure  sending  proper  corrective  control  cotmands  during  the  terminal  phase  of  flight. 

Figure  3  plots  the  rms  error  (in  log  scale)  In  the  east  position  estimate  provided  by  the  filter  tuned 
to  the  laser  gyro  system.  To  aid  the  tuning  process,  these  "actual"  rms  errors  were  compared  with  the 
filter's  own  representation  of  Its  errors  -  its  own  computed  covariance  £.  Despite  the  simple  filter  form 
and  the  fact  that  a  constant  Q  is  used  for  all  time,  the  filter-computed  rms  error  history  essentially 
duplicates  the  results  shown  In  Fig,  3.  Moreover,  this  condition  does  effectively  yield  the  best  estimate 
precision.  The  results  •for  the  other  five  filter  states,  and  those  for  the  conventional  gyro  system,  are 
very  similar. 

For  coaputational  simplicity,  it  was  proposed  to  approximate  the  integral  term  in  Eq.  (13)  by  a 
diagonal  matrix  [49].  The  original  such  design  was  found  to  be  severely  out  of  tune,  and  even  the  best 
tuning  achievable  with  a  diagonal  matrix  form  yielded  noticeably  degraded  performance,  foe  degradation  was 
naturally  least  in  the  channels  for  which  direct  measurements  were  available,  I.e.,  position  errors,  mid 
these  are  the  estimates  of  primary  interest  for  this  application.  However,  the  «wputation  of  three  off- 
diagonal  terms  in  a  synmetric  3x3  matrix  is  not  burdensome.  Moreover,  a  followup  stuty  has  indicated  a 
substantial  increase  In  importance  of  Wiese  off-diagonal  terms  for  obtaining  good  performance  along  more 
highly  dynamic  trajectories  with  optimized  measurement  sample  times.  Therefore,  weapon  system  development 
and  testing  was  pursued  with  the  design  changed  to  Incorporate  these  terms. 

An  error  budget  was  generated  to  depict  the  contributions  of  individual  error  sources  to  the  ms  errors 
throughout  tne  vehicle  flight.  Once  the  filter  was  tuned,  repeated  covariance  analyses  were  conducted, 
each  with  a  single  error  source  removed.  Table  2  presents  the  results  for  rms  position  errors  »t  the 
terminal  time.  From  this  table,  It  is  evident  that  the  RAC  errors  have  the  greatest  influence  on  estimate 
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Fig.  3  RMS  Error  In  East  Position  Estimate 

precision  at  the  terminal  time.  This  Is  caused  by  the  extreme  accuracy  of  low-altitude  RAC  position  fixes 
and  the  fact  that  the  last  two  fixes  are  taken  shortly  before  the  end  of  flight  to  maximize  the  benefit  of 
the  limited  nuntier  of  updates.  Error  budgets  for  estimation  errors  earlier  in  the  flight  reveal  an  Increased 
Importance  of  INS  sensor  errors. 

Table  2  also  reveals  that  the  laser  gyro  INS  configuration  outperforms  the  conventional  gyro  system,  as 
would  be  predictable  from  relative  precision  of  Instruments  as  described  In  Table  1.  Also,  the  white  noise 
gyro  drift  rate  model  In  the  filters  Is  appropriate  for  a  laser  gyro,  whereas  a  first-order  Markov  process 
model,  requiring  an  additional  state  per  filter,  would  be  a  significantly  better  model  for  a  conventional 
gyro.  The  table  also  shows  that  the  gyro  errors  In  the  conventional  INS  system  play  a  more  dominant 
relative  role  In  degrading  performance  than  the  same  errors  do  In  the  laser  gyro  INS.  These  trends  are 
accentuated  at  earlier  times  In  the  flight,  especially  In  the  case  of  dynamic  trajectories. 

Because  of  the  significance  of  RAC  errors  and  the  sparse  amcirt  of  test  data  concerning  bias  errors 
.  In  thlS'devIce,  the  sensitivity  of  estimation  accuracy  to  varying  bias  levels  was  analyzed.  Table  3  demon¬ 
strates  the  effect  of  varying  the  RAC  bias  variance  from  zero  to  four  times  the  value  listed  In  Table  1. 

These  results  and  those  depicted  In  Table  2  reveal  that.  If  performance  requirements  are  not  met,  seeking 
a  better  RAC  system  would  be  more  beneficial  than  proving  the  INS  precision.  Similarly,  If  the  filter 
complexity  could  be  Increased,  It  would  be  most  advantageous  to  Incorporate  a  better  ndel  for  th*  errors 
In  the  RAC  system  position  data. 

Direct  estimation  of  RAC  biases  by  adding  a  fourth  state  to  each  filter  1$  not  feasible:  adding  the 
model  5  •  w,  to  Eq.  (48) ,  and  modifying  Eq.  (SI)  to  let  z  be  («x_  ♦  b  ♦  v),  yields  an  unobservable  system 
model.  Basically,  the  filter  would  not  be  able  to  provide  valirestlmates  of  and  b  separately.  Im¬ 
proved  estimation  performance  can  be  achieved  by  replacing  Eq.  (£i)  with 

aft,}  •  ([e  •  h(tt)32  ♦  A^ni 
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Table  2  Error  Budget 


Table  3  Sensitivity  to  RAC  Bias 


%  of  terminal  rms  nav  errors 

Laser 

Conventional 

Error  source  removed 

gyro  INS 

INS 

None  (baseline) 

100 

100 

(  =  107.5%  of 

laser  error) 

Accel  errors 

100 

99.9 

Gyro  errors 

100 

98.1 

Initial  condition 

100 

100.0 

RAC  bias 

95 

% 

All  RAC  errors 

9 

=ac  f  ■  i  =ta 

11 

%  of  terminal  rim  nav  errors 


RAC  bias  model 

Laser 

Conventional 

standard  deviation 

gyro  INS 

INS 

0 

95 

96 

standard 

100 

100 

2  x  standard 

115 

119 

in  the  filter  formulation,  where  R.  scales  with  the  variance  of  the  RAC  bias  in  the  truth  model.  Although 
bias  errors  are  not  directly  compensated,  the  better  model  for  nms  measurement  errors  yields  more  proper 
weighting  of  update  information,  and  thus  enhanced  performance.  However,  these  conclusions  are  based  upon 
the  adequacy  of  the  truth  model  depiction  of  RAC  errors.  Once  enough  performance  data  can  be  analyzed  to 
have  statistical  confidence  in  the  RAC  error  model,  the  preceding  modification  and  other  means  of  enhance¬ 
ment  can  be  fully  exploited  in  the  operational  filter. 

Thus,  because  of  severe  restrictions  on  computer  time  and  memory  allotted,  a  very  simple  Kalman  filter 
was  designed  to  update  a  strapdown  inertial  system  with  position  fixes  from  a  radiometric  area  correlator. 
Nevertheless,  its  performance  has  been  analyzed  and  found  to  meet  specifications. 

6.  CONCLUSIONS 

Attaining  an  adequate  model  upon  which  to  base  a  Kalman  filter  is  an  essential  aspect  of  designing  an 
operational  online  filter  algorithm.  Numerous  methods  have  been  presented  for  generating  models  and 
filters  of  substantially  different  levels  of  complexity,  state  dimension,  and  performance  potential. 
Another  Integral  part  of  a  systematic  design  approach  is  the  realistic  evaluation  of  any  proposed  filter's 
performance  in  estimating  quantities  of  Interest  when  subjected  to  the  real  world  environment.  Such 
performance  analysis  capability  has  also  been  described,  and  once  again  the  adequacy  of  modeling  efforts, 
here  In  the  form  of  producing  a  "truth  model"  that  accurately  depicts  the  real  world  regardless  of  its 
required  complexity.  Is  shown  to  be  an  Issue  of  primary  importance.  The  design  process  Itself  is  composed 
of  Iteratively  proposing  alternative  filter  desigis,  tuning  each  for  best  performance  admitted  by  its 
structure,  evaluating  error  budgets  and  sensitivities  to  parameter  variations,  and  trading  off  performance 
capabilities  and  computer  loading  to  yield  the  final  algorithm  for  implementation. 
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SUMMARY 

A  simple  extended  Kalman  filter  is  designed  to  track  targets  using  outputs  from  a  forward-looking  in¬ 
frared  (FLIR)  sensor  as  measurements.  It  exploits  knowledge  unused  by  current  correlation  trackers  - 
size,  shape,  and  motion  characteristics  of  the  target,  atrospheric  jitter  spectral  description,  and  back¬ 
ground  and  sensor  noise  characteristics  -  to  yield  enhanced  performance.  Monte  Carlo  performance  analyses 
indicate  that  the  ability  of  a  nonadaptive  four-state  filter  to  track  a  realistic  distant  point  source 
target  with  an  error  standard  deviation  of  0.2  picture  elements  under  expected  tracking  conditions  sur¬ 
passes  the  correlation  trackers'  abilities  by  an  order  of  magnitude. 

Although  very  accurate  tracking  performance  is  achieved  under  nominally  assumed  conditions,  robustness 
studies  portray  a  significant  degradation  when  the  filter's  internal  model  does  not  depict  the  target's 
intensity  profile  or  motion  characteristics  well.  Background  noise  properties  are  shown  to  be  of  secondary 
Importance  at  expected  slgnal-to-nolse  ratios.  These  studies  enphasize  the  need  for  good  models  and 
adaptivity  within  the  filter  structure. 

In  order  to  track  air-to-air  missiles  at  close  range,  an  eight-state  filter  incorporates  a  modified 
target  dynamics  model  as  well  as  online  adaptation  to  target  shape  effects,  changing  target  motion  charac¬ 
teristics,  and  maximum  signal  intensity.  It  is  shown  to  possess  considerable  performance  potential  for 
highly  maneuverable  targets  despite  background  clutter. 


1.  INTRODUCTION 

Current  research  and  development  efforts  are  examining  several  methods  of  tracking  a  target  for  the 
purpose  of  depositing  high  power  laser  energy  on  that  target  in  the  presence  of  several  disturbances. 

These  disturbances  include  any  effect  that  can  cause  relative  motion  between  the  beam  and  target,  such  as 
target  motion,  atmospheric  jitter,  vibration  in  optics  mirrors,  and  sensor  measurement  errors. 

One  tracking  method  under  investigation  employs  a  forward-looking  Infrared  (FLIR)  sensor  together  with 
a  correlation  algorithm  to  provide  relative  target  position  information  to  the  laser  pointing  system.  The 
FLIR  sensor  generates  averaged  outputs  of  an  array  of  infrared  detectors  as  they  are  mechanically  scanned 
through  a  limited  field  of  view.  The  digitized  outputs  can  either  be  stored  or  displayed  on  a  cathode  ray 
tube  (CRT)  in  real  time,  each  output  corresponding  to  the  average  intensity  over  one  pictu'  !  element 

(pixel).  The  horizontal  and  vertical  scanning  of  the  detectors  through  the  FLIR  field  of  view  results  In 

an  array  of  pixels  called  a  frame  of  data,  with  normal  frame  rates  on  the  order  of  30Hz.  Because  of  this 
rapid  measurement  rate,  attention  can  be  confined  to  a  pixel  array  smaller  than  the  entire  frame  for  track¬ 
ing  purposes:  an  8-by-8  arrey  is  a  typical  tracking  window,  yielding  a  tolerable  amount  of  computer 
storage  and  loading. 

The  correlation  algorithm  [1  -  4]  first  stores  a  complete  set  of  Intensity  data  from  the  FLIR  outputs, 
and  then  correlates  those  data  with  the  new  information  at  a  later  time.  In  this  manner,  it  estimates  the 
two-dimensional  position  offset  from  one  set  of  data  to  the  next,  which  can  be  used  to  generate  commands 
to  keep  the  system  centered  on  the  target.  This  type  of  tracker  needs  no  prior  information  about  the  type 

of  target  to  perform  the  tracking  ftmctlon,  and  so  it  is  well  suited  to  many  general  applications. 

In  many  practical  tracking  problems,  however,  the  type  of  target  being  tracked  will  be  known,  even  if 
in  a  very  general  sense.  This  implies  that  certain  target  parameters  such  as  shape,  size,  and  motion 
characteristics  will  either  be  known  or  could  be  estimated.  Moreover,  the  statistical  effects  of  atmos¬ 
pheric  disturbances  on  radiated  wavefronts  are  known  and  could  supply  information  to  a  tracker  that  would 
aid  in  separating  the  true  target  motion  from  the  apparent  motion  (jitter)  due  to  these  disturbances. 

This  separation  is  iroortant  since  the  wavefront  of  the  high  energy  laser  will  not  undergo  the  same  dis¬ 
tortion  In  the  atmosphere  as  the  infrared  wavefronts  emanating  from  the  target. 

The  purpose  of  this  effort  is  to  exploit  the  knowledge  about  potential  target  characteristics  and 
atmospheric  jitter  in  the  design  of  an  extended  Kalman  filter  [5-7]  to  replace  the  current  correlation 
algorithm  in  the  tracking  loop.  Initially,  a  simple  extended  Kalman  filter  algorithm  is  designed  to  track 
a  point  source  (distant)  target  with  rather  benign  dynamics,  based  on  FLIR  measurements  assumed  to  be 
corrupted  by  temporally  and  spatially  uicorrelated  noises,  and  assuming  all  system  defining  parameters 
have  known  nominal  values  [8,9],  Section  2  establishes  the  required  mathematical  models  and  basic  filter, 
and  Section  3  analyzes  its  performance  capabilities  In  a  realistic  environment  via  Monte  Carlo  analysis 
[8,9].  It  consistently  outperforms  the  correlation  tracker  under  nominally  assumed  conditions,  employing 
knowledge  unused  by  that  tracker  to  yield  the  enhanced  capability.  Robustness  studies  are  conducted  in 
Section  4  to  indicate  how  much  filter  performance  degrades  when  an  accurate  portrayal  of  the  tracking 
problem  differs  from  that  assumed  in  the  filter  desigi  [10,11].  Of  specific  interest  are  variations 
in  (1)  the  height,  spread,  shape,  and  orientation  of  the  target  intensity  pattern  in  the  FUR  image  plane, 
(2)  target  motion  characteristics,  and  (3)  background  noise  rms  value  and  both  spatial  and  tenportl 
correlations.  This  investigation  provides  Insights  Into  a  prioritized  list  of  design  modifications  and 
online  adaptation  capabilities  required  to  allow  this  type  of  filter  to  track  highly  maneuverable  targets, 
with  spatially  distributed  and  changing  image  intensity  profiles,  against  background  clutter.  The  sub¬ 
sequent  sections  delineate  specific  modeling  and  adaptation  methods  to  yield  a  filter  capable  of  accurate 
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tracking  of  an  air-to-air  missile  at  close  range,  with  performance  evaluations  of  proposed  designs  achieved 
by  Monte  Carlo  simulations  [11,12]. 

2.  MODELS  AND  FILTER  FOR  3ENIGN  TRACKING  TASK 


As  originally  conceived,  the  problem  of  interest  was  accurate  tracking  of  a  point  source  target  based 
on  FLIR  measurements,  to  provide  appropriate  Inputs  to  a  pointing  controller  [8,9].  In  essence,  this 
Involves  determining  the  pointing  errors  in  two  dimensions  from  the  center  of  the  FLIR  field  of  view, 
given  measurements  of  average  intensity  level  over  each  of  64  pixels  in  an  8-by-8  "tracking  window"  array 
provided  by  the  FUR  at  a  30  Hz  rate.  Many  applications  for  which  this  system  is  being  considered  do  in 
fact  require  the  acquisition  and  tracking  of  targets  at  long  ranges.  Because  of  these  distances,  even 
very  large  targets  appear  as  point  sources  of  Infrared  radiation  and  can  be  accurately  modeled  as  such. 
Due  to  the  physics  of  wave  propagation  and  optics,  the  resulting  intensity  pattern  on  the  FUR  image 
plane  can  be  modeled  as  a  bivariate  Gaussian  function  with  circular  equal  Intensity  contours.  This  is  a 
special  case  of  elliptical  contours  as  depicted  in  Fig.  1,  in  which  a  ,  =  o„2  =  og.  Letting 
^xpeak^t'*  ypeak^)  ^ocate  th®  center  of  the  pattern  relative  to  the9centar  of  the  8-by-8  pixel  array, 

the  apparent  target  Intensity  model  for  circular  equal  intensity  contours  is 


Itarget^x’S,t^  =  ^x  exp£*  2^"  ^V^eak^  +  t^y  “  ypeak^^  ( 

where  the  peak  Intensity  value  I  and  glint  dispersion  a-  were  originally  assumed  to  be  known.  The 
apparent  location  of  the  target  factually  the  sum  of  effects  due  to  true  target  dynamics,  atmospheric 
disturbances,  and  vibration  [denoted,  respectively,  by  subscripts  d,  a,  and  v): 


Vak^)  “  xd^  +  xa^  +  xv^  (2a) 

Wt}  ’  yd(t)  +  yaW  +yv(t)  (2b) 

The  objective  of  the  tracker  is  to  estimate  x,,(t)  and  yTt)  so  that  they  can  be  regulated  by  closed-loop 
control.  a  a 


A  generally  applicable  model  for  benign  target  dynamics  is  desired,  one  which  is  simple  and  yet 
accounts  for  the  time-correlated  behavior  of  realistic  targets.  To  fulfill  these  objectives,  an  inde¬ 
pendent  first-order  Gauss-Markov  model  in  each  direction  was  chosen,  as  produced  by 

.  xd(t)  ■  -D/Td]  xd(t)  ♦  wdx(t)  (3) 

where  T.  is  the  characteristic  correlation  time  of  the  target  and  wd  is  a  zero-mean  white  Gaussian  noise 
with  autocorrelation  function  ox 

E  (wdx(t)wdx(t+Y)}  -  2od2/Td  «(t)  (4) 

so  that  od  is  the  rms  value  of  xd(t),  and  similarly  for  yd(t).  By  suitable  choice  of  od  and  Td,  sanples 
from  these  processes  can  be  made^o  exhibit  anplltude  and°rate-of-change  characteristics  appropriate  to  a 
variety  of  long  range  targets  as  seen  in  the  image  plane.  Again,  these  two  parameters  were  originally 
assumed  to  be  known. 


Atmospheric  disturbance  causes  an  apparent  offset  of  the  location  of  the  intensity  pattern  known  as 
jitter.  Through  spectral  analysis  of  this  phenomenon,  it  has  been  shown  [13,14]  that  x  (t)  and  y  (t)  can 
each  be  adequately  modeled  as  the  output  of  a  third-order  linear  shaping  filter  [6]  with  transfer*function 
[Ku,u2(s-rw1r1(s+tt2)‘2]  ,  drtven  by  unit-strength  white  Gaussian  noise.  Here  *n  14  rad/sec, 

*2  ^  860  rid/sec,  and  K  can  be  adjusted  to  obtain  the  desired  rms  jitter  characteristic  on  the  output  and 
was"assuwd  to  be  known. 

Vibrations  of  the  FUR  system  can  also  cause  relative  pointing  errors.  However,  for  this  study.  It  is 
assumed  to  be  on  a  groimd-based  stable  platform,  so  vibration  effects  are  neglected.  For  airborne  appli¬ 
cations  and  other  scenarios,  vibration-induced  effects  may  warrant  considerably  more  attention. 


Thus,  the  target  Intensity  pattern  given  by  (1)  has  been  fully  described.  However,  this  pattern  is  not 
directly  available  for  observation;  it  is  corrupted  by  background  noise  and  inherent  FUR  errors  first. 
There  are  various  forms  of  background  noise,  ringing  from  nl^it  sky  background  to  clutter,  i.e.,  from  zero 
to  high  tine  and  spatial  correlations.  FLIR  errors,  such  as  thermal  noise  and  dark  current  effects,  can  be 
modeled  a*  teeporaily  and  spatially  uncorrelated  noise.  Letting  z.Ut.)  denote  the  measurement  available 
at  time  t{  of  the  average  Intensity  over  the  pixel  in  the  Jth  row  ind  Rth  coluem  of  the  8-hy-8  array,  then 


W"  JLgfonof  Vtt) 

jkth  pixel 


(5) 


where  A.  is  the  area  of  one  pixel,  n,.(t.)  models  the  FUR  noise  effects,  and  b>.  (t.)  models  the  background 
effect$pon  the  jkth  pixel.  Arreyingrthe  64  scalar  equations  (S)  in  a  single  measurement  vector  yields  a 
measurement  model  of  the  form 


l(^)  •  ♦  n(t1 )  ♦  b(t|) 


(6) 


where  x  is  the  output  of  en  eight-state  linear  dynamics  system  model  (one  state  equation  as  given  by  (3) 
and  three  coupled  linear  equations  to  generate  x  ,  and  similarly  for  the  y  axis),  and  the  other  vectors  are 
of  dimension  64.  From  (S)  and  (6)  it  can  be  seen  that  Ji  represents  the  effect  of  the  point-spread  function 
given  in  (1).  Note  that,  In  such  a  formulation,  the  spatial  correlation -of  background  noise  is  readily 
represented  by  the  off-diagonal  eleimnts  of  the  64-by-64  matrix  E{b(tj)b'(tj)h 


The  model  just  developed  accounts  for  time-correlated  dynamics,  bandwidth  effects  of  jitter,  and  other 
pertinent  characteristics.  An  extended  Kalman  filter  could  be  based  on  this  model  to  perform  the  desired 
tracking  task. 

First,  to  enhance  computational  feasibility,  the  model  can  be  simplified  to  some  degree.  In  view  of 
the  discrepancy  between  the  two  break  frequencies  of  the  atmospheric  disturbance  shaping  filter  and  the 
greater  Importance  of  the  lower  frequency  characteristics,  atmospheric  disturbance  effects  x  and  y  were 
approximated  as  the  outputs  of  first-order  systems  with  break  frequency  uj  (thus  preserving  proper 
spectral  shape  at  the  significant  frequencies  below  u^).  This  yielded  a  four-state  linear  time-invariant 
model  of  the  form 

x(t)  -  Fx(t)  +  w(t)  (7) 

with  diagonal  f^and  stationary  white  Gaussian  Input  w(t).  Thus,  the  filter  equations  for  propagating  the 
state  estimate  x  and  error  covariance  P  from  sample  time  t,+,  (after  measurement  Incorporation  at  that 
time)  to  t^'  (before  measurement  update)  become  [6]: 

£(tt’)  ■  ixtt^)  (8) 

Ett^)  -  +  Qd  0) 

where  the  state  transition  matrix  *  (also  diagonal)  and  Input  covariance 

♦{trT}  £!T(trt)dT  (10) 

Vi 

are  constant  and  readily  calculated  once  offline.  Because  of  the  low  state  dimensionality,  linearity, 
and  time  Invariance  of  the  dynamics  model,  these  filter  time  prew.qailons  are  especially  simple. 

Furthermore,  each  measure  rent  equation,  (5),  was  &1a*>1tf'ed  as  well.  First,  the  two-dimensional 
Integral  term  Is  teplaced  by  ltarget^xck,ycj’V  w^ere  (;tck,ycj^  the  lQca-1on  of  the  center  of  the 

jkth  pixel.  Second,  the  combined  effects  of  n  and  b  in  (5)  are  represented  by  a  single  vector  v, 
assumed  to  have  spatially  and  tenporally  uncorrelated  components  of  constant  and  equal  variance” 


E{v(tj )vT(tj)l  - 


Vtj  • 


These  simplifications  are  made  to  reduce  complexity  substantially,  but  are  subject  to  reevaluatton  If 
performance  capabilities  are  Inadequate. 

The  large  number  of  ireasurements  cause  a  cosputational  loading  probUsi  in  the  normal  extended  Kalman 
filter  formulation  of  the  measurement  update: 

Ktt,}  •  P( t< * )HT{t< ) [H(ti )£,( ti *)HT( t4 )  +  itt^r1  (12) 

P(t/)  -  Pit,*)  -  K<t1)H{t1)P(t1’*)  (13) 

x(t^)  *  Jt(tt“)  ♦  *<t,)(i{t,)  -  }>  (14) 

where  h  Is  defined  In  (6)  and  approximated  componentwise  by  U#rget  evaluated  at  the  center  of  each  pixel, 
rather“than  Its  spatial  average  over  the  entire  pixel,  and  H(t})fs  the  partial  of  h  with  respect  to  x, 
evaluated  at  x(t,  ).  The  gain  calculation  In  (12)  requires“lnversion  of  a  64-by-6T  matrix.  To  circumvent 
this  burden,  TT2 >  and  (13)  are  replaced  by  the  equivalent  Inverse  covariance  form  [6,7]: 

•  Sf’tt,*)  ♦  HT(t, )£*’(«, )H(t,)  (15) 

P(t,*)  •  [P'^t/))'1  08) 

K(t()  *  07) 

This  form  only  requires  two  A-by-4  matrix  Inversions  online;  R“'(t<)  is  constant  and  Is  generated  once 
offline  (It  is  also  diagonal  If  (11)  Is  used). 

3.  PERFORMANCE  ANALYSIS  UNDER  NOMINAL  CONDITIONS 

The  performance  capabilities  of  the  extended  Kalman  filter  were  evaluated  and  compared  to  those  of  the 
correlator  algorithm  by  mams  of  a  Monte  Carlo  analysis  [8,9].  In  this  analysis,  the  full-scale  model 
developed  In  Section  2  was  used  to  generate  suple-by-saaple  simulations,  differing  In  particular  realisa¬ 
tions  dimwn  from  random  noise  sources.  Sample  statistics  of  the  tricking  errors  committed  by  each 
algorithm  were  computed  on  the  basis  of  20  simulation  runs  (chosen  by  observing  convergence  of  computed 
statistics  to  consistent  values  as  tbs  mater  of  runs  were  Increased). 


This  analysis  was  directed  at  tour  areas  of  prlmsry  Interest: 

1}  Performance  as  a  function  of  stgnal-to*notse  ratio  S/N  defined  hare  as 


Ratios  of  20,  10,  and  1  mire  Investigated. 


2)  Performance  as  a  function  of  Intensity  pattern  size  (spot  size  on  Image  plane)  relative  to  pixel 
size:  Gaussian  beam  dispersion  was  set  to  both  3  and  1  pixel. 

3)  Performance  as  a  function  of  the  ratio  of  rms  target  motion  to  rms  atmospheric  jitter:  (o  ,/o  ) 

values  of  5,  1,  and  0.2  were  considered.  0  a 

4)  Performance  as  a  function  of  target  correlation  time:  targets  with  TH  of  both  1  and  5  sec.  were 

simulated.  0 

All  of  these  studies  were  conducted  under  nominally  assumed  conditions:  as  the  parameters  defining  the 
real-world  environment  were  changed,  the  filter  was  (artificially)  provided  knowledge  of  their  value. 

Thus,  there  was  no  purposeful  model  mismatch  between  the  filter  and  the  actual  tracking  environment;  such 
Important  robustness  studies  are  described  In  the  next  section.  Also,  the  filter  was  implemented  In 
open  loop  for  this  initial  analysis:  computed  offsets  were  nrt  fed  to  a  pointing  control  system  to  be 
nulled  out. 

In  order  to  optimize  Its  performance,  the  filter  must  be  tuned  by  adjusting  the  strengths  of  both  the 
dynamic  driving  noises  and  measurement  corruption  noises.  For  the  simulations  conducted,  the  FUR  and 
background  noises  in  (5)  and  (6)  were  assumed  independent,  spatially  and  temporally  uncorrelated,  and 
Gaussian,  so  R  In  (11)  was  set  equal  to  the  sum  of  the  variances  for  n1k  and  b,k.  Adequate  tuning  results 
when  the  strength  of  the  white  noise  terms  and  correlation  times  for  both  the  simulation  and  filter  target 
dynamics  models  are  set  equal,  and  when  the  rms  values  for  atmospheric  jitter  for  both  models  (of  different 
order)  are  equated.  Fig.  2  depicts  the  actual  versus  filter-computed  error  standard  deviation  committed 
in  estimating  atmospheric  jitter  in  the  horizontal  direction  for  a  typical  case 

(S/N  ■  10,  0^  3  3,  Oj  »  oa  =  1,  Td  =  1);  in  this  and  all  other  cases,  the  adequacy  of  both  the  ;  reposed 

filter  tuning  and  the  order  reduction  of  the  filter  dynamics  model  is  deacnstrated  by  the  good  agreement 
between  the  two  curves.  When  the  s1<yta1-to-noise  ratio  Is.  decreased  to  one,  the  filter  tends  to  under¬ 
estimate  Its  own  errors  (by  about  the  same  margin  It  overestimates  them  In  Fig.  2)  using  the  tuning 
philosophy  described,  due  to  the  mismatch  between  true  and  filter  models  becoming  more  apparent;  this  can 
readily  be  remedied  by  Increasing  the  filter  0,  entries  If  desired.  In  fact,  for  a  "conservative"  or 
robust  filter  that  Is  able  to  withstand  modeling  errors  yet  still  provide  good  estimation  performance 
(beyond  the  bare  minimus  of  nondivergent  characteristics  [15]),  one  might  want  to  tune  the  filter 
purposely  so  that  It  overestimates  its  own  errors  somewhat  [6,7].  Had  biased  estimates  been  a  problem  for 
this  application.  It  could  have  been  combatted  by  tuning  so  as  to  match  filter-computed  error  variances 
and  actual  mean  square  errors  [7];  Incorporating  the  “bias  correction  term"  from  second  order  filtering 
[7,16]  or  Implementing  an  entire  second  order  filter  [5,7,16]  would  be  prohibitive  computationally. 

For  the  typical  case  and  tuning  philosophy  depicted  In  Fig.  2,  Ffg.  3  portrays  the  sample  mean  error 
»lo  (standard  deviation)  committed  by  the  filter  in  estimating  the  target  true  horizontal  location.  By 
conparlson.  Fig,  4  depicts  the  performance  of  the  correlator  algorithm  under  the  saw  conditions.  In  this 
case,  both  algorithms  yteld  rather  unbiased  estimates,  but  the  filter  error  standard  deviation  Is  only  one 
fourth  that  of  the  correlator  after  the  5  sec.  simulation  (and  the  correlator  performance  Is  steadily 
worsening). 

The  filter  tuning  is  independent  of  the  Gaussian  glint  dlsporelon  (spot  >1xe).  Furthermore,  if  S/N 
Is  adjusted  by  changing  only  1  In  (18),  It  is  also  independent  of  the  slytal-to-noise  ratio.  This 
allows  portrayal  of  perfortoanc™5f  a  singly  tuned  filter  in  different  tracklngenvlronments.  Tables  I  and 
II  present  a  comparison  of  the  two  algorithms  In  wen  and  la  tracking  error  for  the  three  slqnal-to-noise 
ratios  examined.  Table  I  pertaining  to  the  case  of  «  ?  pixels  and  Tabic  II  to  •  l  pixel. 

TABLE  I 

HE  A.  l  ERROR  AND  lo  ERROR  COMPARISONS  WITH  s»  •  3  PIXELS 

0 


Correlation  Tracker  Extended  Kalman  Filter 
mean  error  to  error  wan  error  W  error 
S/N  (pixels)  (pixels)  (pixels)  (pixels.' 


20 

0,5 

1.5 

0.0 

0.2 

10 

3,0 

3.0 

0,0 

0.2 

1 

15.0 

30.0 

0.0 

0.8 

TABLE  II 

MEAN  ERROR  AND  lo  ERROR  COMPARISONS  WITH  o„  •  1  PIXEL 

9 


KmlttlcrtTmtor  Extended  Kalman  niter 
win  error  lo  error  wan  error  lo  error 
S/N  (pixels)  (plwts)  (pixels)  (pixels) 


20 

7.0 

8.0 

0.0 

0.2 

10 

8.0 

10.0 

0.0 

0.2 

1 

15.0 

30.0 

0.0 

0.8 

HK> 


These  are  results  at  the  end  of  the  5  sec.  simulations  and  each  represents  an  average  of  values  generated 
from  Monte  Carlo  simulations  using  the  three  different  values  of  a  Jo,-  The  extended  Kalman  filter  per¬ 
forms  well  as  the  slgnal-to-nolse  ratio  Is  lowered,  with  no  noticeable  change  between  the  ratios  20  and 
10,  and  exhibiting  only  a  slight  degradation  In  performance  at  S/N  »  1.  It  consistently  outperforms  the 
correlation  tracker,  especially  at  lower  s1<pial-to~noise  ratios.  In  fact,  the  correlation  algorithm 
repeatedly  exhibited  a  divergent  characteristic  In  the  more  difficult  tracking  environments.  As  shown 
in  Table  II,  decreasing  the  dispersion  of  the  Gaussian  Intensity  function  seriously  affected  the  correlator 
tracking,  whereas  the  filter  is  essentialy  unaffected. 


When  the  ratio  of  rms  target  motion  to  rms  atmospheric  jitter  was  decreased  from  5  to  1,  the  mean 
filter  error  remained  clo^e  to  zero,  but  the  lo  value  Increased  from  0.2  to  0.5  pixels  (for  the  case  of 
S/N  *  20,  o  =3,  Td  *  1).  Decreasing  It  further  to  ojo  =  0.2  resulted  in  a  return  to  a  level  of  about 
0.2  pixels.9  This  seems  to  imply  that  the  filter  is  abre  to  distinguish  between  true  and  apparent  target 
motion  more  easily  when  there  Is  a  significant  amplitude  difference  between  the  two  effects.  In  all  three 
corresponding  cases,  the  correlation  algorithm  had  a  lo  error  of  about  one  pixel,  so  that  even  in  the 
worst  case,  the  filter  surpassed  the  correlator's  tracking  ability  by  a  wide  margin. 

Increasing  the  target  correlation  time  from  1  to  5  sec.  had  no  discernible  effect  on  the  performance 
of  either  tracker.  However,  a  larger  variation  In  T,  might  well  demonstrate  that,  as  the  characteristics 
of  the  true  target  dynamics  and  atmospheric  jitter  become  more  distinctly  different,  the  fllteris  better 
able  to  separate  the  effects  and  enhance  tracking,  whereas  the  correlator  cannot  perform  this  function. 

4.  ROBUSTNESS  OF  FILTER 


The  marked  performance  Improvement  over  that  attained  by  a  correlation  algorithm  was  achieved  by  a  filter 
based  on  appropriate  modeling  assumptions,  parameter  values  and  tuning.  However,  this  raises  the  robust¬ 
ness  Issua  of  how  much  filter  performance  degrades  when  an  accurate  portrayal  of  the  tracking  problem 
differs  from  that  assumed  In  the  filter  design.  In  this  section,  first  the  sensitivity  of  the  estimation 
performance  to  large  variations  in  parameter  values  within  assured  model  forms  Is  depicted.  Then  sensitivity 
to  variations  in  the  basic  structure  of  the  appropriate  models  is  presented.  Finally,  insights  into 
required  design  modi  ft  cations  and  online  adaptation  capability  are  stxanaHzed  [10,11}. 

4.1  Sensitivity  to  Model  <‘ara«teter  Mismatches 

The  extended  Kalman  filter  was  based  upon  nominal  perimeter  values  of 

(1)  maximum  intensity  level,  I  «  10  units  (arbitrary  scale) 

(?)  glint  dispersion  (spread), »  3  pixels 

(3)  target  dynamics  rms  value,  oi  *  a  (rms  Jitter)  »  1  pixel 

(4)  target  dynamics  correlation  riwe.T.  *  1  see 

(5)  signal-to-noi  ie  ratio,  S/N  »  10  (i.e.,  r»S  background  noise  »  1  *  0.1  1W)() 

When  the  actual  environment  was  well  modeled  by  these  parameter  values,  the  standard  deviation  of  the 
errors  in  estiautir.^  target  states  x.  and  y.  urn  each  0.56  pixel,  and  0.55  pixel  In  estimating  atmospheric 
Jitter  states  x  and  y  .  Since  all  errors  Tn  these  studies  were  essentially  zero-mean,  these  are  also  ros 
error  values.  * 

Table  lit  summarizes  the  effect  of  varying  the  true  value  of  these  parameters  in  the  Hint*  Carlo 
siar.dation  (20  runs  per  evaluation)  without  altering  the  values  in  the  filter.  The  resulting  actual  error 


TABLE  III 

ACTUAL  ESTIMATION  ERROR  AVERAGE  STANDARD 
DEVIATIONS  WITH  MODEL  PARAMETER  MISMATCHES 


Tn* 

Target  v 
(In  pixels) 

Jitter  <? 

Parameter 

.US_£LwM 

/  1 

3.7 

1.7 

'-•It 

.56 

.55 

.70 

1-5 

/ 1 

3.0 

1.5 

°0  '1  J* 

.56 

.55 

9  U 

.65 

.63 

f.3 

.38 

Mr 

M 

2.6 

.55 

1.6 

/? 

.66 

.65 

VI  ’* 

.56 

.55 

d  Is 

.43 

.46 

/ 1 

3.8 

8.0 

S/M  *  5  TO* 

.56 

.55 

l  M 

.55 

.57 

*  *  design  conditions,  valuts  assumed  by  filter 

standard  deviations  (averaged  over  the  5-sec  simulations)  In  estimating  target  position  and  atmospheric 
jitter  are  presentee  as  each  parameter  is  separately  varied  from  the  desiyi  conditions. 

For  the  first  two  robustness  studies,  the  real  world  I  and  o  descriptors  of  the  target  intensity 
profile  were  allowed  to  vary.  When  I  is  1,  the  real  FLTfr images9are  of  a  target  much  more  highly  masked 
by  backgrouid  noise  than  the  filter  assumes  (S/N  is  actually  I  rather  than  10),  with  resulting  severe 
degradation.  Such  a  low  S/N  in  fact  produces  poor  performance  even  without  a  parameter  mismatch  [8,9],  so 
this  result  is  expected.  However,  when  I  is  increased  to  20,  the  filter  again  has  difficulty,  apparently 
due  to  searching  for  the  wrong  shape  of  intensity  profile  due  to  mismodeled  I  .  When  the  true  target 
intensity  is  less  spread  out  than  the  filter  assumes  (o  =  1  pixel),  the  real^fmage  can  move  substantially 
within  the  large  envelope  being  sought  by  the  filter,  wrth  significant  deterioration  in  performance.  On 
the  other  hand,  when  the  real  image  is  larger  than  assumed,  estimation  accuracy  Is  acceptable:  the  Intensity 
peak  can  be  located  rather  precisely. 

The  filter  assumed  a  rather  benign  target  trajectory,  as  is  appropriate  for  distant  targets.  In  the 
next  set  of  robustness  studies,  the  rms  values  and  correlation  times  of  the  first  order  Gauss-Markov  pro¬ 
cess  were  allowed  to  vary  from  design  val'ies.  When  o^/u  is  set  to  0.2,  the  real  target  motion  amplitudes 
are  less  than  assumed  by  the  filter,  and  the  estimation  accuracy  is  acceptable  (the  filter  overestimates 
its  own  errors,  and  smaller  errors  could  be  achieved  with  correctly  assumed  oh)-  Unacceptably  large  errors 
are  produced  when  the  filter  underestimates  the  dynamics  anplltudes  (O(j/oa  *  5).  Correlation  time  vari¬ 
ations  by  a  factor  of  five  have  insigiificant  effect,  yielding  somewhat  greater  errors  when  the  true  target 
exhibits  higher  frequency  motion  than  assumed,  and  smaller  errors  when  the  trajectories  are  more  time- 
correlated  than  anticipated. 


Finally,  mismatches  in  the  background  noise  model  were  investigated.  As  in  the  case  of  varying 
l_jx.  changing  the  rms  value  for  background  noise  affects  S/N,  but  the  trends  in  performance  differ  for 
tne  two  cases.  Again,  low  S/N  results  in  poor  estimation  (even  without  mismatches).  However,  when  S/N 
is  high  and  Ima.  is  properly  modeled,  the  assumed  target  intensify  shape  is  correct  while  the  corrupting 
noise  is  actually  less  than  assumed,  and  the  fi'ter  tracks  the  target  well. 


4. 2  Sensitivity  to  Variations  in  Model  Structure 

The  filter  under  investigation  w,v  desired  to  track  distant  point  targets  with  lew  angular  rate  and 
acceleration  capabilities,  against  a  temporally  and  spatially  uncorrelated  background.  Now  if  is  desired 
to  establish  the  robustness  of  the  *i  Iter  to  the  structure  of  the  assumed  models,  considering  shorter 
range  targets  such  that  (l)  shape  effects  becoro  significant  and  (2)  target  dynamics  can  become  more 
violent,  and  also  considering  scenario;  In  which  background  noise  can  be  highly  correlated,  both  spatially 
and/or  temporally.  One  does  not  necessarily  expect  accurate  tracking  by  the  filter  under  thase  very 
different  conditions;  rather.,  the  goal  is  a  prioritized  list  of  the  characteristics  of  the  new  scenario 
that  cause  the  severest  performance  degradation. 

In  fact,  nas  errors  double  when  tn*  equal  Intensity  contours  are  ellipses  with  major  axis  dimension 
ten  times  that  of  the  minor  axis,  instead  of  circular  as  assumed:  such  an  intensity  pattern  would  be 
representative  of  sons  air-to-air  missile  targets,  Unaodelcd  or  atsaodclcd  target  motion  (especially 
involving  persistent  nonzero  mean  velocities  and  accelerations,  or  varying  degrees  of  maneuvering  in  a 
single  scenario)  also  have  a  wry  serious  effect  on  tracking  ability.  Extensive  performance  analyses 
showed  that  loss  of  track  occurs  consistently  whenever  unmodeled  motion  allows  the  target  to  wove  out  of 
the  field  of  view  in  one  or  two  swple  periods,  even  with  a  closed-loop  system  assumed  able  to  null  out 
any  estimated  errors  in  a  single  xa*s>le  period:  the  lack  of  a  viable  target  velocity  estimate  is  a 
critical  shortcoming  in  this  environment-  Wswdeled  background  noise,  misrepresented  In  spatial  and 
temporal  correlation  as  well  at  in  n»s  value,  does  not  have  significant  effect  at  moderate  expected 
values  of  S/N.  For  exaaple,  at  the  nominal  S/N  of  10,  introducing  exponential  spatial  correlation 
tyrmetri tally  in  all  directions  with  a  correlation  listwee  of  1.5  pixels  increases  the  rws  tracking 
error  by  .03  pixels,  while  introducing  both  this  spatial  correlation  and  a  long  tenoral  correlation  (such 
that  the  correlation  coefficient  for  a  given  pixel  from  one  staple  Ur*  to  the  next  Is  0.95)  increases  the 
raw  error  by  only  0.1  pixel.  Ifcten  S/N  is  reduced  to  1,  such  eomtalic-ns  cause  consistent  loss  of  track, 
but,  as  alreadv  teen,  very  tow  S/N  degrades  performance  greatly  even  in  U*  absence  of  mi  two  deling  in  the 
filter. 

*. 3  Insists  from  Robustness  Analysis 

Thus,  to  generate  a  filter  capable  of  tracking  air-to-air  missiles  in  background  clutter,  one  mutt 
include  the  following  aspects  in  the  design: 

1)  ability  to  estimate  site,  shape,  and  orientation  of  the  target  image; 

2)  online  estimation  of  target  intensity  height  ?„  since  it  is  incertain.  varying,  and  important 
to  filter  residual  generation  and  tracking  performance; 

3)  ability  to  predict  future  position  by  mintaininq  at  least  a  velocity  estimate  in  addition-  to  a 
position  estimate  (acceleration  estimates  may  well  be  required  also); 

4)  adaptation  to  maneuvers  (detecting  a  maneuver  not  predicted  by  the  filter,  via  residual  nor.ilortng, 
and  responding  appropriately  as  through  gain  changing}. 

itortover,  spatial  and  teaporal  correlation  of  background  r.otse  need  not  be  twdeled  in  the  filter  for 
expected  S/N  values.  The  neat  sections  establish  the  design  and  capabilities  of  a  filter  with  these 
features. 


S.  ELLIPTICAL  EQUAL  INTENSIFY  CONTOURS 
5.1  Basic  Npdel 

Analysis  of  neat  fUR  data  indicated  that  air-to-air  missile  images  could  be  well  approximated  by  4 
bivariate  Gaussian  Intensity  pattern,  but  with  elliptical,  rather  than  circular,  equal  intensity  contours. 


The  ratio  of  major  and  minor  axis  magnitudes,  (aq;/og2)  as  in  Fig*  1»  typically  ranges  from  1  to  about  10, 
depending  on  the  aspect  angle  of  the  missile.  Moreover,  the  magnitude  of  og2  varies  with  range  from 
target  to  the  tracker. 

For  development  of  the  filter,  it  was  assumed  that  the  semimajor  axis  of  the  ellipse  could  be  aligned 
with  the  missile  velocity  vector  (Ignoring  small  angle  of  attack  and  sideslip  angle).  Since  target 
velocity  in  the  FlIR  image  plane  Is  to  be  estimated,  vx  and  vy  are  used  to  establish  the  angular  orienta¬ 
tion  of  the  ellipse  major  axis.  Letting  Ax;  and  ^  be  measured  from  UpeakM ^peak^)  alon9  the 
pal  axes,  the  target  Intensity  model  (1)  becomes 

Wt^l^V1*  =  ‘»xexp{-  ?£{AV0g1>2  +  (Ax2/og2)2]}  °9) 

where  !„„,  og;,  and  o  2  are  treated  as  uncertain  (slowly  changing)  parameters  to  be  estimated  simultane¬ 
ously  with  the  states.  Various  methods  of  estimating  these  uncertain  parameters  where  considered 
[7,17-24],  Including  treating  their,  as  additional  states,  multiple  model  Bayesian  estimation  for  discretized 
parameters,  full-scale  and  approximated  maximum  likelihood  methods,  and  least-squares  techniques. 

5.2  Estimation  of  ogl  and  og2 


(20) 


Very  good  performance  and  small  computational  burden  were  achieved  by  generating  the  estimates 
Og^(t^)  and  ag2 ( )  that  minimized  the  quadratic  cost 

C[Z.( t1 ) ,  a]  *  Uitj)  -  hjx^'),  a]}T  (zj^)  -  h[x(t,'),  t^;  a]} 

as  a  function  of  a,  where  Z(tj)  is  the  measurement  history  iz(t, ),...,  z(t^ ) } ,  a^  is  the  vector  of 
uncertain  parameters  to  be  estimated,  and  h  is  as  defined  in  (6)  but  usTng  Itarget  as  defined  in  (19). 
This  can  be  viewed  as  a  least-squares  approximation  to  a.,  estimate  based  on  maximizing  the  likelihood 
function  fn  f[x(t.),  z[t;)lz(t.  ,),  a]  with  respect  to  both  x(t,)  and  a.  Usually,  one  might  seek  a 
weighted  sum  of  quadratics  of  tRe  most  recent  N  residuals  Instead  of  a  cost  involving  only  the  single 
current  residual  as  in  (20)  for  better  performance,  but  the  64-dimensional  measurement  in  this  problem 
provides  significant  spatial  averaging  to  supplant  temporal  averaging.  In  fact,  use  of  (20)  yields  very 
acceptable  results.  A  recursive  gradient  solution  to  minimizing  (20)  was  inplemented  as 


i(V  ■  a(Vi)  *  k(3cT/^)|x(t1'),a(ti.1)  (21) 

i.e. ,  a  single  gradient  step  Is  taken  each  sample  perloa,  with  k  a  scalar  step-size  control  value 
(established  empirically  as  0.001)  and  with  the  partial  derivative  evaluated  using  the  currently  available 
state  and  parameter  estimates  x[t,‘;  a^(t^i)]  and  a(t^.i),  respectively.  Many  terms  required  in  the  evalu¬ 
ation  of  this  partial  derivative  are  already  available  from  the  filter  gain  computations.  Fig.  5  Is 
indicative  of  the  performance  of  this  simple  algorithm;  it  displays  the  first  half  second  of  a  representa¬ 
tive  single  sample  time  history  of  estimates  of  on]  and  o^,  when  true  values  were  a  ,  =  5  pixels,  og2  1  1 
pixel,  while  the  filter  was  initialized  with  oq-|  *  oq2  *  J’pixels.  With  these  parameter  values,  the  mean 
and  standard  deviation  of  errors  in  estimating  ogi  afld  og2  all  assumed  average  values  (time  averaged  over 
the  4.8  sec  following  tne  0.2  sec  transient  period  obvious  in  Fig.  5)  of  approximately  0.15  pixel.  These 
results  were  obtained  for  constant  true  oqi  and  oq2*  for  a  trajectory  at  constant  radius  from  the  tracker; 
similarly  good  results  were  obtained  when^the  aspect  angle  of  the  missile  varied  so  that  “true”  og;  and 
og2  In  fact  varied. 


5.3  Estimation  of  I 


Although  Uj,  could  have  been  treated  In  like  manner,  what  was  eventually  implemented  was  a  more  direct 
use  of  measurement  information  that  provided  excellent  performance  with  very  small  computational  burden. 
Simply  selecting  the  highest  observed  pixel  intensity  at  time  ti  as  an  estimate  of  InaX  was  explored,  but 
it  sufficed  due  to  both  background  noise  corruption  effects  and  a  bias  even  in  the  absence  of  noise.  If 
there  were  io  noise,  the  maximum  pixel  intensity  is  the  average  intensity  over  the  pixel  closest  to  the 
centroid  ot  the  Gaussian  intensity  profile,  which  is  less  than  the  value  I^x  at  the  centroid.  This  bias 
is  a  function  of  the  centroid  location,  oq;,  and  0*2  •  and  can  be  substantial  for  small  ogi  and  og2- 
Assuming  the  centroid  is  located  at  the  center  of  tne  pixel,  on  the  average,  a  bias  function  b(ogi,  Oq?) 
can  be  developed;  for  this  feasibility  study,  a  second-order  polynomial  fit  approximation  was  usfla.  Thus 
an  estimate  based  on  a  single  time  sample  of  measurement  data  is 

fj(tj)  «  n^x[2k(t1 ) ;  1  <  k  <_  64]  -  bto^ft^),  ^(^..j)]  (22) 

and  this  is  time  averaged  with  previous  estimates  to  reduce  the  valance  due  to  background  noise: 


f(t1)  ■  C  f(ti_1)  +  [i  -  c]i-| (t^ ) 


(23) 


Performance  capabilities  are  indicated  by  a  set  of  simulations  in  which  the  missile  was  flown  on  an  iner- 
tially  straight  trajectory  such  that  at  t  ■  3  sec,  it  was  at  a  minimum  range  of  10  km  from  the  tracker, 
with  v  »  500  m/s  and  vu  ■  300  m/s  as  seen  in  the  FlIR  image  plane  (each  pixel  is  a  20  urad  square).  At 
that  minimum  distance,  the  "true"  values  were  agi  ■  3  pixels,  og2  ■  1  pixel,  and  for  the  *rfto1e  simulation 
"true"  Iwx  »  25.  Selecting  the  highest  pixel  intensity  yielded  an  I^x  with  mean  of  24.47  and  standard 
deviation  of  1.12;  the  latter  statistic  is  comparable  to  the  rms  backgroind  noise  Just  time 

averaging  via  (23)  with  c  ■  0.8  but  with  no  bias  correction  yielded  mean  and  o  of  24.53  and  0.33,  res¬ 
pectively.  Using  bias  correction  only  via  (22)  yielded  24.99  and  1.16  respectively,  while  using  (22) 
and  (23)  together  resulted  in  a  mean  of  24.99  and  0  of  0.30.  These  results  were  achieved  with  simultaneous 
estimation  of  and  og2,  with  precision  comparable  to  that  discussed  earlier. 


6.  TARGET  MOTION  COMPENSATION 

6.1  Simple  Six-State  Filter 


As  indicated  previously,  at  least  the  target's  velocity  must  be  estimated  in  addition  to  its  position, 
to  predict  Its  position  one  sample  period  ahead  for  appropriate  tracking  controller  command  generation.  A 
velocity  estimate  was  also  required  In  the  previous  section  to  orient  the  elliptical  Intensity  contours. 

The  slnplest  possible  dynamics  model  for  FLIR  plane  motion  that  Includes  velocity  states  would  be 

x(t)  -  v(t)  (24a) 

vjt)  =  w(t)  (24b) 

with  w  white  Gaussian  noise  with  autocorrelation  E{w(t)jJ(t  +  t)1  =  Q(t)S(t),  and  Q[t)  chosen  (adaptively) 
to  provide  an  adequate  representation  of  target  naneuverabillty.  Such  a  model  only  Increases  the  filter 
state  dimension  from  four  to  six,  and  the  dynamics  model  remains  linear.  Though  computationally  simple, 
the  filter  based  on  such  a  model  does  not  yield  very  good  performance  for  this  application.  For  Instance, 
Fig.  6  presents  the  mean  error*  1  standard  deviation  In  estimating  horizontal  position  for  a  20-run  Monte 
Carlo  simulation  of  the  InertlaHy  straight  trajectory  described  previously,  with  true  1^  =  25, 
ogi  =  5  pixels,  og2  *  1  Pixel,  and  background  noise  rms  value  of  2.  In  fact,  this  plot  corresponds  to 
a  case  of  estimating  £  online  after  an  acquisition  phase,  as  discussed  subsequently,  but  Is  representative 
of  results  In  which  Q  Is  artlflcally  tuned  offline  at  a  constant  value  (after  acquisition)  for  good  perfor¬ 
mance  on  this  type  of  trajectory.  The  projection  of  the  InertlaHy  constant  velocity  into  the  FLIR  image 
plane  changes  with  time  as  the  tracker  rotates  to  maintain  the  target  In  the  center  of  Its  field  of  view: 
an  unmodeled  noninertial  acceleration  is  thus  created,  manifesting  Itself  In  the  positive  slope  of  the 
mean  error  depicted  In  the  figure.  Moreover,  velocity  estimates  diverge  significantly  over  the  last 
second,  yielding  eventual  loss  of  track  If  the  simulations  were  over  longer  periods.  That  the  trend  In 
the  figure  Is  due  to  noninertial  acceleration  was  corroborated  by  tracking  a  missile  at  a  constant  range, 
yielding  essentially  zero-mean  error  and  o  ^0.2  pixels  for  all  time  with  no  divergence.  The  tracker 
control  signals  which  cause  the  noninertial  acceleration  could  be  made  available  for  filter  compensation 
and  lnproved  performance.  However,  less,  benign  target  trajectories  further  justified  the  need  for  a 
better  dynamics  model. 

6.2  Preferable  Eight-State  Filter 

As  a  result,  an  eight-state  filter  was  generated  that  estimated  acceleration  In  the  FLIR  plane  as  well, 
as 


x(t)  »  v(t) 
v(t)  -  a(t) 
a(t)  «  w(t). 


(25a) 

(25b) 

(25c) 


Alternative  models  of  acceleration,  such  as  an  exponentially  time-correlated  process  model  (which  intro¬ 
duces  an  additional  uncertain  parameter,  the  correlation  time)  and  a  constant  turn-rate  model  [25,26]  of 


a(t)  •  -  iA(t)  +  w(t) 

W  r  |v(t)  X  a(t)|/|v(t)| 


2 


(26a) 

(26b) 


ted.  To  provide  acquisition  capability,  the  filter 
entries  corresponding  to  target  states:  25  pixel  » 


(which  yields  nonlinear  dynamics)  were  considered,  but  (25)  was  explored  most  fully  because  of  Its 
simplicity  and  performance  potential.  In  a  duplicate  tracking  environment  as  used  to  generate  Fig.  6, 
the  eight-state  filter  produced  much  Improved  tracking  performance  as  Indicated  In  Fig.  7  (without  being 
provided  control  signals  for  compensation). 

6.3  Acquisition 

The  preceding  results  reflect  a  filter  provided  with  perfect  initial  state  knowledge,  so  recovery 
from  realistic  Initial  condition  errors  was  Investigated. 

Initial  covariance  L.  was  assumed, diagonal  with  large 

2000  p1xels2/sec%  and  100  plxelsvsec*,  respectively.  Further,  the  2  values  were  maintained  at  a  high 
value  (600  plxelsvsec5)  for  0.5  sec  after  Initialization.  With  8  ^sec  true  Initial  velocity  error  In 
each  direction,  performance  Is  as  depicted  In  Fig.  8:  acquisition  is  accomplished  in  about  half  a  second, 
followed  by  tracking  capability  as  portrayed  previously. 

6.*  Adaptive  Tunlrg 

Adaptive  estimation  of  the  dpi  ami  cs  noise  covariance  matrix  was  Investigated  to  allow  self-tuning  to 
an  uncertain  and  dynamically  changing  environment.  The  ability  to  adjust  filter  bandwidth  online  was 
considered  necessary  because  an  air-to-air  missile  can  exhibit  a  wide  range  of  dynamic  characteristics. 
Various  methods  of  covariance  estimation  were  considered.  Including  maximum  likelihood,  multiple  model 
Bayesian  adaptation,  and  correlation  and  covariance  matching  techniques  [7,17-24,27-29].  Oue  to  perfor¬ 
mance  and  computational  considerations,  the  approximation  to  maximum  likelihood  estimation  first  described 
In  [27]  was  employed:  If  the  filter  covariance  propagation  and  update  aquations  are  as  given  in  Eqs.  (9) 
and  (13),  than  an  estimate  of  J^ttj)  is  provided  by 


where 


VM  ■  (vw  j.1£(Wte(tj>fcT<tj)  ♦  p(tj4)  -i^tj.,)#1] 


(27) 


s  x(tj)  -  x(tj). 


(18) 


(29) 


Equation  (27)  can  also  be  derived  heuristlcally  by  noting  that 

E{«x(tj)fixT(tj) }  »  K(tj)H(tj)P(tj) 

and  substituting  this  and  (9)  Into  (13),  and  approximating  the  ensemble  average  In  (29)  by  a  tenporal 
average  over  the  most  recent  N  sanple  periods.  To  reduce  storage  requirements,  a  fading  memory  approxima¬ 
tion  to  the  finite  memory  result  (27)  was  Implemented  as 

9d(tf)  -  k  Q^.])  +  [1  *  k4n(V  (30) 

where  (L^t.,)  Is  a  single  term  from  the  summation  In  (27).  The  parameter  k  was  empirically  set  to  0.8  as 
a  tradeoffr'ke (0.8,  1.0)  provides  smoother  estimates  while  ke(0,  0.8)  provides  more  rapid  response  to  true 
changes  In  dynamics.  On  the  inertlally  straight  trajectory  and  similar  benign  trajectories,  the  filter 
with  adaptive  (L  estimation  performed  approximately  the  same  as  a  filter  with  artifical  offline  tuning  to 
a  given  trajectory. 

However,  this  adaptation  was  Insufficient  for  abrupt  significant  maneuvers.  In  one  set  of  simulations, 
the  missile  initiated  a  20  g  pullup  at  t  »  4  sec  of  the  previously  defined  trajectory.  With  a  field  of 
view  of  160  yrad  by  160  urad,  such  an  acceleration  can  move  the  image  3  to  4  pixels  laterally  in  two  sample 
periods.  Moreover,  virtually  all  of  the  position  Information  for  the  lateral  direction  (i.e.,  region  of 
high  Intensity  profile  gradients)  is  within  1  or  2  times  ogj  (a  1  pixel  here).  The  CL,  appropriately  low 
for  the  benign  portion  of  the  trajectory,  did  not  respond  fast  enough  to  preclude  loss  of  track.  Although 
the  filter  residuals  were  of  large  magnitude  when  the  maneuver  was  conducted,  the  gains  were  too  low  to 
weight  them  enough;  by  the  time  the  gains  grew,  there  was  little  overlap  of  the  actual  and  predicted 
intensity  profiles,  so  the  residuals  were  insignificant. 

6.5  Significant  Maneuver  Indication  and  Appropriate  Filter  Response 

To  maintain  track  required  several  matrix  coefficients  (filter  gain  and/or  covariance  matrix  P,  as  well 
as  Qj)  and  some  state  estimates  to  change  significantly  in  a  single  sample  period:  for  instance,  True 
elevation  acceleration  changes  from  0  to  2400  plxels/sec?  when  the  lateral  maneuver  begins.  One  possible 
variable  for  reliable  and  rapid  indication  and  quantification  of  a  maneuver  is  the  scalar 

«S(t1)T«x(ti)  -  tr[ax(ti)«j<T(t1)]  (31) 


where 


txfV  •  P(tJ)HT(ti)R'1(t1)U(ti)  -  hCx^ht,]}  (32) 

and  the  filter  gain  shown  in  (32)  is  used  because  of  dimensionality  considerations,  as  discussed  In 
Section  2,  Since  inappropriately  small  P(tt)  values  is  part  of  the  maneuver  compensation  problem,  and 
since  R(t, )  Is  assumed  diagonal,  an  alternative  indicator  Is  the  magnitude  or  separate  components  of  the 
aval  laFle  vector 

-  H^t,)^)  -  hj&tp^])  (33) 

where  4x(t, )  •  P(t|)4y(t,)/R(t,).  At  the  sanple  Instant  after  initiation  of  the  20  g  maneuver,  at 
t  *  4.03  sic.,  Toth  (31)  and  the  elevation  position  conponent  of  (33)  increased  by  an  order  of  magnitude, 
then  returned  to  their  normal  magnitudes  on  the  following  sample  Instant,  t  ■  4.07  sec  (because  the  true 
and  projected  Images  had  already  diverged  far  enough  laterally,  about  5  pixels,  to  generate  essentially 
no  overlap  and  thus  very  small  residuals).  Therefore,  a  maneuver  was  detected  by  one  of  these  Indicators 
surpassing  a  magnitude  threshold,  with  the  components  of  4y(t,)  providing  direction  Information  about  the 
unwodeled  maneuver  as  well. 

Upon  maneuver  detection  the  appropriate  response  Is  (1)  to  Increase  the  filter  gain  directly,  rather 
than  to  allow  slow  Increase  via  Qj  estimation,  (2)  to  incorporate  undated  state  estimates  for  reprocessing 
the  previous  sample  period's  state  estimate  tin*  propagation  and  for  future  propagation,  and  (3)  to  expand 
the  field  of  view  (e.g. ,  treating  averaged  intensities  of  2-by-2  arrays  of  pixels,  Instead  of  Individual 
pixels,  as  filter  measurements).  Increased  gain  was  Introduced  by  reverting  to  an  acquisition  cycle  of 
very  high  relnltlallied  P  and  high  (L  for  0.5  sec  thereafter.  To  reprocess  the  state  propagation,  a  com¬ 
fit  function  was  established  for  waodeled  position  displacements  as  a  function  of  <jgi.  o0j,  and  the  first 
two  components  of  «jf(tt>;  these  dlspticewnts  were  assumed  to  be  the  result  of  an  untodeled  acceleration 
acting  over  tpe  previous  sample  period.  If  a  maneuver  Is  detected  at  tine  t, ,  the  acceleration  state 
estimates  ofj(t.*Jare  modified  by  these  calculated  Increments,  and  x(tT)  recomputed.  Incorporating  these 
changes  postponed  divergence  for  about  15  senile  periods.  A  furthered  hoc  adaptation  used  the  six  target 
state  components  of  ti(tt)  -  x(t7)1  throughout  the  acquisition  period  as  an  Indicator  of  immodeled  target 
dynamics,  which  was  tfcn’adde?  during  the  next  sample  period  to  the  standard  propagation  equations.  This 
allowed  nondlvergent  results  for  the  entire  simulation  period.  Expansion  of  the  field  of  view  was  not 
evaluated,  but  It  Is  a  useful  means  of  maintaining  track  at  the  expense  of  some  resolution. 

Although  these  modifications  allowed  track  to  be  maintained,  the  required  artificial  Introduction  of 
nonitro  acceleration  estimates  points  cut  an  important  shortcoming  of  aero-mean  acceleration  models  as 
In  (25)  or  exponentially  time-correlated  process  descriptions.  The  constant  turn-rate  model  (26)  has 
been  shown  to  be  more  representative  of  many  airborne  targets  at  close  range  [25,26],  so  current  efforts 
are  considering  its  incorporation  Into  the  filter  structure. 

7.  conclusion 

A  sliple  four-state  esttnfrd  Kalman  filter  has  been  developed  to  track  a  distant  point-source  target 
with  benign  dynamics,  using  outputs  from  a  forward-looking  Infrared  (FlIR)  sensor  as  measurements.  As 
shewn  In  part  by  Figs.  3  and  4,  it  consistently  outperforms  the  currently  used  correlation  tracker,  with 
the  most  substantial  improvement  being  gained  In  scenarios  with  law  signal-to-noist  ratio  and/or  small 


target  spot  size  relative  to  detector  (pixel)  size.  The  filter  exploits  knowledge  unused  by  the  correlation 
tracker  -  size,  shape  and  motion  characteristics  of  the  target,  atmospheric  jitter  spectral  description, 
and  background  and  sensor  noise  characteristics  -  to  yield  the  enhanced  performance.  However,  robustness 
studies  have  revealed  a  serious  degradation  in  tracking  performance  when  the  filter-assumed  model  does  not 
represent  the  actual  tracking  environment  well.  Indicating  appropriate  design  modifications  to  generate 
a  filter  capable  of  tracking  less  benign  targets  In  a  realistic  and  more  uncertain  environment. 

Figs.  7  and  8  are  Indicative  of  performance  capabilities  of  the  resulting  eight-state  adaptive  tracker 
for  realistic  but  not  overly  harsh  trajectories  of  a  missile  viewed  at  short  range.  Good  tracking  per¬ 
formance  is  achieved  on  the  basis  of  estimating  target  velocity  and  acceleration  as  well  as  position, 
assuming  elliptical  intensity  profile  contours  with  major  axis  aligned  with  the  estimated  velocity  vector, 
and  adaptively  estimating  Irax,  o^,  aqo,  and  CL.  To  address  more  dynamic  environments,  detection  and 
appropriate  response  to  maneuver  initiation  have  been  investigated  with  some  success,  but  rather  tenuous 
ad  hoc  modifications  were  required  due  to  assumed  zero-mean  acceleration  models.  Current  research  is 
concentrated  on  better  target  acceleration  models  and  on  different  filter  forms  to  exploit  these  models, 
such  as  the  multiple  model  adaptive  filtering  algorithm  that  probabilistically  weights  the  outputs  of  a 
bank  of  filters,  each  based  on  one  of  the  alternative  models.  Also,  work  Is  being  accomplished  on  a 
generalization  in  which  target  Intensity  patterns  are  uncertain  or  irregular  enough  to  discount  a  bivariate 
Gaussian  model  and  Instead  to  preprocess  the  measurements  to  provide  the  entire  h^  function  adaptively  to 
the  filter,  as  through  spatial  modal  decomposition. 
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SUMMARY 

In  order  to  increase  the  accuracy  levels  of  the  inertial  navigation  systems  (INS)  ad¬ 
ditional  external  measurements  were  used,  such  as  those  provided  by  radar  equipment, 

TACAN  facilities,  MLS  and  so  on.  The  combination  of  the  different  data  is  often  carried 
out  by  using  the  methods  of  Kalman  filtering,  which  need  sufficiently  exact  error  models 
especially  for  the  INS.  As  far  as  strapdown  systems  (SDS)  are  concerned  the  development 
of  mathematical  descriptions  of  the  error  behaviour  leads  to  high-order  models,  because 
the  dynamic  environment  of  the  SDS  has  to  be  taken  into  account.  However,  for  real-time 
navigational  computations  it  is  necessary  to  provide  low-order  error  models.  In  the  paper 
account  is  given  of  how  adaptive  low  order  error  models  were  developed  and  adaptive  fil¬ 
tering  applied.  The  results  were  checked  by  using  measured  and  simulated  SDS  data. 


LIST  OF  SYMBOLS 

SUBSCRIPTS/SUPERSUBSCRIPTS 

b  body-fixed  axes 

n  navigational  axer 

i  inertial  reference  coordinate  frame 

e  earth-fixed  axes 

x,  y,  z  long,  cross,  vertical  directions 

N,  E,  D  north,  east,  down  directions 

ac  accelerometer 

gy  gyro 

symbol  vector  expression 

symbol  matrix 

symbol  *  true  value  +  error 

NAVIGATIONAL  SYMBOLS 


M 

a 

v 

* 

X 

h 


X 

V 

t. 

t 


transformation  matrix 

bank,  pitch  and  asimuth  angles 

angular  rates 

accelerations 

velocities 

geographical  latitude 

geographical  longitude 

height 

earth  radii 

gravity 

angular  or  velocity  increments 

earth  angular  rate 

track  angle 

flight  path  angle 

sampling  period 

time 


SENSOR  ERROR  COEFFICIENTS 

«*»  axes  misalignment 

D*  fixed  drifts 

D*  mass  unbalance  drifts 

Dn  anisoelasticity  drifts 

D*  fixed  scale  factor  errors 

D**1  quadratic  nonlinearity  scale  factor  errors 

D**  asymmetric  scale  factor  errors 

D*"  angular  acceleration  drifts 

0”*  anisoinertia  drifts 

««  axes  misalignment 

B*  bias 

B*  fixed  scale  factor  error 

B*  asymmetric  scale  factor  error 

B"  quadratic  nonlinearity  scale  faotor  error 

>**  cubic  nonlinearity  scale  factor  error 

B1*  cross  coupling 


H,y  angular  momentum  of  the  rotor 

A9V  rotor  transverse  moment  of  inertia 

C9y  rotor  polar  moment  of  inertia 

5hbar  bias  of  the  barometric  altimeter 

SYMBOLS  FOR  THE  FILTER  ALGORITHMS 

t  state  vector 

estimated  state  vector 
9  estimation  error 

A  state  space  notation  of  the  error  model  (transitions  matrix) 

*  time-discrete  transition  matrix 

B»  E »  Q  covariance  matrices 

I  gain  matrix 

x’ ,  P'  predicted  state  vector  or  covariance  matrix 

H  measurement  matrix 

q,,,  qvv  square  root  of  the  diagonal  elements  of  the  matrix  Q 

MATHEMATICAL  OPERATIONS 


E 

S-a*  J 


A 

f  (x) 
i 


expectation  operator 
cross-product 
identity  matrix 
error  term 
difference  term 
function  of  x 
dx/dt 

transponse  of  a  vector  or  matrix 
mean  value 


1 .  INTRODUCTION 

Inertial  navigation  systems  (INS)  are  used  in  civil  and  military  aviation  and  are 
needed  in  all  cases  where  autonomous  navigation  is  essential  (space  flights,  missiles, 
navigation  on  land  and  at  sea) .  Up  to  now  most  of  the  INS  have  had  a  platform  mechanisa¬ 
tion!  gyros  and  accelerometers  are  mounted  on  a  platform  that  is  isolated  cardanically 
from  the  angular  motions  of  the  carrier.  A  double  integration  of  the  accelerometer  sig¬ 
nals  gives  the  ground  speed  and  the  position  of  the  carrier.  The  heading  and  attitude 
angles  are  contained  in  the  directions  of  the  gimbals. 

If  the  sensors  are  “strapped  down“  on  the  carriers  directly  no  gimbals  and  servo¬ 
motors  are  necessary.  This  type  of  INS  mechanisation  is  called  a  strapdown  system  (SDs) . 
The  accelerometer  signals  measured  in  a  body-fixed  coordinate  frame  are  transformed  to  a 
navigational  reference  frame  by  means  of  the  gyro  signals.  This  results  in  the  following 
advantages  in  comparison  with  the  so-called  platform  systems  /I / t 

-  simple  mechanical  construction 

-  the  provision  of  accelerations  and  angular  rates  in  body-fixed  axes 

-  easy  maintenance  due  to  the  modular  construction 

-  the  economical  provision  of  redundancy  by  means  of  skewed  sensitivity  axes. 

However,  against  these  advantages  must  be  weighed  certain  drawbacks! 

-  increased  demands  on  the  efficiency  of  the  navigation  computer 

-  extreme  demands  on  the  accuracy  of  the  sensors,  which  have  to  measure  the  full 
dynamic  environment  of  the  SDS.  In  contrast  to  the  platform  mechanisation,  the  gyros 
do  not  operate  as  sensors  for  zero  signals. 

The  accuracy  levels  of  the  inortlal  systems  are  frequently  insufficient  on  their  own, 
and  for  this  reason  additional  external  measurements  arc  used,  such  as  those  provided  by 
radar  equipment,  TACAN  facilities,  MLS  and  so  on.  The  strapdown  signals  are  exact  in  a 
short  time  period  but  the  navigation  errors  increase  with  the  time.  The  combination  of 
this  INS  error  behaviour  with  the  complementary  error  statistics  of  the  radio  signals  is 
often  carried  out  using  the  methods  of  optimal  filtering.  Here  it  Is  important  that  tho 
dynamic  error  behaviour  of  the  SDS  should  be  carefully  modelled,  i.e.  described  mathe¬ 
matically.  The  conformity  of  the  error  model  with  the  real  error  behaviour  of  the  SDS  is 
of  major  importance  for  the  accuracy  level  of  an  aided  SDS.  In  the  development  of  suitable 
error  models  it  is  necessary  to  find  a  compromise  between,  on  the  one  hand,  a  mathemati¬ 
cally  simple  description  of  the  error  behaviour  that  satisfies  real-time  computation  re¬ 
quirements  and  as  far  as  possible  creates  no  numerical  problems  and,  on  the  other  hand, 
a  determination  of  the  actual  error  behaviour,  Including  all  Important  sources  of  error, 
which  ia  as  accurate  as  possible.  In  comparison  with  the  platform  systems  it  Is  far  more 
difficult  to  find  such  a  compromise  in  the  case  of  SDs,  since  the  sensor  errors,  and 
therefore  the  system  errors  as  well,  depend  to  a  very  large  extent  on  the  dynamic  en¬ 
vironment  of  the  SDS.  According  to  the  structure  of  a  particular  flight  path  it  is  pos¬ 
sible  for  s  few  isolated  sensor  errors  or  a  large  number  of  different  sensor  errors  to 
have  an  effect  at  system  level.  The  structure  of  the  flight  path  can  change  several  timea 
during  a  flight,  e.g,  if  there  are  sectlona  with  only  a  few  manoeuvres  or  even  with  s 
straight  flight,  or  sections  with  extreme  manoeuvres  like  the  terrain  following  flights. 

An  error  model  that  conforms  to  the  real  SDS  error  behaviour  for  all  possible  cases  will 
thus  generally  lead  to  an  unacceptably  sophisticated  error  model  with  regard  to  the  real¬ 
time  computations  or  numerical  problems  Involved.  In  this  paper  the  attempt  is  made  to 


solve  the  problem  by  using  adaptive  error  models  and  adaptive  filtering,  thereby  avoiding 
some  of  the  drawbacks  of  SDS.  As  an  example  measured  and  simulated  strapdown  data  were 
used  to  demonstrate  and  to  check  the  methods  suggested  here. 


2.  TECHNIQUES  FOR  THE  DEVELOPMENT  AND  CHECKING  OF  SDS  ERROR  MODELS 


The  error  model  that  will  be  developed  can  only  be  checked  properly  if  adequate  crite¬ 
ria  are  available  to  check  the  conformity  between  the  model  and  real  world.  A  high-order 
error  model  often  serves  as  a  reference  for  discussing  a  simplified  low-order  error  mo¬ 
del  that  has  been  derived  from  it.  The  differences  between  the  two  error  models  can  then 
be  interpreted  statistically  in  terms  of  covariance  matrices. 

A  disadvantage  of  these  techniques  lies  in  the  assumption  of  a  linearized  high-order 
model  as  a  reference  model,  which  leads  to  problems  in  the  case  of  SDS.  It  is  scarcely 
possible  to  express  nonlinear  algorithm  errors  of  the  strapdown  navigation  equations, 
for  instance,  in  a  linear  state  space  notation.  In  these  conditions  it  proved  to  be  bet¬ 
ter  to  include  nonlinear  effects  in  a  SDS  simulation  and  to  interpret  the  results  of 
the  simulation  as  a  reference  system  corresponding  to  the  real  error  behaviour.  It  is 
then  possible  to  compare  the  sensor  and  system  errors  estimated  by  a  Kalman  filter  that 
is  to  operate  on  the  basis  of  simplified  error  models  with  the  known  errors  derived  from 
the  simulation  results.  It  is  also  possible  to  check  the  covariance  matrix  P  from  the 
self-diagnosis  of  the  Kalman  filter  by  determining  the  expectation  values  with  reference 
to  the  differences  from  the  simulated  and  estimated  errors. 


In  the  subsequent  chapters  both  methods  are  applied  in  a  complementary  manner.  The 
construction  of  the  simulation  of  a  SDS  is  described  in  detail  in  111 . 

2.1  GENERAL  ERROR  EQUATIONS  OF  STRAPDOWN  NAVIGATION  SYSTEMS 


The  derivation  of  the  strapdown  error  model  is  based  on  the  navigation  equations,  a 
knowledge  of  which  is  assumed  here  /3,  4/.  The  equations  can  be  given  by 
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The  individual  navigation  equations  ate  differentiated  with  respect  to  the  errors 
«Snb  *  «vn  ,  4*  ,  41  ,  4h  ,  4a^  ,  4W*.  (fl) 

and  the  following  relation  is  used 


•Sa,  *  ’  [£.*»]•  S„b  <♦' 

Then,  after  carrying  out  numerous  transformations,  approximations  and  rearranging*, 
one  obtains  the  linearised  system  error  model  in  a  state  space  notation.  The  vertical 
speed  and  the  vertical  position  must  be  aided  by  the  signals  of  a  barometric  altimeter 
/S /.  The  causes  of  eensor  errors  that  can  be  identified  are  listed  in  Tab.  1  below.  The 
values  given  here  are  based  on  the  data  from  the  TBLBDYNE  SDG5  gyro  and  the  SYSTRON  DOW¬ 
NER  4833A-1PX  accelerometer,  and  are  interpreted  as  incapable  of  further  compensation. 


The  sensor  errors  listed  in  Tab.  1  cause  misalignments  or  velocity  errors  at  the 
system  error  level  corresponding  to  the  integrals 


4w^ (t) 


t 

4v„  *  /  £  .  (t)  •  4a.  (t)  •  dt  (11) 

— n  ‘  =nb  — b 

o 

Some  of  the  sensor  errors  can  be  ignored  as  regards  their  effects  at  system  level  in 
the  case  of  realistic  flight  paths  in  which  high  angular  rates  or  high  accelerations  only 
occur  in  periods  that  are  substantially  shorter  than  the  total  flying  time,  ha  regards  the 
unbalance  drift  of  the  gyros 


a  misalignment  corresponding  to  the  fixed  drift  of  0.01  °/h  only  occurs  if  the  horisontal 
accelerations  a,,  a,  are  O.S  g  for  the  whole  duration  of  the  flight.  In  realistic  flight 
paths  this  is  only  possible  by  means  of  constant  manoeuvring.  However,  this  requires  high 
angul'.r  rates  which,  in  combination  with  the  misalignment  of  the  sensitivity  axes  of  the 


contribute  so  much  to  the  misalignment  at  system  level  that  the  unbalance  drifts  can  be 
ignored.  Thus  sm  angular  rate  of  only  1  °/sec  produces,  with  a  misalignment  of  S  arcs,  a 
drift  of  approx.  0.1  °/h  according  tv>  Kg. (13).  The  drift  caused  by  the  vertical  component 
is  a  constant  o.o?  °/h  in  flights  thbt  contain  fewer  manoeuvres  and  can  be  considered  as 


a  fixed  drlf*.  of  the  gyros.  A  corresponding  conaideration  holds  good  for  the  anisoelastt- 
citv  drifts,  ilils  requires  constant  horiicmtal  accelerations  of  0.3  g  to  produce  a  drift 
of  0.01  °A,  and  cav,  therefore  also  be  ignored  at  system  level. 


The  nrror  modal  for  the  accelerometers  can  also  be  simplified  considerably  for  rea¬ 
listic  flight  path  on  the  basis  of  Eq.tll).  in  F»g.  1  the  individual  errors  are  repre¬ 
sented  a*  functions  of  the  maximum  accelerations  that  occur.  For  accelerations  up  to  a 
maxlfeius.  of  1  g  it  is  Sufficient  to  model  the  bias  and  the  axes  misalignment  of  the  »c- 
celerc&uters,  since  even  with  a  constant  acceleration  of  1  g  the  errors  are  still  ten 
times  smaller. 


Similar  considerations  for  all  important  sensor  errors  leave  11  gyro  and  6  accelero¬ 
meter  errors.,  i.o.  a  total  of  23  eenaor  errors.  The  error  model  shown  In  Fig.  2  contains 
a  total  of  33  state  variables  which  can  be  subdivided  as  follows; 


3  angular  errors 
3  velocity  errors 
3  position  errors 

1  bias  of  the  barometric  altimeter 
3  axsa  misalignments 
3  fixed  drifts 
3  fixed  scale  factor  errors 
3  asymmetry  scale  factor  errors 
3  quadratic  nonlinearity  scale  factor 
1  angular  acceleration  drift 
1  anlsotnertia  drift 
3  axes  misalignments 
3  bias  errors 


9  system  errors 
1  altimeter  error 

j?  \  i  gyro  arrors 
errors  j 

S  acislatttlon  errors 


In  contrast  to  a  platform  mechanisation,  the  sensor  errors  now  produce  the  errors 
at  system  level  via  the  transformation  matrix  Cnt) ,  CnD  itself  is  a  trigonometric  function 
of  the  actual  attitude  and  heading  angles.  In  addition  the  sensor  errors  depend  on  the 
dynamic  environment  of  the  SDS.  All  together  this  leads  to  a  total  error  behaviour  of  a 
SDS  that  depends  to  a  very  large  extent  on  the  actual  manoeuvres  flown.  The  number  of 
sensor  errors  having  an  effect  at  system  level  varies  considerably  according  to  the  par¬ 
ticular  flight  path. 

2.2  PROCEDURES  EOF,  THE  EXAMINATION  OF  ERROR  MODELS  FOR  STRAPDOWN  SYSTEMS 

Fig.  3  shows  the  construction  of  the  software  for  the  examination  of  error  models 
for  SDS.  It  consists  essentially  of  4  parts: 

-  the  simulation  of  an  unaided  SDS,  including  the  calculation  of  the  reference  flight 
path, 

-  the  design  of  simplified  low-order  models  by  means  of  the  covariance  analysis  methods 
and  on  the  basis  of  simulated  as  well  as  measured  sensor  signals, 

-  the  Kalman  filter  that  contains  simplified  low-order  error  models  for  the  SDS  and  at 
least 

-  the  comparison  of  assumed  sensor  errors,  or  of  system  errors  determined  from  the  simu¬ 
lation,  with  errors  estimated  by  Kalman  filtering. 

The  simulation  of  the  SDS  shown  in  the  upper  part  of  Fig. (3)  begins  with  the  selec¬ 
tion  of  tracks,  flight  path  angles  and  velocities.  Based  on  this  input  the  sensor  sig¬ 
nals  are  calculated  for  the  gyros  or  the  accelerometers  in  terms  of  angle  or  velocity 
increments  and  are  then  fed  into  the  sensor  error  model.  The  corresponding  errors  of 
position,  velocity  and  attitude  and  heading  angles  are  then  obtained  by  comparing  the 
results  of  the  subsequent  navigation  calculation  with  the  data  of  the  reference  flight 
path.  All  these  values  are  stored  on  a  magnetic  disc  for  further  use.  The  software  for 
the  simulation  of  a  SDS  has  been  developed  at  DFVLR  and  a  detailed  description  is  con¬ 
tained  in  I1{. 

For  the  design  of  simplified  low-order  models  4, ,  the  high-order  reference  error 
models  1  were  used  and  for  both  the  discrete  error  and  covariance  propagation  equations 
were  calculated. 
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The  statistical  interpretation  of  the  difference  between  the  corresponding  compo¬ 
nents  of  the  state  vectors  x,  and  x  leads  to  the  design  of  the  system  noise  matrix  g. 

The  Kalman  filter  algorithms  that  now  follow  use  the  data  from  the  simulated  SOS 
and  are  based  on  the  simplified  low-order  wodels  developed  by  eovarlance  analysis  cal¬ 
culations.  The  state  vector  covers  3  jsoBltlon  errors,  3  velocity  errors  and  3  angular 
errors,  as  veil  as  a  certain  number  of  sensor  errors,  such  as  gyro  drifts,  scale  factor 
errors,  accelerometer  bias  etc.  The  measurement*  required  for  the  Kalman  filter 
can  be  obtained  from  the  position  signals  of  the  simulated  reference  flight  path,  which 
l®  known  exactly.  In  that  case  the  reference  flight  path  is  considered  as  an  external 
measurement  carried  out  by  a  tracking  radar  system.  Together  with  the  initial  covariance 
matrix  the  system  noise  matrix  g,  and  the  covariance  matrix  assumed  for  the  simulated 
measurement*:  the  Kalman  filter  algorithms  Have  all  the  necessary  output  data  at  their 

disposal.  The  principles  of  Kalman  filtering  for  aided  Inertial  navigation  systems  have 
been  described  in  detail  In  /S,  ?,  8/.  The  result  of  the  filtering  is  the  estimation 
vector  fi  and  the  covariance  matrix  £. 

At  the  end  of  the  programme  system  the  comparison  Is  made  between  the  assumed  sen¬ 
sor  error®,  or  system  errors  determined  from  the  simul  ited  SOS,  and  the  estimated  errors 
based  on  simplified  low-order  models.  The  differences  determined  In  this  way  correspond 
to  the  so-called  "true  estimation  error®"  and,  according  to  the  theory  of  Kalman  filter¬ 
ing,  must  correspond  with  the  predicted  covariance  matrix  £.  This  relationship  Is  ex¬ 
pressed  by  Kq. (18) 

£  -  E  (xgTJ  (18) 

Roughly  speaking  the  Eq.dS)  means  that  the  estimation  errors  must  lie  within  the 
le-llmlts  defined  by  the  diagonal  elements  of  the  covariance  matrix  £.  The  aim  when  de¬ 
riving  simplified  low-order  models  is  to  use  as  few  state  variables  as  possible,  while 
still  satisfying  Eg. (18),  and  in  so  doing  to  obtain  acceptable  values  for  the  le-Hfcit*. 

2.3  THE  SELECTION  OP  3  TYPICAL  FLIGHT  PATHS 

The  decisive  factor  In  the  error  behaviour  of  a  SDS  is  the  dynamic  environment  of 
the  system,  and  for  this  reason  3  examples  of  typical  flight  paths  will  be  given  to 
llluattate  this  error  behaviour.  Pig.  4  contains  the  flight  paths  and  the  corresponding 
flight  levels,  and  Pig.  S  shews  the  simulated  and  measured  bank-rates  and  cross-accele¬ 
rations. 


FLIGHT  PATH  1 

Flight  path  1  is  a  simulated  straight  flight  of  approx.  100  min  duration,  containing 
4  full  turns.  The  flight  path  is  at  a  constant  altitude  and  the  ground  speed  is  approx. 
300  kts.  This  flight  path  is  selected  to  simulate  the  structure  of  the  flight  path  of  a 
civil  airliner  flying  from  a  point  A  to  a  point  B  as  directly  as  possible.  Angular  rates 
anti  accelerations  only  occur  during  the  initiation  and  conclusion  of  the  turns  and  during 
take-off  and  landing,  and  their  values  then  are  approx.  +  15  °/s  or  +0.3  g. 

FLIGHT  PATH  2 

Flight  path  2  was  flown  by  a  F104  combat  aircraft.  A  SDS  made  by  the  TELEDYNE  com¬ 
pany  was  part  of  the  instrumentation  system  /9/.  The  path  contains  several  manoeuvres 
carried  out  at  a  speed  of  approx.  700  kts.  The  profile  of  the  altitude  contains  climb 
and  descent  phases,  with  descent  speeds  r,p  to  approx.  100  kts.  These  manoeuvres  produce 
bank-rates  of  up  to  80  °/s  and  cross-accelerations  of  up  to  0.3  g.  Noisa  terms  whose 
standard  deviations  can  be  estimated  at  approx.  7  °/s  or  0.1  g  are  superimposed  on  the 
sensor  signals  /10/. 

FLIGHT  PATH  3 

Flight  path  3  was  the  result  of  a  sophisticated  simulation  /2/  and  is  selected  to 
represent  a  remotely  piloted  vehicle  (RPV)  flight  path.  This  is  shown  clearly  by  the 
altitude  profile,  which  contains  a  simulation  of  "terrain  fcllowing".  The  flight  path  is 
based  on  the  following  parameters: 

-  catapult  take-off  with  a  flight  path  angle  of  20°  and  acceleration  to  a  speed  of 
275  m/s, 

-  total  flying  distance  approx.  280  km,  flying  time  approx.  40  min, 

-  altitude  profile  during  the  mission:  high-low-low-high,  with  a  total  of  2  low-level 
stages  (schematized  terrain  following)  having  a  total  length  of  approx.  31  km, 

-  vertical  manoeuvres  of  up  to  approx.  +  2  g, 

-  horizontal  turns  of  up  to  approx.  +  3  g. 

These  extreme  manoeuvres  produce  bank-rates  of  up  to  +  50  °/s,  and  accelerations  of  up 
to  5  g. 


3.  THE  DERIVATION  OF  ADAPTIVE  ERROP.  MODELS 

It  is  only  possible  to  model  a  few  sensor  error  coefficients  in  a  low-order  model 
that  is  suited  to  real-time  applications  within  the  Kalman  filter  algorithms.  For  this 
reason,  if  one  only  models  the  fixed  gyro  drift,  for  instance,  the  actual  error  behaviour 
would  only  be  adequately  described  during  a  section  without  manoeuvres.  Durinc  a  full 
turn,  for  example,  a  misalignment  to  che  gyro  sensitivity  axes  of  6  arcs  at  an  angular 
rate  of  only  360  °/3  min  produces  a  system  misalignment  after  one  turn  of 

6d.  »  a  =  6  arcs  •  360  °/3  min  5  0.2  °/h  36  arcs/turn  (19) 

o  z 

This  value  is  20  times  larger  than  the  assumed  fixed  gyro  drift  of  0.01  °/h.  Howovor , 
if  only  a  few  sensor  error  coefficients  in  a  low-order  error  model  are  available  for  the 
whole  flight,  a  possible  solution  is  to  switch  between  various  simple  and  predefined  mo¬ 
dels.  The  criterion  for  switching  depends  on  the  dynamic  environment  of  the  SDS  at  a  gi¬ 
ven  moment,  i.e.  the  error  model  has  adaptive  qualities.  The  remaining,  non-iaodelled  part 
of  the  error  budget  of  the  SDS  can  then  be  interpreted  as  system  noise. 

The  angular  rate  u,  ,  for  example,  can  serve  as  a  simple  criterion  for  switching  bet¬ 
ween  2  different  low-order  sensor  error  models, 

rms  of  w  »  1  °/s  *  sensor  error  model  1  (20) 

rns  of  ®z  >  1  °/s  -»  sensor  error  model  2  (21) 

The  state  vector  x,  the  error  model  |  and  the  covariance  matrix  P  consist  of  one  part 
which  is  non-owitchable  (corresponding  to  the  system  errors)  and  another  part  which  can 
be  switched  (corresponding  to  the  sensor  errors).  If,  for  example,  one  proceeds  on  the 
basis  of  9  SDS  errors  and  models  the  sensors  by  means  of  3  coefficients,  the  roault  Is 
an  order  of  12  for  the  Kalman  filter.  If  the  verticul  components  can  be  calculated  se¬ 
parately  /II/,  a  total  of  only  10  state  variables  remain. 

-  *  CE'  CD'  4vN'  iVE'  4*'  4X'  d1*  d2'  d3^T'  (22) 
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non-switchable  system  errors  switcheblc  sensor  errors 

According  to  the  nature  of  the  dynamic  environment,  the  coefficient*  d,  to  dj  de¬ 
scribe  various  sensor  errors  for  the  duration  of  a  flight.  Fig.  6  shows  the  principle  of 
the  switching  procedure.  For  example,  the  procedure  could  start  with  the  state  vector  x, , 
the  error  model  ♦,  and  the  covariance  matrix  g(  .  At  this  moment,  x, ,  j,  and  g,  take  into 
account  the  fixe3  gyro  drift.  3  minutes  later  the  angular  rate  e breaches  the  rms  value 
of  2  °/s,  i.e.  the  criterion  for  switching  has  been  fulfilled,  fhe  elements  of  x, ,  j, 
and  g!  corresponding  to  the  sensor  errors  at  thin  moment  are  then  stored  on  the”co«*putcr 
buffer  and  the  vector  x,  and  the  matrices  P,  are  retrieved  from  the  buffer.  x}, 
and  P,  might  now  contain,  for  example,  in  addition  to  the  system  errors,  3  fixed“scsle 
factor  errors  of  the  gyros.  The  system  errors,  of  course,  are  non-switchable  but  trans¬ 
ferred  if  a  switching  procedure  occurs.  After  a  further  3  minutes,  for  exampls,  the  ran.- 


value  of  the  angular  rate  at  again  decreases  to  zero  and  the  storage  procedure  is  then 
repeated  in  reverse,  i.e.  x,,  and  P2  are  stored  in  a  buffer  and  x, ,  $1  and  Pt  are 
used  again  in  the  Kalman  filter  algorithms.  The  sensor  errors  estimated' in  the~period 
when  x, ,  and  Pj  were  valid,  are  now  used  again  as  initial  conditions. 

The  purpose  of  this  switching  procedure  is  constantly  to  model  those  sensor  errors 
in  the  error  model  that  predominate  in  the  error  budget.  Moreover,  since  the  Kalman  fil¬ 
ter  algorithms  adopt  the  results  of  the  last  estimates  as  initial  conditions  when  certain 
sensor  error  coefficients  are  used  again,  it  is  sufficient  to  have  relatively  short, 
though  frequently  recurring,  periods  in  order  to  improve  the  covariance  matrices  P.  The 
procedure  of  switching  is  especially  efficient  in  the  case  of  flight  paths  that  can  be 
divided  up  into  sections  where  only  a  few  but  different  sensor  errors  predominate.  In 
this  case  a  number  of  predefined  low-order  sensor  error  models  are  sufficient  to  describe 
the  overall  system  error  behaviour  for  the  whole  duration  of  a  flight.  In  order  to  estab¬ 
lish  low-order  sensor  error  models  that  are  to  describe  the  real  system  error  behaviour 
with  sufficient  precision,  it  is  necessary  first  to  determine  the  predominant  sensor 
error  coefficients. 


3.1  LOW-ORDER  SENSOR  ERROR  MODELS  FOR  FLIGHT  PATH  1 


The  flight  paths  of  commercial  airliners  are  characterized  by  their  small  number  of 
manoeuvres,  which  include  relatively  low  angular  rates  and  correspondingly  low  values  for 
the  accelerations.  This  means  for  the  error  budget  of  a  SDS  that  the  fixed  gyro  drift 
produces  the  main  contribution.  The  errors  proportional  to  u  are  effective  only  for  the 
duration  of  the  manoeuvres  (e.g.  changes  of  course,  holdings) .  Assuming,  as  above,  that 
only  a  few  manoeuvres  occur  -  i.e.  that  the  duration  of  the  manoeuvres  is  considerably 
smaller  than  the  total  flying  time  -  then,  in  principle,  on  account  of  the  integral 
Eq. (10)  only  the  sensor  errors  dependent  on  u,  will  produce  a  system  error  which  can  no 
longer  be  ignored  in  comparison  with  the  contribution  of  the  fixed  gyro  drifts.  For  the 
given  flight  path  1,  which  contains  a  total  of  4  full  turns  with  a  duration  of  3  min 
each,  a  ratio  of  12  min/100  min  for  the  angular  rate  is  obtained.  The  other  compo¬ 
nents  of  the  angular  rates  u»_ ,  only  occur  for  a  short  time  during  the  beginning  and 
the  end  of  the  turns,  and  all  the  sensor  errors  dependent  on  ,  w,  can  therefore  be 
ignored  in  the  system  error  budget.  The  following  6th-order  gyro  error  model  is  lefti 
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As  regards  the  accelerometer  errors  for  flight  path  1  it  is  sufficient  tc  consider 
only  the  bias  values,  because  a  commercial  airliner  usually  flies  standard  turns  charac¬ 
terized  by  negligible  accelerations. 


6% 


B°  B°)T 

y  z 


Thus  for  flight  path  1  the  following  sensor  error  models  are  compared t 
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model  1/1 > 
model  1/2* 
model  1/3 i 
model  1/4* 
model  1/5  * 


fixed  gyro  drift 
accelerometer  bias 

fixed  scale  factor  error  D«  and  axes  misalignment  of  the  gyros  t’y,  tfy 
switch  between  the  models  1/1  and  1/3 

this  model  contains  all  23  sensor  errors  and  le  considered  as  a  reference. 


Fig.  7  contains  the  time  histories  of  the  angular  error  *0  for  the  e  different  error 
models  mentioned  above.  It  can  be  seen  how  the  overall  system  error  is  ptoduced  by  the 
individual  sensor  errors*  the  approximate  linear  increase  of  is  caused  by  the  fixed 
gyro  drift  (model  1/1)  and  the  system  error  contributions  of  the  fixed  scale  factor  er¬ 
ror  and  the  axea  misalignment  (model  1/3)  correaponds  clearly  to  the  manoeuvres  simula¬ 
ted.  During  the  turns  the  fixed  scale  factor  error  leads  to  s  system  misalignment  that 
could  be  calculated  as  follows* 


«_  *  o'?  •  •  AT  »  3  •  to”5  •  •  3  Sin  =  0.01°  (27) 

D  z  3  min 

This  value  can  be  read  off  directly  from  the  height  of  the  steps  in  the  time  histo¬ 
ry  of  the  system  misalignment  caused  by  model  1/3.  During  tho  straight  lines  of  flight 
path  1  model  1/3  does  not  produce  any  additional  error  contribution.  Because  the  con¬ 
tribution  of  the  accelerometer  bias  (model  1/2}  Is  rather  small  model  1/4,  which  is  able 
to  switch  between  models  1/1  and  1/3,  fits  the  reference  error  behaviour  of  model  1/5 
best. 


3.2  LOW-ORDER  SENSOR  ERROR  MODELS  FOR  FLIGHT  PATH  2 

Flight  path  2  flown  by  a  F104  combat  aircraft  contains  a  large  number  of  extreme 
manoeuvres.  The  discussion  of  the  causes  of  errors  by  using  plausibility  methods,  as  In 
flight  path  1,  is  no  longer  possible  in  this  case.  For  this  reason,  sll  the  sensor  errors 
were  analysed  individually  with  reference  to  their  effect  at  system  level*  this  was  done 
by  calculating  the  corresponding  error  propagation  Eq. (14,16).  Especially  striking  in 
this  connection  was  the  large  error  contribution  of  the  anisoertla  tens 
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A  further  error  contribution  is  provided  by  the  sensor  errors  proportional 
angular  rates.  As  has  already  been  shown  even  small  angular  rates  lead  to  large 
errors.  For  this  reason  the  following  sensor  error  models  were  compared: 


to  the 
system 


model  2/1 : 
model  2/2: 
model  2/3: 
model  2/4: 
model  2/S: 


fixed  gyro  drift  I  i 
scale  factor  error  d“,  D^1 

anisoertia  term  D*“  i  i  . 
switch  between  model  2/1  ard  a  model  consisting  of  D»,  Dj  1  and  DQ 
reference  model  containing  23  sensor  error  coefficients. 


Fig.  8  shows  the  time  histories  of  the  east  position  errors.  The  model  2/3  provides 
a  position  error  that  is  too  large  in  comparison  with  the  reference  values  (model  2/5) . 
The  fixed  gyro  drift  (model  2/1)  and  the  scale  factor  errors  (model  2/2)  produce  a  posi¬ 
tion  error  with  a  negative  sign.  The  time  history  of  the  error  behaviour  caused  by  the 
model  2/4  is  nearest  the  reference.  At  the  end  of  this  real  flight  of  approx.  40  min,  the 
following  error  budget  is  obtained: 


2/1 

2/2 

2/3 

2/4 

2/S 

east  position  error 
after  40  min 

-500  m 

-500  m 

6000  m 

5000  m 

5000  m 

The  effect  whereby  some  sensor  errors  are  cancelled  out  depends  on  the  sign  of  the 
sensor  error  coefficients  and  the  type  of  manoeuvres  flown.  If  the  aircraft  executes  con¬ 
trary  manoeuvres  such  as  the  following  bank  angle  commands  0/30°/0°/-30o/0of  then  some 
errors  will  always  cancel  each  other  out  to  some  extent.  In  this  case  a  position  error 
during  flight  can  be  partly  reduced  by  selecting  appropriate  manoeuvres. 

3.3  LOW-ORDER  SENSOR  ERROR  MODELS  FOR  FLIGHT  PATH  3 


Similar  considerations  obtain  for  flight  path  3,  which  is  meant  to  simulate  a  RPV 
path,  as  for  flight  path  2.  High  angular  rates  and  accelerations  were  assumed  in  order  to 
simulate  the  extreme  dynamic  environment  of  a  SDS  installed  on  a  RPV.  After  the  calcula¬ 
tion  of  all  the  individual  sensor  errore  the  following  5  sensor  errbr  models  are  left  for 
comparison: 


model  3/1: 
model  3/2: 
model  3/3: 
modol  3/4: 
model  3/S: 


fixed  gyro  drift 

fixed  scale  factor  error  o“,  0“ 

anisoertia  term  0*"  . 

switch  between  model  3/1  and  a  model  consisting  of  dJ1,  D,  and  D0i 
reference  modol  identical  to  tho  models  1/5  and  2/5. 


The  effect  of  error  contributions  on  the  system  level  with  different  signs  is  espe¬ 
cially  noticeable  in  the  time  histories  of  the  east  position  errors  of  flight  path  3 
shown  in  Fig.  9.  While  the  contribution  of  tho  anisoertia  term  (model  3/3)  Increases  up 
to  approx.  -4  km,  the  fixed  scale  factor  errors  (model  3/’)  produce  a  position  error  of 
up  to  spprox,  2  km.  The  effect  of  the  fixed  gyro  drift  is  negligible  in  tho  simulated 
RPV  flight  path  3  (model  3/1).  Error  model  3/4,  which  switches  between  the  error  models 
defined  sbove,  describes  with  sufficient  accuracy  the  position  error  caused  by  all  the 
different  sensor  errors.  The  rat.k  of  the  sensor  error  model  3/4  Is  only  3. 


3.4  THE  QUALITIES  OF  THE  SELECTED  ADAPTIVE  SENSOR  ERROR  MODELS 


For  flight  path  1,  2  and  3  sensor  error  models  1/4,  3/4  and  3/4  are  selected  as 
being  suitable  for  use  for  Kalman  filtering  and  real-time  applications.  These  sensor 
error  models  oxhlbit  the  following  qualities: 

-  The  sensor  error  models  contain  only  3  state  variables,  i.e.  the  rank  of  the  system 
error  model  Is  12,  or  only  10  in  tho  esse  of  separate  vortical  velocity  and  vertical 
position  errors. 

-  Switching  between  these  error  models  in  accordance  with  different  flight  sections  with 
many  or  few  manoeuvres  guarantees  adequate  adaptive  qualities. 

-  The  non-modelled  sensor  errors  do  not  produce  large  system  errors  and  can  thus  be  in¬ 
terpreted  as  system  nolee  modelled  in  the  system  noise  matrix  Q. 


4.  THE  USE  OF  SENSOR  ERROR  MODELS  FOR  KALMAN  FILTERING 

In  the  preceeding  Section  tht  attempt  was  mod*  to  determine  those  sensor  errors  that 
produce  the  largest  contributions  to  tha  error  budget  at  eyetem  level.  If  only  a  few  sen¬ 
sor  errore  define  the  system  error,  it  is  possible  to  model  the  corresponding  sensor  er¬ 
ror  coefficients  In  a  low-order  sensor  model.  However,  It  may  happen  that  all  the  seneor 
errors  explained  in  Section  2  cause  large  eyetem  errore  and  that  none  of  the  tensor  er¬ 
ror  coefficients  can  be  ignored  in  comparison  with  any  other.  In  this  case  s  Kalman  fil¬ 
ter  would  have  a  rank  up  to  33  and  this  la  bound  to  lead  to  numerical  problems  or  to 


violate  real-time  computational  conditions. 

4.1  NUMERICAL  PROBLEMS  IN  HIGH-ORDER  ERROR  MODELS 

Numerical  problems,  for  example  round-off  errors,  often  result  in  negative  elements 
within  the  trace  of  the  covariance  matrix  P.  These  negative  elements  are  not  defined  at 
all  and  have  fatal  consequences  for  the  filtering.  In  order  to  avoid  such  problems  the 
methods  of  formulating  the  filter  equations  in  terms  of  triangular  matrices  can  be 
applied.  As  the  comparison  between  various  procedures  has  shown  /1 2/,  these  methods  ge¬ 
nerally  involve  an  increase  in  the  number  of  calculations  in  comparison  with  conventional 
filter  equations.  The  advantage  of  avoiding  numerical  problems,  e.g.  negative  elements 
within  the  trace  of  the  covariance  matrix  P,  must  therefore  usually  be  paid  for  with  an 
increase  in  computational  operations. 

In  order  to  save  the  number  of  computational  operations,  it  is  possible  to  take 
into  account  the  structures  of  the  discrete  transition  matrix  ♦  and  the  covariance  matrix 
P.  Most  of  the  elements  of  these  matrices  are  always  zero.  Using  the  system  error  model 
without  any  sensor  error  coefficient  and  the  matrix  4t2  for  the  cross-coupling  of 
the  sensor  errors  via  the  transformation  matrix  CBb  to  tSe  system  error  level,  the  co- 
variance  propagation  equation  of  the  Kalman  filter  can  be  stated  as  follows: 


=11  =12 

P  (k+1 )  = 

“  0  1 
at  ta 

With  the  symmetrical  matrices  F  and  Q 
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is  follows  by  multiplication 

£n(k+1>  “  Kill  in  +  in  *nJ{k)  +  Si2(k+1>  *  iT2(k>  <31> 

P12(k+1)  -  l(4n  P,2  +  i12  £22)  $*2](k)  (32) 

g22(k+1)  -  P22(k)  (33) 

Using  x,  for  the  state  variables  without  sensor  error  coefficients  and  x2  for  the  sensor 
error  the  propagation  equation  for  the  state  vector 


N  ‘J  NJ 

can  be  stated  in  the  simplified  form 

x,  (k+l)  *4n(k)  x^k)  +  412(k)  Xj(k)  (35) 

x2  (k+1)  -  x2(k)  (36) 

The  saving  in  the  number  of  multiplications  can  be  calculated  by  using  nm#1  for  the 
maximum  number  of  state  variables,  which  is  the  sum  of  n,  system  errors  and  nt  sensor 
errors.  In  the  case  of  the  usual  oatrix-vector-operations  a  maximum  number  of  multi¬ 
plications  is  necessary: 

“max  *  2tW  +  "L  (37) 

propagation  of  P  propagation  of  x 

In  accordance  with  the  simplified  Eq.(31,  32,  35)  it  follows  for  the  minimum  number 
aail  of  multiplications  with  PJ}  as  a  diagonal  matrixi 

12  2  2  2  2 
afcin  ■  2n't  ♦  2n1n2  +  n,n2  ♦  2n,n2  ■*  n,n2  ♦  n,n2  +  n^  ♦  n^  (38) 

V  I  . ■ -I  III  ■  »  »■  m’j  Vs  1  ...  mt—  i  ■  mmf  \  i  y  ■  i  ^ 

propagation  of  propagation  of  P1 2  propagation  of  x 

or,  by  aunaing  the  terms,  the  factor  n  of  the  savings  in  the  number  of  multiplications 
can  be  given  thus: 


2(n1+n2)3  ♦  (n^nj)2 

— * - j — - r 

2n,  ♦  6n1»2  ♦  +  ni 
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Fig.  10  shows  the  function  n=f(nj)  on  the  basis  of  a  fixed  number  of  system  errors: 

3  angular  misalignments,  3  velocity  errors,  3  position  errors  and  the  altimeter  bias. 

Although  there  is  a  saving  in  the  number  of  multiplications  by  taking  into  account 
the  structure  of  the  matrices  *  and  P,  the  calculations  still  need  too  much  time  for  a 
high-order  model.  If  one  tr.les~to  mo3el  10  error  coefficients,  this  would  lead  to  a  num¬ 
ber  m  of  multiplications  of 

m  =  2n3/n(n2)  -  2n3/2  -  8000  (40) 

A  digital  computer  with  a  floating  point  processor  and  an  assumed  time  comsumption 
of  approx.  60  ps  for  one  multiplication  will  thus  need  approx.  0.5  s  merely  for  the  co- 
variance  and  error  propagation  equations.  It  can  be  seen  from  this  very  rough  estimate 
that  it  is  essential  to  use  low-order  models  in  real-time  applications.  If  the  order  of 
the  Kalman  filter  is  limited  to  a  maximum  number  of  13  to  15  state  variables,  numerical 
problems  do  not  arise  and  real-time  applications  remain  possible. 

4.2  THE  INTERPRETATION  OF  THE  NON-MODELLED  SENSOR  ERROR  COEFFICIENTS  AS  A  SYSTEM  NOISE 

There  is  another  basic  possiblitiy  of  describing  the  error  behaviour  of  SDS  for 
flight  paths  in  which  a  large  number  of  sensor  errors  are  effective  at  system  level  and 
the  numerical  problems  of  using  high-order  error  models  cannot  be  solved  satisfactorily. 
This  is  the  interpretation  of  the  non-modelled  sensor  error  coefficients  as  system  noise 
in  terms  of  the  matrix  Q.  Roughly  speaking,  the  covariance  propagation  equation  supplies 
over-optimistic  values  In  the  case  of  non-modelled  sensor  error  coefficients.  This  effect 
can  be  compensated  for  by  a  corresponding  increase  in  the  elements  of  the  system  noise 
matrix  g.  However,  this  procedure  leads  to  problems  if  essential  error  contributions  de¬ 
pendent  on  the  sign  of  the  sensor  signals  are  interpreted  as  unbiased  random  signals. 

This  applied,  for  example,  in  the  case  of  a  fixed  scale  factor  error  of  the  gyro:  if  the 
aircraft  performs  a  left-hand  turn  and  then  a  right-hand  turn,  the  system  misalignments 
caused  by  these  sensor  error  coefficients  cancel  each  other  out  to  some  extent.  The  in¬ 
terpretation  of  these  sensor  error  coefficients  as  corresponding  system  noise  would  make 
the  covariance  matrix  worse  -  irrespective  of  the  particular  direction  of  the  turns. 
Therefore,  this  method  leads  to  covariance  matrices  that  assume  over-pessimistic  values 
in  the  case  of  contrary  manoeuvres.  Thus,  of  the  sensor  error  coefficients  defined  in 
Section  2,  those  that  are  basically  suitable  for  description  by  means  of  noise  matrices 
are  the  fixed  gyro  drifts,  the  accelerometer  bias  and  the  quadratic  non-linear  scale 
factor  error  of  the  gyros. 

Of  course,  the  sensor  error  coefficients  which  are  taken  into  account  in  the  systom 
noise  matrix  g  do  not  occur  in  the  form  of  state  variables  and  therefore  estimation  by 
means  of  Kalman  filtering  is  no  longer  possible.  For  this  reason  non-modelled  sensor  er¬ 
ror  coefficients  should  only  be  interpreted  as  system  noise  if  their  contribution  to  the 
overall  system  error  behaviour  is  relatively  small.  When  the  corresponding  elements  of 
the  noise  matrix  g  are  being  determined,  the  gyro  errors  are  to  be  writton  in  angular  terms 
and  the  accelerometer  errors  in  velocity  terms. 

If  the  system  matrix  »  is  approximated  by  means  of  the  identity  matrix  J  for  a  suffi¬ 
ciently  short  interval  T#,“then  the  following  applies: 

ln(k41)  -c^k)  4Ta  •  £nb(k)  •  4«fa(k)  (41) 

4v  (k-H)  »  4v  (k)  ♦  T.  •  C  .  (k)  •  4a.  (k)  (42) 

— n  — n  a  *nb  — d 

When  5  J,  it  follows  for  the  variances  s  that 

o3  (k4l)  -  u2(k)  4  Tft2  -  Et4wb(k)  •  ««J(k)l  (43) 


With  the  integration  of  the  terms  q’£  up  to  the  time  k*T, ,  variances  for  the  mis¬ 
alignments  c  are  obtained  in  accordance  with 

a2  (k-T  )  -  kq><*  (44) 

C  a  — k  V 

In  order  to  obtain  a  standard  deviation  o  that  corresponds  to  the  basic  relation 
misalignment  *  drift  multiplied  by  time  at  the  end  of  the  observed  interval  kTa,  the  fol¬ 
lowing  equation  has  to  be  fulfilled: 

\  *  <kTa)  -  ^kq*/  (4S) 

By  rearrangement  one  obtaina  a  term  in  the  form  of  a  epectral  density  for  determin¬ 
ing  the  elements  qu  of  the  system  noise  matrix  Q 

2*cc  *  k‘Ta  *  *  ‘sJ1  1  r*d2'»> 

or,  by  a  corresponding  calculation  for  the  accelerometer  errors: 


(46) 


(47) 


q^,  =  kTg  •  E[iab  •  iab]  [  (m/s)2/s] 

For  the  fixed  gyro  drift  value  of  0.01  °/h  or  the  accelerometer  bias  of  10 
following  numerical  values  are  obtained: 


g  the 


q2ee  =  2.37  .  10-15  •  kTa  [1/s] 


(48) 


=  10'6  •  kTa  [(m/s)2/sl  (49) 

For  the  remaining  sensor  error  coefficients  the  calculation  of  the  expectation 
values  is  necessary.  The  sensor  signals  consist  generally  of  a  low-frequency  part  that 
describes  the  flight  path,  and  superimposed  high-frequency  vibrations.  Statistically 
speaking,  they  represent  non-stationary  time  series.  The  calculation  of  the  expectation 
values  consequently  poses  considerable  problems:  for  example,  the  variances  that  have 
been  determined  depend  on  the  selection  of  the  observed  flight  path  section.  The  equa¬ 
tions  in  Section  4  thus  only  permit  a  rough  estimate  of  the  order  of  magnitude  of  the 
elements  of  the  system  noise  matrix  that  correspond  to  the  non-model led  sensor  error 
coefficients. 


4.3  THE  ADJUSTMENT  OF  THE  SYSTEM  NOISE  MATRIX  Q  FOR  THE  3  DIFFERENT  FLIGHT  PATHS 

SELECTED 

The  adjustment  of  the  system  noise  matrix  Q  for  the  fixed  gyro  drift  not  included 
in  the  error  model  was  carried  out  in  accordance  with  Eq.(46).  Despite  the  fact  that 
they  were  non-stationary  the  expectation  values  of  the  sensor  signals  were  calculated 
so  that  sensor  errors  depending  on  the  dynamic  environment  of  the  SDS  can  also  be  in¬ 
cluded  in  the  system  noise  matrix.  The  sensor  signals  and  the  angular  accle:  ations  varied 
between  the  listed  values.  Mean  values  were  assumed  for  the  expectation  values  in  order 
to  make  a  rough  estimate  for  the  elements  of  the  system  noise  matrix  g  possible.  The  re¬ 
sults  are  listed  in  Tab.  2: 


flight  path  1 

0.4  °/s  <  a  <  0.6  °/b 

% 

°a  ■  0 

o  «  0.5  °/s 
“b 

a.  s  0. 

“b 

flight  path  2 

0.9  °/s  <o  <4.84  °/s 

“b 

2.3  °/fi2<  g.  <  8.2  <7s2 
*b 

o  •  3  °/o 
"b 

o.  &  5.3  °/s2 
"b 

flight  path  3 

0.5  °/s  <o  <  5.2  °/s 
“b 

0.2  °/e2  <  g.  <22  °/s2 
% 

g  .  2.8  °/a 

% 

g.  ,11  °/s2 
“b 

Tab.  2:  Standard  deviation  for  the  sensor  signal# 

For  the  fixed  scale  factor  error  Du«3,lo~5  and  the  anisoertla  term  D*w»4 *lo~*s  the 
corresponding  element#  of  the  system  noise  matrix  g  are  listed  in  Tab.  3: 


q^/O^d/s)2 

q^/D^P/#]2 

flight  path  1 

5  •  10-" 

- 

flight  path  2 

6  »  10“9 

3.3  •  10"6 

flight  path  3 

5.2  •  Uf* 

1.4  •  10“S 

Tab.  3i  Elements  of  the  matrix  g  corresponding  to  the  sensor  error  coefficients 

4.3.1  THE  CALCULATION  OP  ELEMENTS  OF  THE  SYSTEM  NOISE  MATRIX  g  FOR  FLIGHT  PATH  1 

S  different  error  models  will  again  be  considered.  The  non -mode lied  sensor  error 
coefficients  in  each  case  are  interpreted  as  system  noise. 

Error  model  1/6 

u  no  sensor  error  coefficients  taken  into  account 
0  »  f  (all  sensor  error  coefficients) 

Thus,  In  this  error  model  1/6  all  the  main  sensor  error  coefficients  are  Interpreted  as 
system  noise.  It  follows  from  Eq.(46,  47)  and  Tab.  2  that 


qje  =  10-11/s  and  q^=  2.5  •  10~7(§>2/s. 

Error  model  1/7 

Here,  in  contrast  to  error  model  1/6,  the  fixed  gyro  drift  was  modelled.  This  results  in 

q€€  =  10-1 2/8  and  4v=  2*5  •  10”7(f)2- 

Error  model  1/8 

Here,  switching  takes  place  between  the  fixed  gyro  drift  and  the  gyro  errors  d“,  c?  ,  e,  . 

-•’7  m  0  1 

For  this  error  model  1/8  the  elements  q  were  taken  as  zero.  For  q  =2.5*10  (~)  /s  holds. 

ee  w  8 

Error  model  1/9 

The  error  model  1/9  is  basically  similar  to  1/8.  However,  here  the  non-modelled  sensor 
error  coefficients  are  considered  as  system  noise.  It  thus  follows  for  the  elements  of 
the  noise  matrix  g  that 

qe£  -  7  •  10_13/s  and  qw  »  2.5  •  lo“7(|)2/s. 

Error  model  1/10 

This  error  model  serves  as  a  reference,  i.e.  all  the  sensor  error  coefficients  are  mo¬ 
delled. 

Fig.  11  contains  the  time  histories  of  the  la-values  for  the  component  of  the  east 
position  error.  According  to  this,  the  models  1/7  to  1/10  exhibit  a  largely  identical  re¬ 
sult,  i.e.  the  axes  misalignments  and  the  fixed  scale  factor  error  of  the  gyros  as  well 
as  the  accelerometer  bias  can  be  interpreted  with  sufficient  accuracy  as  system  noise  for 
this  particular  flight  path  1.  However,  the  situation  is  different  in  the  case  of  model 
1/6,  which  does  not  model  any  sensor  error  coefficients.  Although  on  the  whole  sufficient 
agreement  with  the  reference  model  1/10  can  be  obtained,  deviations  up  to  approx.  +  500  m 
do  occur  in  the  middle  of  flight  path  1. 

4.3.2  THE  CALCULATION  OF  ELEMENTS  OF  THE  SYSTEM  NOISE  MATRIX  Q  FOR  FLIGHT  PATH  2 

I 

Once  again  5  different  error  models  are  considered. 

Error  model  2/6 

All  the  sensor  error  coefficients  were  interpreted  as  system  noise.  This  leads  to 
qJE  -  2.4  *  10“10/8  and  q2y  =  10“7<|)2/8. 

Error  model  2/7 

The  fixed  gyro  drift  only  is  modelled.  The  remaining  sensor  error  coefficients  result  in 
q2e  -  2.35  *  10"l0/s  and  q^  -  10"7(|)2/s. 

Error  model  2/8 

In  error  model  2/8  switching  takes  place  between  the  fixed  gyro  drifts  and  the  gyro 
errors  ,  Ojf  ,  D4".  The  elements  ql  are  set  to  zero  and 

<£v  -  10"7(S)2/s. 

Error  model  2/9 

This  is  basically  the  same  as  error  model  2/8,  however,  all  the  non-modelled  error  con¬ 
tributions  are  included  in  the  system  noise  matrix.  This  leads  to 

q*t  -  to"12  i/s  and  q^  -  lo"7(|l2/s. 

Error  model  2/10 

This  is  the  reference  model  again  including  all  sensor  error  coefficients  as  state  vari¬ 
ables. 

Fig.  12,  which  contains  the  time  histories  for  the  lo-valueo  of  the  angular  mis¬ 
alignment  s0,  shows  clearly  the  difficulties  Involved  in  the  interpretation  of  dominant 
sensor  error  coefficients  as  an  unbiased  noise.  The  anisoertia  term  not  contained  in  er¬ 
ror  models  2/6  and  2/7  leads  to  results  which  deviate  considerably  from  the  reference 
values.  The  elements  of  the  system  noise  matrix  0  can  be  adjusted  according  to  whether 
agreement  with  the  reference  values  is  to  be  achieved  at  the  beginning,  in  the  middle 
or  at  the  end  of  flight  path  2.  The  results  for  the  error  models  2/8  and  2/9  represent 
the  actual  error  limits  sufficiently  well  because  no  dominant  senor  error  coefficient 
is  Interpretated  as  system  noise. 

4.3.3  THE  CALCULATION  OF  ELEMENTS  OF  THE  SYSTEM  NOISE  MATRIX  £  FOR  FLIGHT  PATH  3 
Again  a  total  of  5  different  error  modela  are  considered. 

Error  model  3/6 

All  the  sensor  error  coefficients  are  taken  into  account  in  the  aysteo  noise.  This  leads 
to 

qjc  -  1.2  •  I0“10/s  and  qjy  •  2.5  •  10“7(|)2/s. 


Error  model  3/7 

Only  the  fixed  gyro  drift  is  modelled.  Therefore 

qL  “  10"10/s  and  =  2.5  •  lO~7(S)2/s. 

Error  model  3/8  o  o  o 

Now  sensor  error  models  are  switched  between  D°,  D°,  D°  and  d“,  d“,  Dfi“.  The  elements  q2 
are  set  to  zero  and  x  y  z  x  z  ee 

qJv  =  2.5  *  10“7(^)2/s. 

Error  model  3/9 

This  error  model  corresponds  to  model  3/8,  however,  all  the  non-modelled  sensor  error 
coefficients  are  included  in  the  system  noise  matrix.  This  results  in 

q2  =  10~12/s  and  q2  =  2.5  •  10”7(2)2/s. 
ee  vv  s 

Error  model  3/10 

This  is  the  reference  model  again.  It  is  identical  to  models  1/1o  and  2/10. 

Fig.  13  contains  the  time  histories  of  the  1o -values  for  the  north  position  error. 
Error  models  3/6  and  3/7,  which  do  not  include  the  dominant  sensor  error  coefficient  DA“, 
give  only  an  inaccurate  representation  of  the  reference  values.  The  modelling  of  DA“ 
alone  leads  to  deviations  of  approx.  +  800  m.  The  Ic-values  of  the  reference  model  3/10 
can  be  approximated  with  a  deviation  of  approx.  +  400  m  by  using  error  3/9,  which  takes 
account  of  additional  sensor  error  coefficients  in  the  system  noise  matrix  Q. 

4.4  THE  ADAPTIVE  KALMAN  FILTERING  ALGORITH 

It  was  shown  in  Section  4.3  how  some  of  the  sensor  errors  can  be  modelled  or  taken 
into  account  in  the  system  noise  matrix  Q.  However,  there  remain  a  number  of  sensor  error 
coefficients  that  can  be  effective  at  system  level  when  certain  manoeuvres  are  performed. 
These  sensor  error  coefficients  can  lead  to  a  divergence  of  the  Kalman  filter  algorithms. 
The  cause  of  this  divergence  lies  in  an  over-optimistic  covariance  matrix  P,  which  re¬ 
duces  the  elements  of  the  gain  matrix  K  and  thus  -  roughly  speaking  -  ignores  the  measure¬ 
ments  being  received.  In  order  to  avoid  this  divergence,  the  system  noise  matrix  Q  can 
frequently  be  increased  to  a  suitable  extent,  though  this  lowers  the  level  of  system  ac¬ 
curacy  drastically.  In  the  case  of  SDS,  the  calculation  of  constant  system  noise  matrices 
Q  must  allow  for  the  worst  dynamic  environment  of  the  SDS.  For  example,  a  large  number  of 
sensor  error  coefficients  effective  during  a  short  manoeuvre  are  able  to  initiate  the 
divergence  of  the  Kalman  filter  algorithms. 

It  is  far  easier  to  solve  this  problem  by  using  variable  system  noise  matrices  Q. 

This  leads  to  matrices  Q  dependent  on  the  actual  manoeuvres  flown.  The  calculation  of 
the  elements  of  such  a  variable  system  noise  matrix  2  can  be  performed  by  means  of  the 
so-called  adaptive  Kalman  filtering  taking  into  account  the  statistics  of  the  filter  re¬ 
siduals.  The  relationship  between  the  expectation  values  of  the  measurement  z,  the  co- 
variance  matrix  R  for  the  measurement  errors  of,  for  example,  a  radar  unit,  the  measure¬ 
ment  matrix  g  an§  the  unknown  noise  matrix  Q  can  be  formulated  as  follows: 

E[ (z-|x') • (z-Hx')T] (k)  =  g(k)  +H[P'(k)  +g(k)]HT.  (50) 

The  estimates  ft  and  the  covariance  matrix  P  correspond  to  the  predicted  expressions 
ft' ,  P1  via 


x' (k)  «  |(k)  S(k-1)  (51) 

g'(k)  =  | (k)  P(k)  |T(k)  +  Q (k)  (52) 

The  filter  residuals  (z-Hx')  in  many  adaptive  filtering  procedures  are  used  as  the 
basis  of  a  statistical  analysis  / 1 3 ,  14,  15,  16,  17,  18/.  In  visual  terms  Eq.(50)  means 
that  the  differences  between  measured  state  variables  and  predicted  state  variables  are 
described  statistically  by  the  corresponding  covariance  matrices  R,  P  and  g.  The  result 
is  a  variable  system  noise  matrix  g  or  a  corresponding  effect  on  the^gain  matrix  §. 

From  the  large  number  of  possible  adaptive  filtering  algorithms  the  one  based  on 
the  work  of  JAZWINSKI  /1 3,  14/  was  selected  because  it  requires  a  particularly  small 
number  of  calculations  and  is  thus  especially  suitable  for  real-time  applications.  In 
this  algonttun  the  calculation  of  the  expectation  values  of  the  filter  residuals  takes 
place  on  the  basis  of  n  measurements  backward  in  time  history.  If  this  expectation  values 
do  not  correspond  to  the  sum  of  the  covariance  matrices  P  and  g,  this  can  be  compensated 
for  via  the  noise  matrix  g. 

Aoaording  to  /1 4/,  for  a  matrix  B 

§  -  E  [(z-gx1)  (z-gx')T]  -  R  -  gP'gT  (53) 

the  following  criterion  applies: 

V  0  -  a  •  1  or  V  0  *  fi  -  0 


(54) 


By  use  of  this  relatively  simple  algorithm  the  system  noise  matrix  Q  continuously 
adapts  itself  to  the  predicted  statistics  of  the  Kalman  filter.  However, “only  at  the 
level  of  the  measurements  can  the  elements  of  the  matrix  Q  be  calculated  by  this  simple 
algorithm.  An  extension  of  the  adaptive  Kalman  filtering  In  order  to  calculate  all  the 
elements  of  the  matrix  Q  increases  the  computational  burden  drastically. 


5.  DISCUSSION  OF  RESULTS 

5.1  THE  RESULTS  OBTAINED  FROM  AN  ADAPTIVE  FILTER  WITH  SWITCHABLE  SENSOR  ERROR  MODELS 

The  discussion  of  the  results  based  on  low-order  error  models  within  an  adaptive  filter 
assumes  external  measurements  and  a  knowledge  of  the  estimation  errors.  Both  conditions 
can  be  fulfilled  in  the  case  of  simulated  data.  The  known  reference  flight  paths  1  and  3 
are  interpreted  as  external  measurements  and  the  system  and  sensor  errors  whose  values 
are  known  from  the  simulation  are  suitable  for  checking  the  estimations  of  the  adaptive 
filtering. 

5.1.1  THE  RESULTS  FOR  FLIGHT  PATH  1 

Fig.  14  contains  for  flight  path  1  the  time  histories  of  the  parameter  L  defined 
as  follows 


rms  of  |oj|  <  1  °/s  *  L  =  1  or  rms  of  |e|  >1  °/s  -*■  L  =  2  (55) 

In  the  event  of  manoeuvres  the  sensor  error  models  were  exchanged  in  accordance 
with  the  criterion  for  switching.  If  the  flight  path  corresponds  to  a  straight  line,  the 
error  behaviour  is  described  by  the  error  model  which  only  contains  the  fixed  gyro  drifts. 

Fig.  15  is  based  on  error  model  1/9  and  represents  the  time  histories  of  the  esti¬ 
mation  errors  and  the  corresponding  lo-bands  for  the  fixed  gyro  drift  D°.The  accuracy  can 
can  be  improved  to  about  0.005  °/h.  However,  these  extreme  levels  of  accuracy  require  a 
period  of  observation  of  approx.  60  min.  The  estimation  errors  reach  an  accuracy  level 
of  0.002  °/h.  If  one  compares  the  time  histories  of  the  estimation  error  and  the  corres¬ 
ponding  lo-band,  it  becomes  clear  that  the  self-diagnosis  of  the  filter  produces  values 
that  are  over-pessimistic  by  about  the  factor  2.  The  reason  for  this  effect  lies  in  the 
interpretation  of  the  non-modelled  sensor  error  coefficients  as  system  noise. 

Fig.  16  indicates  especially  well  the  existence  of  sensor  errors  that  are  effec¬ 
tive  at  system  level  but  non-modelled.  It  contains  the  elements  of  the  system  noise  matrix 
calculated  by  the  adaptive  filter.  An  increase  in  the  elements  to  values  up  to  1 . 3  m/s  can 
be  seen  in  the  section  where  manoeuvres  were  performed.  This  is  a  result  of  large  devia¬ 
tions  between  the  reference  and  the  modelled  error  behaviour  of  a  SDS  in  the  case  of 
manoeuvres.  The  take-off  and  the  first  turn  are  not  visible  in  Fig.  16  because  the  co- 
variance  matrix  P  is  still  decreasing  from  its  initial  value  and  therefore  the  increase 
in  the  filter  residuals  during  these  manoeuvres  does  not  affect  the  calculation  of  the 
adaptive  system  noiBe  matrix. 

5.1.2  THE  RESULTS  FOR  FLIGHT  PATH  3 

Fig.  17  shows  how  the  frequent  and  extreme  manoeuvres  of  flight  path  3  cause  a 
large  number  of  switching  procedures  between  tho  sensor  error  models  2/9. 

Fig.  18  contains  the  time  histories  of  the  estimation  error  and  the  la-band  for 
the  anisoertia  term  of  the  gyros.  Tho  assumed  accuracy  at  the  beginning  of  ♦  2*10'J  s 
can  be  improved  to  approx.  +  2*lo-t  s.  The  procedure  of  switching  can  clearly  be  recog¬ 
nized  in  the  shape  of  the  lo-band  since  of  course  it  is  only  possible  to  obtain  an  esti¬ 
mate,  and  thus  an  improvement  of  the  covariance  aatrx  P,  if  tho  error  modol  L«2  is  used 
for  filtering. 

Fig.  19  shows  for  flight  path  3  that  it  is  only  at  the  end  of  the  tiraa  histories 
of  the  system  noise  matrix  Q  that  the  filter  residuals  exceed  the  limits  given  by  tho 
covariance  matrices  R  and  g. 

5.2  THE  RESULTS  OBTAINED  FROM  AN  ADAPTIVE  KALMAN  FILTER  WITHOUT  THE  MODELLING  OF 

SENSOR  ERRORS 

The  aim  of  switching  between  various  error  models  was  to  give  a  sufficiently  ac¬ 
curate  mathematical  description  of  the  error  behaviour  of  a  SDS,  although  low-order  sen¬ 
sor  error  models  were  used.  The  success  of  these  techniques  depends  on  the  following  con¬ 
ditions,  hoveverj 

-  It  must  be  possible  to  divide  up  a  flight  path  Into  sections  of  different  structure  on 
tho  basis  of  suitable  criteria  and 

-  only  a  few  sensor  errors  are  effective  at  system  level  in  each  of  these  sections. 

The  validity  of  these  conditions  was  examined  in  Section  3  on  the  basis  of  covari¬ 
ance  and  error  analysis  for  flight  paths  1,  2  and  3.  In  the  case  of  arbitrary  flight  paths 
which  are  subject  only  to  the  limits  imposed  by  the  aircraft's  specifications  it  may  hap¬ 
pen  that  the  conditions  referred  to  above  can  no  longer  be  adhered  to  with  sufficient  ac¬ 
curacy.  A  possible  technique  in  this  case  is  to  take  account  of  all  the  sensor  errors  In 
the  system  noise  matrix  Q. 


5.2.1  THE  RESULTS  FOR  FLIGHT  PATH  1 


Fig.  20  contains  the  time  histories  of  the  estimation  error  and  the  corresponding 
la-band  for  the  angular  misalignment  e0  with  respect  to  flight  path  1.  The  results  are 
based  on  the  switchable  sensor  error  model  1/9,  which  has  already  been  discussed.  The 
estimation  error  and  the  lo-values  show  a  sufficient  correspondence. 

Fig.  21  contains  the  estimation  errors  and  lo-band  for  the  angular  misalignment  eD 
once  more.  However,  all  the  sensor  error  coefficients  are  included  in  the  system  noise 
matrix  g.  It  can  be  seen  that  the  filtering  leads  to  far  too  optimistic  values  for  the 
lo-band,  and  Eq. (18)  is  violated. 

5.2.2  THE  RESULTS  FOR  FLIGHT  PATH  3 

The  problems  that  have  just  been  discussed  are  shown  for  flight  path  3  in  Fig.  22. 
Whereas  the  various  estimates  for  the  angular  misalignment  eD  might  still  be  acceptable 
for  flight  path  1,  the  situation  with  flight  path  3,  with  its  frequent  and  considerable 
manoeuvres,  is  different:  here  extreme  deviations  from  the  reference  error  behaviour 
arise  if  the  sensor  error  coefficients  are  only  taken  into  account  in  the  system  noise. 

In  Fig.  23  the  corresponding  results  are  shown  for  sensor  error  model  3/9  which 
can  be  switched  between  the  dominant  error  sources.  The  comparison  between  the  results 
shown  in  Fig.  22  and  Fig.  23  demonstrates  clearly  the  advantage  of  the  sensor  error  mo¬ 
del  3/9  for  flight  path  3. 

5.3  THE  RESULTS  IN  THE  CASE  OF  INTERRUPTED  EXTERNAL  MEASUREMENTS 

The  results  discussed  so  far  are  based  on  external  measurements  that  were  available 
all  the  time.  However,  if  a  radar  unit  or  a  MLS  station  fails  or  the  aircraft  is  too 
far  away  from  such  a  station,  for  exemple,  it  becomes  especially  important  to  have  a 
modelling  of  the  real  SDS  error  behaviour  which  is  as  accurate  as  possible,  since  it  is 
necessary  to  predict  the  errors  according  to  the  error  equations  all  the  while  the  ex¬ 
ternal  measurements  are  interrupted. 

5.3.1  THE  RESULTS  FOR  FLIGHT  PATH  1 

Figs.  24  and  25  show  the  time  histories  of  the  estimation  errors  and  the  corres¬ 
ponding  lo-band  for  the  angular  misalignment  eq  in  the  event  of  external  measurements 
being  interrupted  after  40  minj  two  error  models  are  compared.  Fig.  24  is  based  on  the 
switchable  error  model  1/9,  which  has  already  been  discussed.  In  Fig.  25  the  correspond¬ 
ing  results  are  given  for  an  error  model  which  contains  no  sensor  error  coefficients. 

Here,  the  faulty  self-diagnosiu  of  the  filter  can  be  seen.  If  sensor  error  model  1/9 
is  used,  a  very  small  estimation  error  is  obtained. 

The  errors  at  the  angular  level  lead  to  position  errors  according  to  the  error 
model  used.  Figs.  26  and  27  represent  the  results  for  these  components.  The  following 
results  can  be  achieved: 

4^  ■  -300  m  +  600  m:  error  model  1/9 

SXjj  “  -2000  m  +  500  mt  error  model  1/6 

Thus,  the  use  of  error  models  without  senscr  errors  leads  to  far  lower  levels  of 
accuracy  even  in  the  case  of  flight  path  1  which  contains  fewer  manoeuvres. 

5.3.2  THE  RESULTS  FOR  FLIGHT  PATH  3 

Fig,  26  shows  the  estimation  error  and  the  la-band  for  the  angular  misalignment  c0 
for  flight  path  3.  After  20  min  the  external  measurements  are  lost.  Based  on  the  error 
model  3/9  an  accuracy  of  about  0,1  0  is  reached  at  the  end  of  the  flight  path  3,  the 
estimation  error  is  less  than  0.01  °. 

Fig.  29  Is  based  on  a  sensor  error  model  which  interprets  all  the  sensor  errors 
as  system  noise.  Although  the  self-diagnosis  of  the  filter  gives  a  value  of  approx.  0,02°, 
the  actual  estimation  error  is  approx.  0.2  i.e.  a  serious  violation  of  Eq.(18)  occurs. 

Figs.  30  and  31  compare  the  results  of  the  two  error  models  for  the  position 

error  components.  Whereas  the  switchable  error  model  3/9  procudes  la-values  of 
approx. 11300  »  after  the  loss  of  the  external  measurements  (Fig.  30),  the  error  model 
without  sensor  errors  gives  lo-values  of  approx.  +1500  m,  although  the  estimation  errors 
are  aa  much  aa  -5000  m  (Fig.  31). 


6.  SUMMARY  AND  CONCLUSIONS 

The  basic  problems  in  the  design  of  error  models  for  aided  SDS  is  the  need  to  provide, 
in  s  low-order  error  model,  a  sufficiently  realistic  mathematical  description  of  a  large 
number  of  sensor  error  coefficients  which,  depending  on  the  manoeuvres  flown,  can  be  ef¬ 
fective  at  system  error  level.  Three  different  flight  paths  were  used  to  discuss  the  ty¬ 
pical  ssnsor  and  system  error  behaviour  of  a  SDS  and  to  develop  corresponding  error  mo¬ 
dels.  If  the  flight  path  can  be  divided  up  into  sections  with  different  structures  where 


only  a  tew  sensor  emut,  me  __  _., _ 

sufficiently  good  description  of  the  real  system  error  behaviour  by  switching  between 
various  low-order  sensor  error  models. 

In  the  case  of  sensor  errors  that  cannot  be  modelled  although  they  are  effective  at 
system  error  level,  the  system  noise  matrix  Q  was  correspondingly  enlarged.  This  was  done 
on  the  one  hand  by  giving  rough  estimates  of  the  sensor  error  effects  not  included  in  the 
error  model,  and  on  the  other  hand  by  using  adaptive  Kalman  filtering,  which  makes  it 
possible  to  calculate  the  elements  of  the  system  noise  matrix  §  on  the  basis  of  a  sta¬ 
tistical  analysis  of  the  filter  residuals. 

Finally  the  results  of  the  calculations  obtained  from  simulated  flight  paths  were 
discussed  for  various  error  models  with  or  without  the  modelling  of  sensor  error  coeffi¬ 
cients,  and  also  when  external  measurements  were  interrupted. 

The  results  in  the  case  of  the  loss  of  external  measurements  are  of  particular  in¬ 
terest  since  here  increased  demands  are  made  on  an  error  model  for  a  SDS. 

If  the  consideration  of  real-time  conditions  means  it  is  only  possible  to  model  a 
few  sensor  errors,  then  it  is  necessary  to  find  a  suitable  method  for  including  in  the 
system  noise  matrix  those  sensor  and  algorithm  errors  which  are  not  modelled  but  are 
effective  at  system  error  level.  The  calculation  of  fixed  matrices  Q  proved  to  be  inade¬ 
quate  because,  with  the  exception  of  constant  bias,  all  the  sensor  irrors  have  to  be  re¬ 
garded  as  being  dependent  on  the  particular  dynamic  environment  of  the  SDS. 

If  low-order  error  models  are  used  for  flight  paths  with  extreme  manoeuvres,  in  which 
a  large  number  of  sensor  errors  can  be  effective  at  system  level,  it  is  necessary  to 
adapt  continuously  the  system  noise  matrix  to  the  predected  statistics  of  the  Kalman  fil¬ 
ter.  The  problem  of  adaptive  Kalman  filtering  lies  in  the  need  to  develop  efficient  al¬ 
gorithms  for  the  statistical  analysis  of  the  filter  residuals.  The  analysis  of  the  filter 
residuals  should  provide  all  the  elements  of  the  system  noise  matrix  Q. 
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USE  OF  FILTERING  AND  SMOOTHING  ALGORITHMS 
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The  increasing  complexity  of  modern  weapon  systems  demands  corresponding  in¬ 
creases  in  the  sophistication  of  the  approaches  used  to  test  these  systems.  The  follow¬ 
ing  two  chapters  discuss  advanced  techniques  for  the  processing  of  missile  system  test 
data.  In  the  first  chapter,  data  from  multiple  references  are  used  in  a  post-flight 
analyzer  that  is  based  in  large  part  on  a  smoothing  algorithm.  The  objectives  of  the 
processing  are  to  obtain  the  best  estimate  of  overall  system  accuracy  and  to  recover  the 
maximum  information  about  individual  guidance  system  error  contributors.  In  the  second 
chapter,  a  procedure  for  validating  the  models  used  in  filtering  and  smoothing  algorithms 
is  presented.  The  precedure  checks  model  validity  using  data  from  multiple  system  tests. 
It  employs  well-known  statistical  hypothesis  testing  methods  in  an  innovative  manner. 

INTRODUCTION 

Modern  weapon  systems  --  particularly  ballistic  missiles  --  have  grown  in  com¬ 
plexity  by  a  significant  amount  over  the  past  20  to  30  years.  Designers  and  developers 
now  concern  themselves  with  the  total  system  aspect  of  missile  development.  Obtaining 
a  broad  system- level  understanding  of  the  missile  and  its  environment  has  become  vital. 

Increasingly,  system-level  understanding  is  supported  by  modern  analytic  methods 
including  applications  of  filtering  and  smoothing  theory.  Advances  in  the  theory  have 
taken  place  in  concert  with  weapon  system  development.  Two  inter-related  elements  of  the 
modern  analytic  approach  are  system  modeling  and  system  testing-  Models,  which  are  mathe¬ 
matical  representations  of  the  physical  characteristics  of  a  system,  have  a  number  of  uses 

Models  for  the  errors  associated  with  various  system  components  and  subsystems 
are  formulated  and  then  combined  to  create  an  overall  weapon  system  error  model.  Such  a 
model  ean  be  used  to  generate  performance  projections  (e.g.  ,  weapon  system  accuracy)  even 
before  the  weapon  system  has  been  built  and  deployed.  Parameters  of  the  model  can  also 
be  varied  about  the  nominal  values  to  illustrate  where  the  overall  system  is  most  sensi¬ 
tive  to  variations  in  subsystem  performance.  In  this  way,  critical  elements  in  the  sys¬ 
tem  design  ean  be  identified  and  given  extra  attention  in  the  development  effort. 

Prior  to  testing,  the  weapon  system  error  model  can  be  exercised  to  determine 
how  well  a  proposed  test  procedure  supports  understanding  of  weapon  system  behavior. 
Knowing  the  ability  of  a  given  test  to  isolate  key  system  performance  characteristics 
is  a  valuable  aid  in  test  program  management.  Once  testing  has  been  performed,  data  are 
available  to  support  validation  of  the  various  models.  Quite  often,  conditions  existing 
in  the  test  environment  must  necessarily  be  different  than  those  which  would  exist  in  the 
operational  environment.  Models  provide  the  mathematical  bridge  which  enables  accurate 
extrapolation  from  performance  under  test  conditions  to  performance  under  operational 
conditions. 

Finally,  models  for  system  behavior  are  the  vehicle  which  supports  development 
of  next-generation,  advanced  systems.  By  efficiently  characterizing  system  behavior, 
models  serve  as  the  building  blocks  for  future  system  design  activities.  In  many  ways, 
they  represent  the  "legacy"  of  a  system  development  program. 

The  keys  to  overall  weapon  system  modeling  are  proper  formulation  of  component 
and  subsystem  models,  sufficient  testing  to  obtain  data  which  are  representative  of  all 
system  characteristics  and  a  reliable  model  validation  procedure  to  ensure  that  the  math¬ 
ematical  representation  corresponds  faithfully  to  the  actual  system. 

The  two  chapters  which  follow  arc  closely  related.  They  are  both  involved  with 
the  application  of  model-based  modern  analytic  methods  to  ballistic  missile  development. 

The  first  is  concerned  with  the  Hinuteman  III  flight  test  program.  In  particu¬ 
lar,  the  chapter  “discusses  the  use  of  external  reference  information  to  enable  determina¬ 
tion  of  specific  error  characteristics  which  make  up  the  model  for  the  missile  guidance 
system.  It  is  shown  that  the  ability  to  identify  elements  of  the  model  is  strongly  in¬ 
fluenced  by  the  nature  of  the  available  reference  systems  and  the  chosen  test  plan.  An 
algorithm  based  on  a  Generalized  Likelihood  Ratio  (CLR)  test  is  seen  to  be  effective  in 
isolating  certain  nonlinear  error  phenomena,  provided  that  adequate  reference  data  are 
available. 

The  second  chapter  is  concerned  with  validation  of  the  models  for  multiple  phases 
in  the  operation  of  a  ballistic  missile  system.  A  procedure  based  on  statistical  hypothe¬ 
sis  testing  methods  is  presetted.  The  procedure  can  be  used  to  determine  whether  a  pro¬ 
posed  test  program  is  capable  of  isolating  bias  error  phenomena.  Alternatively,  when 
data  from  multiple  tests  have  been  collected,  the  procedure  provides  a  statistical  assess¬ 
ment  of  the  presence  of  bias  error  phenomena  in  the  system  being  tested. 


BALLISTIC  MISSILE  GUIDANCE  SYSTEM  EVALUATION 
USING  MULTIPLE  REFERENCES 


I .  INTRODUCTION 

A.  OVERVIEW 

Historically,  ballistic  missile  system  performance  evaluation  has  been  accom¬ 
plished  by  post-flight  processing  of  guidance  system  telemetry  and  radar  measurements. 

This  was  adequate  when  the  goal  of  system  testing  was  weapon  system  accuracy  estimation. 
However,  as  guidance  systems  became  more  accurate,  the  goal  of  system  testing  shifted  to 
the  characterization  of  guidance  system  errors  in  the  "operational"  environment,  and  it 
consequently  became  necessary  to  upgrade  range  instrumentation  quality. 

As  a  consequence,  instrumentation  and  post-flight  data  processor  development 
activities  aimed  at  upgrading  guidance  system  test  methodologies  were  instituted  [1-3]. 

This  chapter  focuses  on  the  test  programs  initiated  by  the  U.S.  Air  Force  in  1970  cul¬ 
minating  in  launch  of  the  last  Minuteman  III  Production  Verification  Missile  (PVM)  in 
March  1980.  Over  the  10  years,  a  series  of  programs  directed  at  improved  guidance  sys¬ 
tem  characterisation  and  advances  in  the  quality  and  availability  of  external  reference 
systems  were  initiated. 

Section  II  is  an  overview  of  the  USAF  test  programs.  In  Sections  III  and  IV, 
a  discussion  of  the  methodology  and  performance  evaluation  studies  undertaken  to  "opti¬ 
mize"  the  recovery  of  guidance  system  error  characteristics  is  presented.  Section  V  dis¬ 
cusses  a  data  processor  --  the  Post  Flight  Analyzer  (PFA)  --  developed  to  evaluate  data 
from  a  series  of  test  programs.  The  processing  of  the  flight  test  data,  and  the  associ¬ 
ated  results,  are  described  in  Section  VI.  Finally,  Section  VII  presents  a  summary  and 
suggests  enhancements  that  may  be  desirable  for  future  ballistic  missile  testing. 
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The  data  processing  methodology  described  in  this  chapter,  as  well  as  the  per¬ 
formance  evaluation  and  data  analysis  software,  is  the  result  of  a  dedicated  effort  by  a 
number  of  TASC  employees.  The  original  GLR  test  methodology  was  formulated  by  H.L.  Jones 
and  refined  by  K.S,  Tait.  The  performance  evaluation  software  was  developed  by  J.A. 
D'Appolito,  C.M.  Ermer,  L.M.  Hawthorne,  and  D.J.  Meyer.  C.J.  Vahlberg  and  D.J.  Meyer 
developed  the  data  processing  software. 

II.  BACKGROUND 

In  1971,  with  the  Minuteman  III  test  program  approximately  two- thirds  complete, 
a  significant  decision  was  made  by  the  USAF;  it  resulted  in  the  termination  of  system 
testing  at  the  Eastern  Test  Range  (ETR)  for  cost  reduction.  Prior  to  this,  approximately 
one-half  of  the  test  missiles  had  been  launched  at  the  ETR.  Subsequently,  flight  tests 
were  conducted  only  at  the  Western  Test  Range  (WTR) .  The  high  accuracy  tracking  capability 
and  the  good  geometry  characteriatics  available  at  the  ETR  were  sacrificed. 

The  guidance  analysis  community  began  to  recognize  the  limitations  of  the  tost 
methodology  in  operation.  The  level  of  accuracy  desired  from  the  test  program  could  not 
be  achieved  with  the  external  reference  systems  available  at  the  WTR.  Also,  the  quantiza¬ 
tion  levels  associated  with  guidance  telemetry  data  were  not  consistent  with  the  levels 
required  for  guidance  system  error  characterization.  The  USAF  begtn  to  look  at  alterna¬ 
tive  reference  systems  that  could  overcome  test  limitations. 

At  approximately  the  same  time  the  Charles  Stark  Draper  Laboratory  (CSDL)  was 
actively  developing  a  floated  inertial  reference  platform,  known  as  AIRS,  the  Advanced 
Inertial  Reference  Sphere  [91-  The  instruments  developed  for  AIRS  were  designed  to  pro¬ 
vide  system  quality  an  order  of  magnitude  better  than  the  primary  Minuteman  III  guidance 
system  (the  NS-20). 

The  AIRS  development  schedule  and  the  need  for  an  improved  reference  system  were 
nearly  coincident.  As  a  result,  the  USAF  instituted  the  Missile  Performance  Measurement 
System  (MPMS)  Program.  An  AIRS  platform  was  incorporated  in  a  separate  wafer*  along  with 
its  associated  electronics,  power  supplies,  telemetry  unit,  cooling  system,  etc.  The 
primary  guidance  system  computer  was  modified  to  allow  for  the  time-tagging  of  guidance 
system  outputs  (integrated  specific  force).  Time-tagging  was  a  means  of  eliminating  the 
impact  of  large  quantization  levels  on  the  recovery  of  guidance  system  errors;  it  reduced 
quantization- induced  errors  by  approximately  an  order  of  magnitude. 

Although  originally  planned  as  a  multi-missile  test  program,  only  one  MPMS  mis¬ 
sile  was  flown.  The  test  was  conducted  on  special  Test  Missile  No.  11  (STM-11)  on  15  July 
1976.  AIRS  functioned  well  and  much  was  learned  about  the  performance  of  the  AIRS  hard¬ 
ware  as  well  as  NS-20  instrument  error  characteristics.  However,  cost  considerations 
dictated  that  that  program  be  limited.  MPMS  led  to  the  FLY-2  Program  in  which  the  AIRS 
platform  was  replaced  by  a  second  NS-20. 


*Minutemar  III  is  capable  of  carrying  one  or  more  insertable  missile  body  sections 
(wafers)  between  the  fourth  stage  motor  and  the  payload  section. 


In  principal,  the  FLY-2  concept  would  appear  less  than  optimal.  In  most  test 
programs  it  is  desirable  that  the  measuring  device  be  an  order  of  magnitude  more  accurate 
that  the  system  being  tested.  However,  by  taking  advantage  of  the  NS-20  instrument  ori¬ 
entations  (Figs.  1  and  2)  and  the  fact  that  the  platforms  can  be  aligned,  in  azimuth,  to 
any  desired  orientation,  within  gimbal  constraints  (Fig.  3),  the  reliance  on  “identical" 
systems  can  be  minimized.  Thus,  for  FLY-2,  it  became  a  question  of  how  to  orient  the 
platforms  to  achieve  "optimal"  recovery  of  a  set  of  "primary  error  sources."  Optimiza¬ 
tion  study  results  are  discussed  in  Sections  111  and  IV. 

A  series  of  three  FLY-2  missiles  were  flown  between  November  1976  and  June  1977. 
Data  from  the  three  flights  provided  significant  insight  into  a  number  of  error  contrib¬ 
utors  not  previously  included  as  part  of  the  system  error  model.  However,  FLY-2  was 
limited  in  its  performance  assessment  capabilities  due  to  the  lack  of  an  independent 
reference  with  an  accuracy  superior  to  the  VTR  radars. 

This  shortcoming  was  resolved  with  the  introduction  of  a  GPS*  receiver  within 
the  FLY-2  wafer.  The  original  intent  of  the  GPS  Receiver  Test  Program  (GPS/RTP  or  FLY-2/ 
GPS)  was  to  demonstrate  that  the  receiver  could  provide  an  accurate  post-boost  update  in 
the  ballistic  missile  environment.  Since  it  was  projected  that  the  receiver  would  main¬ 
tain  iockt  on  the  signals  transmitted  by  the  satellites,  it  was  determined  that  the  range 
and  range-rate  measurements  should  be  used  for  post-mission  evaluation. 

Two  test  flights  were  performed  as  part  of  the  rtinuteaan  III  test  program.  Test 
missiles  PVM-18  and  PVM-19  were  launched  on  31  January  1980  and  27  March  1980.  The  pres¬ 
ence  of  an  accurate  external  reference,  in  addition  to  the  FLY-2  configuration,  provided 
the  best  data  collected  during  the  Kinuteman  III  test  program.  In  summary,  the  USAF  had 
pursued  a  course  that  led  to  ever- increasing  test  capability  without  having  to  make  major 
changes  to  an  overall  test  philosophy. 

III.  EVALUATION  METHODOLOGY 


The  methodology  used  to  "optimize"  FLY-2  (and  subsequently  PLY-2/GPS)  perform¬ 
ance  is  very  strongly  tied  to  the  objectives  set  forth  for  the  test  programs.  Thus,  it 
is  important  that  these  objectives  be  presented  and  the  test  program  goals  be  put  into 
perspective.  The  goals  of  FLY-2  were  to: 

•  Validate  the  effect  cm  system  accuracy  of  preflight  software  (Ground 
Program)  modifications  introduced  as  a  consequence  of  the  Guidance 
Improvement  Program* 

•  Increase  the  understanding  of  a  number  of  “priority"  error  sources 
included  in  the  guidance  system  error  model 


•  Detect  and  identify  “ unmode  led"  error  sources 

•  Identify  sources  of  anomalous  performance  using  the  unique  data 
characterise iesaval  1  s&T c  from!  he TCV - 2  instrumental  ion  system. 


The  same  goals  were  established  for  FLY- 2/CPS  with  the  additional  goal  of  demonstrating 
GPS  receiver  performance  in  the  “operational**  environment. 

The  priority  error  sources  arc:  1)  initial  aximuth  misalignment,  2)  accelerom¬ 
eter  cross-axis  compliance,  3)  gyro  g^  and  g^*  coefficients,  and  9)  platform  compliance. 
The  error  mechanisms  for  these  quantities  are  included  in  the  Minuteman  HI  guidance 
error  model.  However,  the  coefficients  typically  could  not  be  separated  during  static 
testing;*  sled  testing  does  not  provide  the  appropriate  dynamics  for  coefficient  observa¬ 
bility.  Thus,  these  quantities  may  be  “observed”  only  during  powered  flight,  The  goal 
of  FLY-2  and  FLY-2/GPS  waa  to  provide  data  to  assist  in  characterising  these  priority 
error  Sources. 


The  unmodelcd  errors  of  interest  can  actually  be  called  mlsmodclcd  errors. 

There  has  been  specula tTbn ~tK a t  certain  of  the  "bias"  error  coefficients  "have  time-varying 
characteristics  -  specifically  shifts  and/or  ramps.  This  type  of  error,  if  present  in 
the  guidance  system,  could  have  a  significant  impact  on  weapon  system  accuracy.  In  addi¬ 
tion  to  the  bias  shift/ramp  type  of  error,  additional  unmodeled  errors  include  gyro  g  - 


and  g^-sensitlve  error  coefficients.  These  "unmodelcd"  errors  could  be  addressed  using 
FLY-2  and/or  FLY- 2/CPS  data  --  once  suitable  models  had  been  developed  for  them. 


*SlbbaTToeitioning  System  -  A  satellite  navigation  system  being  developed  by  a  tri- 
service  Joint  Program  Office  (5). 

tExcept  during  the  staging  events  when  high  acceleration  rates  are  present  due  to  motor 
shutdown  and  startup. 

tOne  in  a  aeries  of  accuracy  upgrades  made  to  the  Nlnuteman  Ill  guidance  system  between 
1970  and  1976. 

(Static  testing  Implies  all  testing  in  a  1-g  field,  including  tumble  and  vibration 
testing. 


Finally,  sources  of  anomalous  performance  are  those  error  characteristics  that 
were  not  anticipated  but  had  been  discovered  as  a  result  of  data  analysis.  A  variety  of 
these  surfaced  during  the  test  programs.  However,  no  evaluation  studies  had  been  under¬ 
taken,  ahead  of  time,  to  assess  the  ability  to  detect  and  isolate  sources  of  anomalous 
performance . 

Based  on  the  objectives  discussed  above,  there  are  two  criteria  by  which  the 
test  program  may  be  assessed  (optimized):  1)  recovery  of  the  priority  error  sources  in 
a  post-mission  data  evaluation  environment,  and  2)  detection  and  identification  of  un¬ 
modeled  (or  improperly  modeled)  error  characteristics. 

The  USAF  test  program  objectives  called  for  post-mission  processing  of  uesi  data 
to  extract  information  about  the  priority  error  sources  and  the  "unmodeled”  errors.  It 
was  determined  that  the  processor  would  be  based  on  a  Kalman  filter  [6,7).  The  filter 
estimates  the  priority  error  sources  and  the  "unmodeled"  errors  incorporated  in  the  fil¬ 
ter,  to  some  level  of  confidence.  To  address  the  question  of  the  presence  of  instrument 
error  coefficient  shifts  and/or  ramps,  a  new  methodology  was  developed.  The  technique 
referred  to  as  the  Generalized  Likelihood  Ratio  (GLR)  test  is  a  direct  extension  of  the 
Kalman  filter.  The  GLR  tests  use  filter  residuals  to  determine  whether  there  are  any 
unmodeled  errors  (bias  shifts  and/or  ramps)  that  would  cause  the  residuals  to  be  other 
than  a  zero-mean,  white-noise  sequence. 

Figure  A  depicts  the  flow  of  data  through  the  post-flight  evaluation  software. 
The  filter  processes  the  radar,  dual  NS-2Q  guidance  telemetry  data,  and,  if  available, 

GPS  measurements  to  estimate  the  errors  in  the  filter  model.  The  filter  minimizes,  in 
a  mean-squared  error  sense,  the  error  between  the  actual  measurements  and  those  predicted 
by  the  model.  The  NS-20  error  model  assumes  that  the  principal  instrument  errors  are 
biases  over  the  period  of  powered  flight.  It  is  well  known  (6)  that  the  sequence  of 
measurement  residuals  will  be  a  zero-mean  white-noise  sequence  if  the  filter  models  are 
correct. 

However,  if  certain  of  the  error  coefficients  display  sudden  shifts  or  tamping 
characteristics,  the  model  is  not  correct  and  the  measurement  residuals  will  not  be  white 
and  zero-mean.  The  GLR  algorithm  tests  the  mean  and  whiteness  of  the  filter  residuals 
[8[.  The  test  is  a  two-step  process.  The  first  step  determines  whether  a  shift  in  one 
(or  more)  of  the  coefficients  has  taken  place,  This  is  referred  to  as  the  detection 
process.  Detection  is  performed  by  forming  a  weighted  sum  ot  the  lasst  H  measurement 
residuals*  and  using  this  quantity  as  a  test  statistic  in  a  binary  hypothesis  test.  If 
the  test  statistic,  f,  is  greater  than  a  specified  threshold,  c.  a  shift  is  detected,  if 
it  is  smaller  than  e.  no  shift  is  assumed  to  be  present. 

The  second  step  is  i dcnt_i.fi cation.  The  GUI  formulation  results  in  an  algorithm 
that  generates  "an  estimate  of  tHe  state  that  shifted,  the  time  of  the  shift,  and  the 
shift  magnitude.  Under  the  assumption  of  no  ajjr tor i  knowledge  about  the  jump  character¬ 
istic  the  GLR  estimate  is  "optimal."  The  capability  to  estimate  the  jump  characteristics 
makes  the  GLR  test  more  attractive  than  other  re si dual -based  detection  processes  19). 

The  critical  parameters  of  the  GLR  test  design  are  the  Probability  o{  False 
Alarm  (Pj.)  and  the  Probability  of  Detection  (P^).  The  Probability  of  False  Alarm  is  de¬ 
fined  as  the  probability  that  a  shift  will  be  detected  when  no  shift  occurs,  It  is  shown 
in  (8)  that  the  higher  the  value  of  «  selected,  the  lower  the  Probability  of  False  Alarm. 
However,  that  is  not  the  only  trade-off. 

The  Probability  of  Detection,  defined  as  the  probability  that  a  jump  (if  pres¬ 
ent)  will  be  detected,  is  a  function  of  the  shift  detection  threshold,  *,  and  the  window 
length,  M,  as  well  as  the  magnitude  of  the  jump  itself,  v  |8).  As  seen  in  Fig.  $,  the 
longer  the  detection  window,  the  higher  the  Probability  of  Detection  for  a  given  Pj.. 

However,  the  length  of  the  window  is  limited  by  the  missile  flight  time,  computational 
capabilities  of  the  hardware  used  for  post-flight  processing,  and  the  fact  that  multiple 
jumps  may  occur  during  the  flight. 

The  methodology  used  for  evaluat ion  of  the  FLY-2  and  FLY - 2/GPS  flight  test  pro¬ 
grams  is  depicted  in  Fig.  6.  The  same  error  covariance  analysis  procedures  were  used 
for  both  the  FLY-2  and  FLY-2/GPS  studies.  Only  the  FLY-2/CPS  simulation  is  discussed 
here. 

There  are  three  steps  involved  in  the  generation  of  projected  FLV-2/GS  perform¬ 
ance  estimates.  The  first  step  is  simulation  of  the  GPS  segment.  It  is  necessary  for 
two  reasons: 

•  To  develop  a  tine  history  of  GPS  satellite  orbital  positions  and 
velocities  so  that  proper  accounting  is  made  for  GPS/aissile  geometry 

•  To  generate  the  GPS  satellite  position,  velocity,  and  clock  cali¬ 
bration  error  covariances. 

In  Fig.  6,  the  FII.TKR  module  represents  the  second  step,  recursive  solution  of  the  fil¬ 
ter  error  covariance  propagation  and  update  equations.  These  equations  arc  solved  once 


*Tbe  quantity ~H  is  referred  to  as  the  GLR  detection  window  length. 


for  a  specific  GPS  satellite  measurement  schedule.  The  outputs  of  the  FILTER  module  are 
time  histories  of  filter-indicated  performance  and  the  Kalman  gain  matrices. 

The  third  step  in  the  evaluation  process,  the  SYSTEM  module  in  Fig.  6,  involves 
recursive  solution  of  the  linear  system  error  covariance  equations.  These  equations  are 
solved  repeatedly  to  produce  an  error  budget,  using  the  same  Kalman  gain  file  each  time. 
When  all  error  contributors  have  been  evaluated,  the  overall  measurement  system  perform¬ 
ance  projection  can  be  calculated  from  the  detailed  error-source-by-error-source  break¬ 
down.  This  analysis  produces  the  following  benefits: 

•  Determination  of  key  error  contributors  -  indica'ing  whe>-'  to  focus 
attention  for  subsequent  performance  improvements 

•  Identification  of  insignificant  error  contributors  -  indicating 
where  a  less  costly  (i.e.,  poorer  quality)  subsystem  might  be  sub¬ 
stituted  with  minimal  performance  degradation. 

The  methodology  presented  in  this  section  was  used  for  "optimization"  of  FLY-2/GPS  per¬ 
formance.  A  more  detailed  discussion  of  the  terror  covariance  methodology  can  be  found 
elsewhere  (5,101. 

The  FLY-2/GPS  erfo r  covariance  simulations  determine  the  Kalman  filter  estima¬ 
tion  error  covarisnc-c  matrix  based  on  a  sequence  of  measurements.  There  are  three  sets 
of  error  measurements  associated  with  the  GPS-RTP.  The  first  is  the  difference  between 
the  two  1HU  velocity  raeosuressents ,  the  second  set  of  error  measurements  are  those  asso¬ 
ciated  with  tha  raitge  radars,  and  the  third  set  results  from  processing  of  the  GPS 
measurements. 


The  error  sources  for  FLV-2/GPS  are  those  associated  with  the  two  IMUs,  the 
radar  and  the  GPS  satellites.  Table  1  lists  the  errors  modeled  for  each  of  the  iHUs  and 
selected  for  use  its  the  Filter  Model  and  Truth  (i.e.,  system)  Model  formulations. 

Table  II  lists  the  error  sources  associated  with  the  VTR  measurements .  Error 
sources  associated  with  the  GPS  satellite,  propagation  delays,  ths  missile  receiver  are 
given  in  Table  HI.  The  uncertainties  in  satellite  position,  velocity,  etc.  are  pro¬ 
vided  by  a  program  which  simulates  the  GPS  satellite  ground  tracking  process,  and  deter¬ 
mines  the  estimation  error  covariance  for  the  GPS  satellites.  The  propagation  errors 
and  carrier  and  code-loop  errors  listed  in  Table  III  are  modeled  as  white  measurement 
noise  sequences  in  the  simulation  and  are  not  estimated. 

IV.  PERFORHAHCK  PROJECTIONS 

FLY-1  (or  single  1MU  vs.  radar)  performance  was  evaluated  using  several  trajec¬ 
tories  that  emulate  nominal  missions  flown  from  Vandenberg  Air  Force  Base  tVAFft)  to  the 
Kwajalein  Atoll.  FLY-1  performance  was  developed  as  a  baseline  against  which  FLY-2  and 
FLY-2/0PS  performance  can  be  compared-  The  nominal  ground  track  and  specific  force  pro¬ 
files  tor  these  trajectories  arc  shown  in  Figs.  7  and  8.  respectively.  For  these  analy¬ 
ses,  the  aciauth  offset  angle,  <ka.  (Fig.  3)  is  assumed  to  be  erro.  Range  measurements 
from  the  South  Vandcnberg  Air  Force  Base,  Point  Hltgu.  and  Pillar  Point  radars  were  as¬ 
sumed  available  every  half-second  beginning  15,  40,  and  50  seconds  into  the  mission, 
respectively. 

ft.V-1  results  are  presented  in  terms  of  normalized  (unities*)  quantities  called 
recovery  ratios.  Two  types  of  recovery  ratios  are  of  interest:  1)  guidance  error  re¬ 
covery  ratio,  and  2)  error  „coef fit lent  recovery  ratio.  The  former  it  defined  as 

.  RMS  Error  In  Estimate  of  Guidance  Quantity 
SKs  ckiTdance  I r roFTfi'AbMnc* “dPTr icVi «*  U 


These  are  obtained  for: 

•  frown -Range ,  Cross-Track,  Vertical  Position  and  Velocity  Errors  at 
Reentry  Vehicle  Deployment 

a  Down-Range  and  Cross-Track  Miss  Distances 

•  CEP  | 11 ) 

•  Initial  Azimuth  Misalignment. 

Recovery  ratios  for  these  quantities  arc  always  less  than  or  equal  to  1.00;  the  smaller 
the  value,  the  better  the  recovery  of  the  error  of  interest. 

The  error  coefficient  recovery  ratio  (R^-)  is  the  ratio  of  the  final  rms  uncer¬ 
tainty  in  the  estimate  of  the  error  coefficient,  o  j. .  to  the  initial  rms  or  a  priori  un¬ 
certainty,  Oq.  That  is: 


(2) 
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Down-range  and  cross-track  guidance  error  recovery  ratios  for  FLY-1  are  given 
in  Table  IV.  Vertical  position  and  velocity  ratios  (not  shown)  are  essentially  identi¬ 
cal  to  the  down-range  numbers.  The  processing  of  combined  radar  and  NS-20  data  yields 
cross-track  error  reductions  of  40  to  50  percent.  Crosstrack  miss  distance  recovery  is 
essentially  equal  to  cross-track  velocity  error  recovery  because  cross-track  velocity 
error  at  boost-burnout  is  the  major  source  of  cross- track  miss.  Unfortunately,  process¬ 
ing  of  the  combined  radar  and  single  NS-20  data  does  not  produce  any  significant  improve¬ 
ment  in  down-range  (or  vertical)  guidance  error  estimation  as  a  consequence  of  radar  ac¬ 
curacy  and  geometry  relative  to  the  missile  trajectory.  The  Vandenberg  and  Point  Mugu 
stations  essentially  provide  only  down-range  information.  However,  the  NS-20  IMU  is  more 
accurate  in  the  determination  of  down-range  position  than  the  WTR  radars.  Thus,  the 
NS-20  "calibrates  the  down-range  radars."  Point  Pillar  provides  good  cross-track  infor¬ 
mation  and  is  the  source  of  recovery  for  these  errors.  The  reduction  in  error  of  the 
predicted  impact  point  (i.e.,  CEP  recovery)  results  solely  from  the  reduction  in  cross¬ 
track  miss  prediction  error. 

Priority  error  source  recovery  ratios  for  FLY-1  are  summarized  in  Table  V.  This 
table  lists  the  smallest  recovery  ratio  attained  for  a  given  coefficient  over  all  simu¬ 
lated  flights.  Since  the  radar  data  basically  yields  cross-track  information  only,  a 
20%  reduction  in  initial  azimuth  error  Is  attained.  However,  processing  of  FLY-1  data 
produces  no  significant  recovery  of  any  of  ~the  remaining  priority  error  sources. 

The  guidance  error  and  error  coefficient  recovery  ratios  are  excellent  measures 
of  flight  test  performance;  however,  considered  individually  they  are  too  numerous  to 
use  in  a  meaningful  optimization  criterion.  Furthermore,  there  is  no  single  flight  test 
configuration  which  simultaneously  minimizes  all  recovery  ratios  of  interest.  Instead, 
two  simple  measures  of  performance,  one  for  guidance  error  recovery,  and  a  second  for 
error  coefficient  recovery  were  developed. 

In  the  course  of  the  optimization  studies,  approximately  150  FLY-2  flights  were 
simulated.  Table  VI  lists,  for  each  priority  e'ror  source  the  best  (i.e.,  smallest)  co¬ 
efficient  recovery  ratio  attained  over  all  flights,  It  must  be  emphasized  that  no  one 
flight  simulation  yielded  all  these  results . 

If  a  particular  error  source  coefficient  strongly  influenced  the  error  behavior 
of  an  1HU,  simply  averaging  the  outputs  of  two  systems  (under  the  assumption  that  the 
error  sources  In  both  jys terns  are  equal  in  rms  value  and  uncorrelated)  would  reduce  the 
effect  of  that  error  source  on  system  error  by  a  factor  of  1/J2  or  0.71.  It  could  be 
argued  that  coefficient  recovery  ratios  greater  than  0.71  are  not  significant,  fable  VII 
shows  that  FLY-2  produces  no  significant  recovery  of  accelerometer  or  platform  compliance 
coefficients.  The  same  is  true  for  gyro  bias  and  g-dependent  drift  rates.  In  fact,  of 

all  the  priority  «*rror  source  coefficients  only  gyro  g^-  and  g^'-deper  lent  drift  rates 
are  recovered  at  a  significant  level.  For  these  coefficients  it  is  >nvenient  to  define 
a  composite  coefficient  recovery  ratio; 

RCQHP  =  J  tain  hi  *  min  R«B  *  “ln  RJ  *  “in  h]  (3) 


wh;.ro 


min  R  3  smallest  recovery  ratio  attained  for  a  given  coefficient  in  a 
given  run 

2 

SE,  6B  signify  gyre  g-dependent  drift  coefficients 

L 

J,P  signify  gyro  g  -dependent  drift  coeif icicnts 


RCQHP  RC.  ai<s  usc<1  ae*8Ures  FtY-2  performance  for  optimization  purposes. 

A  major  contem  was  selection  of  primary  and  secondary  NS-20  IMU  azimuth  offsets 
and  trajectory  reentry  angle  to  optimize  (i.e.,  minimize)  the  guidance  error  CEP  and  com¬ 
posite  coefficient  recovery  ratios.  Three-axis  velocity  difference  data  and  radar  track¬ 
ing  data  were  processed  every  4.5  seconds  throughout  the  boost  phase  using  a  Kalman  fil¬ 
ter  algorithm  to  estimate  guidance  errors,  Instrument  and  platform  error  coefficients, 
and  initial  alignment  errors. 


With  regard  to  azimuth  offset  Angle  (do)  optimization,  one  might  assume  that 
simultaneous  offset  ot  both  IMUs  is  desirable.  However,  for  the  optimization  criteria 
selected,  this  is  not  the  case.  To  illustrate,  results  of  two  test  cases  from  the  series 
of  medium  reentry  angle  studies  are  summarized  in  Table  VII.  The  flights  had  a  fixed 
azimuthal  difference  of  either  30  er  45  deg  between  the  two  systems.  However,  the  orien¬ 
tation  of  the  primary  guidance  system  (System  1)  was  varied  about  the  deg  orienta¬ 
tion.  The  composite  coefficient  recovery  ratio  and  the  CEP  recovery  ratio  for  these 
flights  are  also  presented. 

For  a  fixed  azimuthal  difference,  superior  recovery  always  occurs  when  one  sys¬ 
tem  is  launched  with  ten  offset.  This  was  observed  to  be  the  case  at  all  reentry  angles 
In  this  orientation  the  gyro  and  accelerometer  errors  contribute  the  least  to  guidance 
errors.  When  both  IMU*  are  offaet  so  that  their  Individual  contributions  to  guidance 
errors  are  comparable,  the  optimal  post- flight  data  processor  cannot  distinguish  between 
the  two  systems.  Thus,  recovery  ratios  are  poor.  Conversely,  when  one  system  it  placed 
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on-axis,  its  contribution  to  guidance  errors  is  greatly  reduced  and  the  errors  of  the 
off-axis  system  become  more  observable.  The  on-axis  IMU  becomes  the  reference  through 
which  errors  in  the  off-axis  system  are  recovered. 

Azimuth  angle  offset  optimization  studies  were  performed,  with  one  platform  al¬ 
ways  at  zero  offset,  for  low,  medium,  and  high  reentry  angles  with  results  shown  in  figs. 
9,  10,  and  11.  All  three  sets  of  results  are  quite  similar  and  demonstrate  that  initial 
azimuth  error  recovery  shows  little  variation  with  offset  angle.  This  is  because  much 
of  the  initial  azimuth  error  recovery  comes  from  radar  tracking  data  and  not  IMU  velocity 
difference  data. 

The  guidance  CEP  recovery  ratio  shows  some  variation  with  azimuth  offset  angle. 
However,  the  variation  is  not  pronounced.  Guidance  CEP  recovery  is  a  minimum  when  Aa  = 
180  deg,  i.e.,  when  the  level  platform  axes  for  the  two  systems  are  antiparallel.  How¬ 
ever,  actual  implementation  of  this  configuration  is  not  possible  due  to  guidance  system 
gimbal  constraints. 

Referring  again  to  Figs.  9  through  11,  the  composite  coefficient  recovery  ratio 
shows  the  greatest  variation  with  azimuth  offset  angle  of  any  of  the  recovery  ratios  con¬ 
sidered.  Furthermore,  for  all  reentry  angles  considered,  this  ratio  reaches  a  minimum 
with  a  30  deg  offset  of  the  secondary  IMUT  This  minimum  is  fairly  broad,  however,  pro¬ 
viding  low  composite  coefficient  ratios  in  the  range  of  22.5  to  45  deg.  Also,  the  com¬ 
posite  coefficient  recovery  ratio  plot  is  symmetric  about  zero  degree  offset  so  that  both 
positive  and  negative  offsets  are  useful. 

As  a  consequence  of  these  optimization  studies,  the  following  conclusions  were 

drawn: 

•  The  primary  guidance  system  should  be  aligned  to  the  target  azimuth 
(Aa  -  0  deg) 

ft  The  secondary  guidance  platform  should  be  offset  22.5  to  45  deg 
from  the  primary  guidance  system 

•  The  reentry  angles  for  FLY-2  should  be  in  the  middle  of  the  systems 
capability  range. 

The  first  two  recommendations  were  followed  on  all  three  FLY-2  flights.  However,  a  range 
of  reentry  angles  was  selected  so  that  specific  error  coefficient  recovery  could  be  em¬ 
phasized  rather  chan  minimization  of  the  composite  performance  index. 

Having  addressed  the  "optimization"  issues  associated  with  FLY-2,  it  is  possible 
to  assess  CLR  test  performance.  The  GLR  test  was  specifically  designed  to  detect  and 
identify  shifts  or  ramps  in  certain  guidance  system  instrument  error  coefficients.  Atten¬ 
tion  is  directed  here  to  the  detection  and  identification  of  shitte  in  the  icceleroanner 
bias  and/or  gyro  bias  drift  coefficients.  The  results  are  bared  on  lit  same  model  used 
in  the  optimization  studies.  In  addition,  two  forms  c't  the  GIJR  ‘..cst  mechanization  are 
considered  *  Fixed-Lag  and  Fixed-Interval.* 

Fixed- Lag  GUI  is  based  on  a  data  window  (M)  of  fixed  length.  The  relationship 
between  wTnJow~Teng'tK~«nd  detectable  jump  magnitude  iv  shown  in  Figs.  12  and  13.  For 
shifts  in  both  accelerometer  bias  and  gyro  bias  drift,  there  is  an  asymptotic  relation¬ 
ship  between  detectable  jump  magnitude  and  window  length.  The  minimum  acceptable  window 
length  is  approximately  29  (100  sec  for  the  5  sec  sampling  interval).  However,  maximum 
detectability  for  all  possible  jump  times  would  require  window  lengths  on  the  order  of 
45  (225  sec).  Figure  12  Indicates  that  accelerometer  shifts  on  the  order  of  10t>  <o  is 
the  initial  rms  uncertainty)  are  detectable,  with  a  false  alarm  probability  (Pj.)  of  0.05 

and  u  detection  probability  (P|.)  of  0.50.  For  gyro  bias  drift,  shifts  on  the  order  of 
50 o  are  detectable  if  they  occur  prior  to  third-stage  thrust  termination. 

The  region?  of  superior  shift  detectability  are  shown  in  Figs.  14  and  15. t  Ac¬ 
celerometer  bios  shifts  are  most  detectable  if  they  occur  after  burnout.  The  poor  per¬ 
formance  prior  to  burnout  is  caused  by  the  large  specific  force  components  exciting  the 
higher-order  accelerometer  and  gyro  error  terms.  Thus,  bias  shifts  must  be  large  rela¬ 
tive  to  the  specific  force  effects  on  the  g-dependent  errors  if  they  are  to  be  observ¬ 

able.  After  tnrust  termination,  the  detection  of  smaller  shifts  is  possible  (for  the 
same  P^)  because  of  the  lower  specific  force  component  magnitudes. 

It  is  well  known  | 11-14 j  that  gyro  bias  drift  errors  enter  guidance  velocity 
error  through  the  term  ±xt,  where  $  is  the  platform  misalignment  vector  resulting  from 
gyro  drift  errors  and  f  is  the  specific  force  vector.  It  follows  that  tyro  bias  drift 

coefficient  shift  detection  should  be  the  best  during  the  period  of  powered  flight  when 

f  is  maximum.  The  data  plotted  in  Fig.  IS  substantiates  this  premise. 


♦The  terms  Fixed -Interval  and  Fixed-Lag  were  selected  because  of  the  close  association 
of  the  formulations  to  the  Fixed- Interval  and  Fixed-Lag  Smoothers  |6|. 

f Fixed-Leg  and  Fixed- Interval  results  are  shorn  in  Figs.  14  and  15  for  comparison;  only 
the  latter  are  discussed. 
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Fixed- Interval  GLR  uses  a  variable  length  window  which  runs  from  the  candidate 
jump  time  to  the  end  of  the  data  interval.  Figures  12  and  13  show  the  effect  of  the  in¬ 
creased  window  length  on  jump  detection.  The  projected  detection  performance  of  Fixed- 
Interval  GLR  is  presented  in  Figs.  14  and  15  for  comparison  with  the  Fixed-Lag  algorithm. 
These  figures  show  the  jump  magnitude  required  to  produce  a  detection  probability  of  0.50 
when  the  threshold  is  set  for  a  false  alarm  rate  of  0.05.  Naturally,  the  larger  windows 
resulc  in  improved  performance  for  all  cases;  however,  the  improvement  is  most  dramatic 
for  accelerometer  jumps  prior  to  burnout  (200  sec). 

Accelerometer  bias  jumps  of  about  2a  are  uniformly  detectable  throughout  the 
flight.  The  significant  difference  between  the  two  GLR  mechanizations  in  detecting  jumps 
before  burnout  is  explained  as  follows.  The  Fixed-Interval  algorithm  always  has  avail¬ 
able  the  filter  residuals  after  burnout  where  the  effect  of  an  earlier  accelerometer 
shift  is  highly  observable.  The  Fixed-Lag  version  lacks  this  information  and  is  unable 
to  identify  a  small  shift  in  the  presence  of  large  g-sensitive  error  coefficients. 

Significant  information  concerning  a  variety  of  the  priority  errors  may  be  ob¬ 
tained  using  FLY-2  flight  test  data.  However,  a  number  of  shortcomings  were  identified 
based  on  insights  gained  during  the  performance  evaluation  and  optimization  studies. 

These  shortcoming/-  w  re  borne  out  during  subsequent  data  processing  activities. 

Principal  amonp  the  shortcomings  is  the  inability  to  distinguish  between  cer¬ 
tain  error  sources  /hose  signatures,  in  measurement  space,  are  nearly  identical.  It  is 
impossible,  for  example,  to  separate  between  initial  primary  and  secondary  IMU  azimuth 
misalignments;  conseo!<e>.'-:ly ,  the  need  for  an  accurate,  independent  position/velocity  ref¬ 
erence  is  apparent  The  advent  of  the  GPS-Receiver  Test  Program  (GPS-RTP)  was  most  timely 
since  it  provided  the  potential  for  uniquely  accurate  reference  system  measurements.  Fig¬ 
ure  16  shows  the  GPS  sntellite  geometry  anticipated  for  the  missile  test  dates. 

The  incorporation  of  the  GPS  measurements  significantly  improves  the  capability 
to  estimate  guida..ce-system- induced  deployment  errors  as  well  as  initial  azimuth  misalign¬ 
ment.  Taole  VIII  summarizes  the  results  of  the  FLY-2/GPS  performance  evaluation  study. 
These  results  represent  those  associated  with  a  medium  reentry  angle  trajectory.  The 
secondary  system  was  offset  45  deg  from  the  primary.  The  table  also  contains  the  pro- 
jecte  !  performance  for  FLY-l/Radar  and  FLY-2/Radar.  It  is  apparent  that  overall  weapon 
system  test  program  performance  could  be  greatly  enhanced  via  the  use  of  GPS  data. 

Recovery  of  the  priority  error  sources  is  also  improved.  Table  IX  presents  the 
secondary  system  guidance  error  coefficient  recovery  capability,  for  a  particular  FLY-2/ 
GPS  mission.  These  are  the  error  sources  that  demonstrate  the  significant  recovery  capa¬ 
bility.  FLY-l/Radar  and  FLY-2  performance  projections  are  also  included. 

Certain  of  the  recovery  ratios  tend  to  remain  large  (poor  recovery),  irrespec¬ 
tive  of  the  measurement  type  or  quality.  This  is  a  consequence  of  the  processor’s  in¬ 
ability  to  separate  the  various  error  sources.  Consequently,  for  evaluation  of  future 
systems,  a  new  methodology  to  define  filter  models,  filter  dimensions,  etc.  that  recog¬ 
nizes  the  limitation  of  "optimar’  data  processors  should  be  developed.  In  addition,  pre- 
launch  and  flight  test  data  must  be  processed  in  a  complementary  manner  to  provide  maxi¬ 
mum  system  understanding  relative  to  each  type  of  data. 

The  value  of  GPS  measurements  in  the  detection  of  instrument  coefficient  shifts 
is  demonstrated  in  Figs.  17  and  18.  A  factor  of  two  to  three  improvement  in  detection 
capability  can  be  achieved  with  the  incorporation  of  the  CPS  information.  Other  than 
this  improvement,  however,  the  characteristics  of  the  detection  process  are  unchanged. 

The  GLR  methodology  must  be  modified  to  account  for  the  inability  of  the  GLH  test  to 
identify,  with  high  confidence,  a  number  of  instrument  error  characteristics. 

V.  DATA  PROCESSOR  STRUCTURE 

The  top-level  structure  of  the  Post  Flight  Analyzer  (PFA)  is  depicted  in  Fig. 

19.  The  PFA  is  structured  such  that  guidance  system  initial  condition  errors  and  instru¬ 
ment  error  coefficient  shifts  are  determined  using  a  five-step  process; 

•  Data  Preprocessing 

•  Filter  Analysis 

•  Model  Analysis 

•  Jump  Detection  and  Identification 

•  Decision  Making. 

In  data  preprocessing,  telemetry  data  from  two  NS-20*,  the  GPS  measurements, 
and  data  from  several  radars  are  sorted,  time  synchronized,  compensated  for  determinis¬ 
tic  errors,  rotated  to  appropriate  coordinate  frames  f<n  comparison,  and  combined  such 
that  all  relevant  high-rate  data  is  reduced  to  s  rate  suitable  for  the  advanced  analysis 
tools.  Since  this  step  involves  substa  Mai  computation,  the  PFA  allows  parallel  proc¬ 
essing  of  etch  data  type,  thereby  decreasing  the  preprocessor  timeline  substantially. 

The  filter  analysis  programs  take  the  sequence  of  range  and  range- rate  differ¬ 
ence  measurements  (<&&  and/or  radar)  and  velocity  difference  measurements  (between  the 
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two  NS-20s)  provided  by  the  preprocessor  and  calculates  estimates  of  NS-20  error  coeffi¬ 
cients  (primary  and  secondary  systems),  GPS  errors,  and  radar  errors.  The  estimation 
process  is  carried  out  with  a  suboptimal  Kalman  filter  augmented  with  smoothing  capabil¬ 
ities  for  key  epoch  times  of  the  missile  flight  (e.g.,  deployment).  The  residual  differ¬ 
ences  between  the  measurements  and  the  estimates  form  the  basis  for  the  model  and  shift 
analysis  tests  which  follow. 

The  model  analysis  programs  characterize  the  residual  differences  provided  by 
the  filter.  In  particular,  tests  are  performed  to  determine  if  the  residuals  are  a  zero- 
mean  white  noise  process  with  a  variance  predicted  by  the  filter  model.  If  these  tests 
are  passed,  the  mathematical  model  imbedded  in  the  filter  is  consistent  with  the  true 
system  dynamics.  If  the  "whiteness"  tests  are  failed,  further  analysis  into  the  nature 
of  the  failure  are  initiated. 

Jump  detection  and  identification  analysis  is  used  to  seek  one  or  more  instru¬ 
ment  parameter  shifts  consistent  with  the  filter  residual  characteristics.  The  GLR  test 
provides  the  primary  means  of  instrument  coefficient  jump  detection  and  identification. 

A.  DATA  PREPROCESSING 

A  number  of  important  steps  must  be  taken  to  prepare  raw  recorded  data  so  that 
it  may  be  efficiently  and  accurately  analyzed.  The  software  that  performs  these  steps 
is  depicted  in  Fig.  20. 

The  FIGA*  Prefilter  calculates  the  specific  velocity  sensed  by  the  primary  NS-20 
and  the  indicated  velocity  difference  between  NS-20s.  Seven  categories  of  deterministic 
errors  are  compensated: 

•  The  six  PICA  pulse  sums  are  adjusted  (using  the  telemetered  time- 
of-last-pulse)  to  reduce  the  effects  of  quantization  and  sample 
time  differences.  Compensations  are  also  made  to  account  for  tim¬ 
ing  differences  due  to  the  asynchronous  sampling  of  the  PIGAs  and 
due  to  guidance  computer  clock  drift  rate. 

•  The  six  PIGA  pulses  are  compensated  for  errors  due  to  "coning,"  the 
result  of  a  misalignment  between  the  PIG  float  and  PlAG  input  axes. 

The  misalignment  angle  and  phase  are  calibrated  using  prelaunch 
telemetry  data. 

•  The  PICA  pulses  are  subsequently  passed  through  a  digital  low-pass 
filter  to  reduce  the  residual  random  errors  resulting  from  quanti¬ 
zation  and  timing  uncertainties.  The  result  is  the  "best  estimate" 
of  all  six  PICA  pulse  sums. 

•  The  PIGA  pulses  are  compensated  with  pre- flight  estimates  of  bias 
and  nonlinearity  errors  and  then  transformed  Into  a  velocity  vector 
in  NS-20  computational  coordinates. 

•  The  velocity  vector  is  then  corrected  for  platform-to-computer  mis¬ 
alignments.  The  misalignment  is  based  on  the  prelaunch  values  of 

A  A 

gyro  g £  and  error  coefficients.  Initial  misalignment,  due  to 
gyro  torquer  limit  cycling,  is  also  taken  into  account. 

•  Platform  compliance  errors  arc  compensated  next  using  a  27-term 
platform  bending  model. 

•  Finally,  the  velocity  of  the  secondary  NS-20  relative  to  the  primary 
NS-20  (lever  ana  effect)  is  subtracted  from  the  sensed  velocity  of 
the  secondary  system.  This  compensation  is  based  upon  the  telem¬ 
etered  NS-20  gimbal  angles  which  have  been  interpreted,  smoothed, 
and  differentiated. 

The  results  at  this  point  in  the  NS-20  processing  are  two  measurements  of  the 
integrated  specific  forcet  (specific  velocity),  as  sensed  by  the  two  set*  of  instruments, 
compensated  for  all  known  deterministic  errors.  The  specific  velocity  vectors  are  pro¬ 
vided  to  the  trajectory  integrator. 

The  Trajectory  Integrator  calculates  the  position  and  velocity  of  the  missile 
based  on  the  Best  availaSle  gravity  model  and  the  sensed  specific  velocity.  To  allow  for 
refinements  in  the  trajectory  (as  bad  data  is  removed  and  errors  are  estimated)  without 
repeating  the  long  integration  process  required  by  this  program,  the  total  gravity  gradi¬ 
ent  matrix  is  also  calculated. 

Radar  data  is  preprocessed  using  two  programs.  The  Radar  Synchronizer  performs 
the  function  of  merging  data:  range  measurement  data  from  uptoiOt racking  radars  are 


*The  accelerometers  used  on  Hlnuteman  111  are  PIGAs  or  Pendulous  Integrating  Gyroscopic 
Accelerometers  (11|. 

tSpecific  force  is  the  sum  of  all  forces  acting  on  the  vehicle  except  for  gravity  |11|. 
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extracted  from  the  raw  data  tapes,  put  in  common  engineering  units  and  time  synchronized. 
The  result  is  a  single  sequential  file  containing  all  available  radar  data. 

The  Radar  Prefilter  determines  the  difference  between  measured  radar  range  and 
range-rate  and  computed  range  and  range-rate  based  on  the  NS-20-indicated  position  and 
velocity  time  history  generated  by  the  Trajectory  Integrator.  High-frequency  measurement 
noise  is  reduced  by  averaging  all  range  differences  over  a  4.5  sec  time  interval.  The 
geometry  of  each  measurement  is  also  determined  in  order  to  properly  weight  the  one-di¬ 
mensional  range  and  range-rate  difference  measurements  in  the  estimation  of  three-dimen¬ 
sional  position  and  velocity  vectors. 

Two  types  of  GPS  measurement  data  are  available:  preflight  data  and  inflight 
data.  The  primary  purpose  of  the  preflight  processing  is  to  calibrate  GPS-related  errors, 
most  notably  receiver  clock  errors.  These  estimates  are  then  applied  to  the  inflight 
data. 


The  data  tape  records  contain  a  mixture  of  parameter  values  from  receiver-gener¬ 
ated  "high-rate"  and  "low-rate"  data  tables;  two  separate  programs  extract  the  required 
components  from  each  table.  The  first  reads  selected  values  from  the  low-rate  table  and 
prepares  the  data  for  input  to  the  calculation  of  pseudo- range.*  These  data  items,  such 
as  master  time  delays,  user  and  satellite  epoch  count  differences,  etc.,  are  either  con¬ 
stant  or  slowly  time-varying  quantitites.  The  required  high-rate  data  table  items  are 
accessed  by  a  second  program.  The  items  include  the  replica  code  counter  states,  vernier 
range  corrections,  range-rates,  and  status  and  identification  tags  for  each  channel. 

The  program  uses  this  data,  along  with  that  supplied  from  the  low-rate  table,  to  form 
the  receiver-to-satellite  pseudo-range  measurement  and  also  scales  the  range-rates  and 
corrects  for  the  range-rate  computational  delay  prior  to  outputting  time- tagged,  satel¬ 
lite-indexed,  corrected  pseudo-range  and  range-rate  measurements. 

The  GPS  measurements  are  compensated  for  the  following  calibratable  errors: 

•  Satellite  clock  errors 

•  Receiver  clock  gravity-sensitive  trending 

«  Tropospheric  propagation  delays 

•  Relative  position  and  velocity  offset  between  GPS  antenna  phase 
center  and  the  primary  IMU 

•  Relativistic  effects,  both  special  and  general,  between  GPS  receiver 
and  ground-based  user. 

A  final  preprocessing  program  determines  the  differences  between  the  measured 
and  computed  range  and  range-rate  using  the  receiver-  and  satellite-indicated  position 
and  velocity.  The  satellite  state  vector  is  computed  at  each  time  of  signal  transmission 
using  the  best  estimate  of  the  "Block  II"  ephemeris  data  provided  by  the  GPS  Joint  Pro¬ 
gram  Office  (15).  The  receiver  position  and  velocity  are  determined  from  the  best  esti¬ 
mate  trajectory  interpolated  to  the  time  of  signal  reception.  The  resulting  difference 
measurements  are  then  compressed  to  suppress  measurement  noise  effects,  These  range  and 
range-rate  differences,  along  with  the  computed  measurement  geometry,  are  the  final  pre¬ 
processor  outputs. 

B.  FILTER  ANALYSIS 

The  filter  calculations  use  the  sequence  of  range  and  range-rate  differences 
(GPS  and/or  radar)  and  velocity  differences  (between  N5-20s)  to  calculate  the  best  esti¬ 
mate  of  NS-20  guidance  coefficients,  GPS  errors,  and  radar  errors  based  on  a  priori  error 
statistics.  The  reaidual  differences  between  the  measurements  and  estimates  are  the 
basic  inputs  to  the  GLR  testa.  Date  flow  is  shown  in  Fig.  21.  The  residual  calcula¬ 
tions  are  divided  into  two  separate  sets  of  programs  for  computational  efficiency.  The 
firsjt,  Gain  Calculation,  requires  the  straight-forward,  although  lengthy,  computation  of 
Salman  ga'lns’liascd'  on  the  nominal  trajectory,  the  system  error  model  an«  the  measurement 
sequence . 


TWo  forms  of  smoother  may  be  used.  The  Fixed-Point  Smoother  (16)  allows  the 
computationally  efficient  estimation  of  a  limited  number  of  smoothed  states  at  selected 
times.  This  is  particularly  attractive  if  only  certain  candidate  states  are  suspected 
to  be  time-varying.  The  Fixed- Interval  Smoother  (6)  allows  the  smoothed  estimation  of 
all  parameters  but  requires  more  computation*.  Both  have  a  place  in  he  search  for  un¬ 
modeled  parameter  changes. 

The  second  program  set.  Residual  Calculator,  uses  the  gains  to  interpret  the 
measurements  from  the  data  preprocessor .  burin g  preliminary  data  editing,  the  relatively 
simple  residual  calculations  can  be  performed  many  times,  using  the  same  set  of  gains, 
without  significant  loss  of  accuracy.  The  more  lengthy  gain  calculations  need  be  re¬ 
peated  only  after  "bad"  data  has  been  removed  or  whenever  the  filter  model  is  changed. 


*Ps’eu3b- range  measurements  contain  "true"  slant  range  plus  the  receiver  clock  phase 
offset. 
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C.  MODEL  ANALYSIS  AND  JUMP  DETECTION/IDENTIFICATION 

The  primary  jump  detection  is  performed  using  the  Generalized  Likelihood  Ratio 
(GLR)  test.  The  Kalman  filter  residuals  are  used  in  the  GLR  test.  The  GLR  test  compu¬ 
tations  are  divided  into  two  routines:  the  first  calculates  the  GLR  gain  matrices,  the 
second  performs  the  GLR  test  on  the  data.  (The  GLR  gain  matrices  need  only  to  be  recom- 
puted  when  the  filter  gain  matrices  are  recomputed.)  The  GLR  Test  routines  use  the  gain 
matrices  to  determine  whether  the  filter  residuals  are  consistent  with  the  model.  In 
the  event  of  a  jump  detection,  the  time  and  identity  of  the  parameter(s)  which  changed 
(jumped)  are  estimated.  These  jump  estimates  may  then  be  used  to  change  the  model  and 
the  residual  calculations  may  be  repeated. 

Residual  Tests  are  performed  on  the  filter  and  smoother  outputs.  If  the  models 
are  correct,  the  residuals  will  be  a  zero-mean,  uncorrelated  random  sequence  and  with 
variance  as  predicted  by  the  model.  Thus,  the  sample  mean  and  variance  calculations  pro¬ 
vide  some  clues  as  to  the  nature  of  the  modeling  errors.  The  residual  tests  provide  a 
"quick- look"  capability  to  identify  missions  with  possible  parameter  jumps. 

VI.  DATA  PROCESSING  EVALUATION  RESULTS 

Data  processing  included  evaluation  of  data  from  the  three  FLY-2  flights  and 
the  first  FLY-2/GPS  mission.  Here,  the  processing  results  from  one  FLY-2  flight  (STM- 
13V)  and  the  FLY-2/GPS  flight  (PVM-I8)  are  highlighted.  Table  X  summarizes  the  princi¬ 
pal  characteristics  of  each  flight. 

The  STH-13V  data  analysis  focused  on  jump  detection  and  identification.  A 
quick-look  technique  for  determining  the  possibility  of  PIGA  and/or  gyro  error  coeffi¬ 
cient  shifts  was  developed  based  on  the  velocity  difference  data  generated  by  the  dual 
IMUs.  Figure  22  is  a  plot  of  sensed  velocity  differences  over  the  first  1500  sec  of 
powered  flight.  Three  distinct  phases  are  evident.  During  the  first  or  powered  flight 
phase  (0  to  180  sec),  uncompensated  acceleration-dependent  errors  cause  parabolic  error 
growth.  Over  the  second  phase  (180  sec  to  500  sec)  no  error  growth  is  evident  since  the 
vehicle  ia  experiencing  a  nearly  zero  specific  force.  The  third  phase  (after  500  sec) 
provides  a  clear  indication  of  a  PIGA  bias  shift. 

The  PIGA  bias  shift  is  easily  detected  by  the  GLR  algorithm.  However,  in  order 
to  identify  a  gyro  error  (e. g. ,  bias  drift)  coefficient  shift  during  the  powered  flight 
phase,  a  closer  look  at  the  data  over  the  first  interval  is  required.  Rather  than  veloc¬ 
ity  differences,  it  ia  more  enlightening  to  examine  plat  form- to-computer  misalignment 
angles,  £.  These  angles  can  be  obtained  by  recalling  that,  over  a  very  short  time  inter¬ 
val  At, 

f tc  ♦  At 

I  <t  x  f)  dt  a  av  (4) 

The  Av  are  the  differences  in  measured  velocity.  If  is  assumed  constant  over  the  in¬ 
terval  At, 

flo  ♦  At 

t  XI  (f )dt  *  Av  a  -  { A)  (5) 
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where  (A)  is  the  skew-symmetric  matrix  of  integrated  specific  force  components.  As  a 
consequence 

4  a  -  |A*AJ  av  (6) 

Figure  23  presents  the  estimates  of  the  three  components  vf  *  in  computer  axes 
(down-range,  cross-track,  and  up).  Straight-line  approximation  curves  correspond 

to  tha  assumption  that  the  gyro  drift  le  caused  by  a  bias  only  *  t silly  with  a  Jump 
change  in  magnitude.  The  curves  in  Fig.  23  indicate  the  possibility  of  gyro  bias  error 
breaks  occurring  at  approximately  60  and  135  sec.  However,  tbe  shifts  are  not  large  and 
may  be  accounted  for  by  the  NS -20  error  model. 

It  was  anticipated  that  a  PICA  bias  shift  would  be  evident  in  the  CDt  output 
and  perhaps  one  or  more  gyro  bias  shifts  would  be  detected.  It  should  be  recalled  that 
the  GLR  test  answers  the  question:  whst  is  the  relative  likelihood  tuat  a  given  guidance 
coefficient  experienced  a  shift  at  a  given  time,  compared  to  the  null  hypothesis  (no 
shift)?  The  likelihood  ratio  (i)  is  the  quantity  used  to  qusntify  the  alternative  hy¬ 
potheses.  Large  I  implies  a  significant  jump  in  the  parameter  of  interest.  Values  of 
C  <  10  are  not  significant  as  Indicators  of  s  jump  at  the  Pf  =  0.05  level.  Results  were 

obtained  using  both  the  Fixed-Lag  GLR  algorithm  with  a  100  sec  data  window,  and  from  the 
Fixed-Interval  GLR  algorithm  incorporating  data  up  to  600  sec  into  the  flight.  For  brev¬ 
ity,  only  Fixed-Lag  results  are  presented  herein. 

The  Fixed-Lag  GLR  results  (Fig.  24)  display  s  weak  PICA  bias  anomaly  at  140  sec 
which  falls  well  below  the  l  =  Ifl  threshold.  Consequently,  this  weak  anomaly  was  not 
deemed  to  be  significant.  A  strong  anomaly  is  evident  in  the  post-boost  phase.  While 
this  anomaly  has  minimal  impact  on  system  accuracy  (because  of  its  time  of  occurrence), 

It  was  evaluated  in  detail.  The  conclusion  of  the  analysis  is  that  all  three  PIGAs  in 
the  primary  system  experience  shifts  in  their  bias  level  near  450  sec. 
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Fixed-Lag  gyro  bias  drift  likelihood  ratios  demonstrate  a  false  response  due  to 
the  large  PIGA  bias  shifts  (Fig.  25).  Only  two  of  the  three  gyro  bias  drift  states  are 
presented;  the  third  drift  state  is  nearly  unobservable  and  consequently  of  no  interest. 
Indepth  analysis  of  the  telemetry  and  radar  data  indicates  that  no  detectable  gyro  bias 
drift  anomalies  occurred  during  the  boost  phase. 

Results  of  the  STM-13W  data  analysis  are  typical  of  those  for  each  FLY-2  flight. 
The  present  of  a  large  PIGA  bias  shift  near  the  450  sec  time  print  was  evident  on  all 
the  flights.  In  addition,  there  was  no  indication  of  a  detectable  PIGA  or  gyro  anomaly 
during  boost. 

It  was  concluded  that  anomalous  guidance  system  performance,  in  the  form  of 
shifts  in  instrument  parameters,  is  not  significant  statistically  or  in  terms  of  weapon 
system  accuracy.  Thus,  for  the  FLY-2/GPS  data  processing  activity,  the  focus  shifted  to 
guidance  coefficient  estimation  and  GPS  receiver  performance  assessment. 

For  the  PVM-18  mission,  four  GPS  satellites  provided  range  and  range-rate  meas¬ 
urements.  The  subsatellite  points  for  each  satellite  are  indicated  in  Fig.  26.  The  sig¬ 
nals  from  the  satellites  were  acquired  approximately  five  sec  after  launch  and,  except 
during  staging  events,  provided  accurate  information  out  to  approximately  1000  sec. 
Satellites  No.  1  and  No.  2  yield  very  good  cross-range  information,  while  Satellites  No. 

3  and  No.  4  contribute  primarily  to  down-range  information.  Overall,  GDOP  (Geometric  Di¬ 
lution  of  Precision  (5))  was  approximately  4-5  over  the  flight,  with  the  vertical  channel 
having  the  poorest  GDOP.  The  cross-track  axis  had  a  single-channel  GDOP  less  than  1.0. 
Values  of  azimuth,  as  measured  from  north,  and  elevation  above  the  local-horizontal  plane 
of  the  satellite  relative  to  the  receiver  are  summarized  in  Table  XI, 

The  primary-minus-secondary  IMU  velocity  differences  for  PVM-18  are  presented 
in  Fig.  27.  The  three  phases  of  error  growth  (i.e.,  powered  flight,  free  flight,  and 
post  PIGA  bias  shift)  are  evident. 

The  velocity  differences  are  quite  small  through  the  first  125  sec  of  flight 
(the  first  and  second  stages  of  missile  thrusting).  A  rapid  divergence  occurs  during 
third-stage  thrusting,  leveling  off  in  all  three  axes  at  thrust  termination.  The  nature 
of  the  third-stage  divergence,  occurring  most  noticeably  in  the  cross-track  direcTiorT  ” 
and  wi^h  an  ^vTousthrusb^epcn^ence~rT8  indicative  otTargo  resTSual  gyro  errors  Tor 
iToairDaYy  P  1  GA~  mitt all gnme n ts )in  o'ne  or  "both  systems.  Cross-track  sensitivity  to  gyro 
errorsis  extremely  higW  Til  Minuteman  "Sue  to"  tKe  exclusively  in-plane  (x-z  plane)  thrust¬ 
ing  pattern  utilized.  The  sensitivity  to  errors  about  azimuth  is  further  accentuated  by 
the  particular  trajectory  flown. 

Range  and  range-rate  difference  measurements  along  the  Satellite  No.  1  line-of- 
sight  are  presented  in  Figs.  28  and  29,  respectively.  The  dotted  line  represents  the 
high  frequency  measurement  data  and  the  solid  line  represents  the  "smooth"  difference 
measurements  used  in  the  subscouent  data  analysis.  These  are  typical  of  the  range  and 
range-rate  measurements  provided  by  ail  four  satellites,  (The  large  spikes  in  the  range- 
rate  measurement#  occur  during  periods  when  the  tracking  algorithm  is  in  frequency  track 
only. ) 


The  continuous  GPS  measurement  availability  and  excellent  measurement  quality 
made  it  possible  to  improve  the  post-mission  analysis  results.  The  S^j  and  f‘Sj  gyro  er¬ 
rors  were  identified  as  the  primary  sources  of  the  observed  impact  error  for  PVM-18. 

This  conclusion  is  based  on  the  fact  that  the  post- flight  analysis  is  in  excellent  agree¬ 
ment  with  prcflight  predictions  of  the  miss-distance  contributions  of  the  and  Fgj 

error  sources.  However,  even  with  the  excellent  GPS  measurement  quality,  the  inherent 
eapabi  Uty  for  a^GVatctr^afjtXnc^ 

instrument  errors  is  still  in  question.  This  separability  problem  was  demonstrated 

baaed  on  rceulte  of  a  sensitivity  study  performed  as  part  of  the  PVM-18  analysis.  This 
study  examined  the  sensitivity  of  the  estimated  impact  error  contribution  of  critical 
gyro  error  sources  to  their  initial  rms  uncertainties.  Results  of  this  study  show  sig¬ 
nificant  variations  in  the  estimated  miss  distance  resulting  from  the  individual  gyro 
error  sources,  while  total  impact  error  displays  minimal  net  variation.  This  behavior 
ia  indicative  of  a  basic  inability  to  isolate  individual  gyro  error  source  contributors. 

Post-flight  evaluation  of  Minuteman  performance  based  on  GPS  test  measurements 
indicates  that  CPS  is  an  excellent  absolute  reference  for  both  position  and  velocity. 
Estimation  of  initial  platform  misalignment  ia  also  improved.  In  addition,  qualitative 
insight  into  PIGA  performance  nan  also  be  obtained  through  examination  of  tKU/GPS  veloc¬ 
ity  differences.  A  performance  issue  that  impacts  future  utilisation  of  CPS  must  still 
be  addressed,  however.  This  issue  is  the  proper  use  and  interpretation  of  data  that  ia 
extremely  accurate  at  the  position  and  velocity  levels  but,  due  to  the  complexity  of  the 
underlying  guidance  error  model,  does  not  provide  unique  insight  into  the  magnitudes  of 
specific  instrument-related  error  mechanisms.  In  other  words,  the  model  observability 
problem  must  be  addressed  as  it  becomes  necesssry  to  work  to  finer  levels  of  detail  in 
generating  system  understanding.  The  capabilities  of  filtering  and  smoothing  analysis 
offer  a  great  deal  in  evaluation  of  complex  weapon  systems  but  only  to  a  certain  thresh¬ 
old  which  must  be  identified. 
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VII.  SUMMARY 

Over  the  past  10  years  the  U.S.  Air  Force  has  upgraded  the  accuracy  of  the  Min- 
uteman  III.  To  assess  and  further  these  accuracy  upgrades,  a  number  of  flight  test  pro¬ 
gram  instrumentation  enhancements  were  incorporated.  These  include  post- flight  process¬ 
ing  of  multiple  IMU  test  data  (the  MPMS  and  FLY-2  Programs),  and  use  of  GPS  data  (the 
FLY-2/GPS  Program).  This  series  of  flight  test  programs  was  planned  and  executed  in  a 
logical  manner  to  minimize  cost/schedule  impacts. 

The  Minuteman  III  flight  test  program  enhancements  have  been  successful.  A  num¬ 
ber  of  significant  error  mechanisms  were  identified  and  isolated  using  data  obtained  dur¬ 
ing  the  MPMS,  FLY-2,  and  FLY-2/GPS  Programs.  With  the  improved  accuracy  objectives  asso¬ 
ciated  with  the  next  generation  ICBM  system,  problems  of  flight  test  optimization  and 
post-mission  processing  will  continue  to  provide  challenging  opportunities  over  the  years 
ahead.  There  will  almost  assuredly  be  further  advances  in  filtering  and  smoothing  theory 
to  support  the  needed  growth  in  system  understanding. 
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Figure  2  Gyro  Input  Axes 
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Figure  3  inertial  Measurement  Unit  Frames 
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Figure  4  Post -Flight  Evaluation  Software  Data  Flow 
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Figure  $  General  Relationship  Between  PQ  tsd  Fp  u  a  Function  of  H  and  a 
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Figure  8  Typical  Specific  Force  Tine  History 
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Figure  9  Recovery  Ratio*  a*  a  Function  of  Secondary 
tHU  Aainuth  Offset  ( Low  Reentry  Angle) 
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Figure  11  Recovery  Ratios  as  a  Function  of  Secondary  IMU 
Azimuth  Offset  Angle  (High  Reentry  Angle) 


Figure  12  Accelerometer  Bias  Jump  Detection  vs 
Window  Length  Using  Fixed-Lag  GLR 
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Figure  13  Gyro  Bias  Drift  Jump  Detection  vs 
Window  Length  Using  Fixed-Lag  GLR 
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Figure  16  Phase  I  GPS  Satellite  Configuration 
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Figure  27  Prlnary  Minus  Secondary  1MU  Velocity  Differences  (Computer  Coordinates) 
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Figure  28  GPS-Prinary  Satellite  1  Range  Difference  Measu resents 
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Figure  29  GPS-Primary  Satellite  1  Range-Rate  Difference  Measurements 
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TABLE  II 

RADAR  ERROR  MODEL  SUMMARY 


NUMBER  OF  STATES 

ERROR  SOURCE  NAME 

FULL 

MODEL 

TRUTH 

MODEL 

FILTER 

MODEL 

RANGE  MEASUREMENTS 

Bias  Error 

1 

1 

1' 

Scale  Factor  Error 

1 

1 

1 

Random  Error 

1 

1 

I 

Measurement  Noise 

1 

1 

1 

Survey  Errors 

0 

0 

0 

RANGE-RATE  MEASUREMENTS 

Bias  Error 

1 

1 

1 

Scale  Factor 

1 

1 

1 

Random  Error 

1 

1 

1 

Measurement  Noise 

1 

1 

1 

Survey  Errors 

0 

0 

0 

TABLE  III 

GPS  ERROR  MODEL  SUMMARY 


NUMBER  OF  STATES 

ERROR  SOURCE  NAME 

FULL 

MODEL 

TRUTH 

MODEL 

FILTER 

MODEL 

SATELLITE  ERRORS 

Position 

3 

3 

0 

Velocity 

3 

3 

0 

Solar  Radiation  Force 

1 

1 

0 

Gravitation  Constant 

1 

1 

0 

Satellite  Clock 

3 

3 

0 

PROPAGATION  ERRORS 

2 

0 

0 

RECEIVER  ERRORS 

Missile  Clock 

5 

3 

3 

Carrier  and  Code  Loop 

2 

0 

0 

TABLE  IV 

FLY-1  GUIDANCE  ERROR  RECOVERY  RATIOS  AT  BOOST  BURN-OUT 


REENTRY 

ANGLE 

DOWN-RANGE 

POSITION 

ERROR 

CROSS -TRACK 
POSITION 
ERROR 

DOWN -RANGE 
VELOCITY 
ERROR 

CROSS -TRACK 
VELOCITY 
ERROR 

PREDICTED 
CROSS -TRACK 
HISS 

PREDICTED 

CEP 

Low 

0.98 

0.56 

0.99 

0.63 

0.63 

0.82 

Medium 

0.99 

0.53 

0.99 

0.56 

0.56 

0.76 

High 

0.99 

0.55 

0.99 

0.62 

0.63 

0.79 
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TABLE  V 

BEST  FLY-1  PRIORITY  ERROR  SOURCE  COEFFICIENT  RECOVERY  RATIOS 


ACCELEROMETER  COEFFICIENTS 

GYRO  COEFFICIENTS 

INITIAL 

AZIMUTH 

ERROR 

1ST  ORDER 
NONLINEARITY 

<*Fil> 

CROSS-TRACK 

E2 

(Bc2) 

PLATFORM 

COMPLIANCE 

BIAS 

g-DRIFT 

g2 -DRIFT 

gA-DRIFT 

0.99 

0.99 

0.99 

0.99 

0.99 

0.96 

0.98 

0.80 

TABLE  VI 

BEST  FLY-2  PRIORITY  ERROR  SOURCE  COEFFICIENT  RECOVERY  RATIOS 


ACCELEROMETER  COEFFICIENTS 

GYRO  COEFFICIENTS 

1ST  ORDER 
NONLINEARITY 

<«Fil> 

CROSS-TRACK 

E2 

(Bc2) 

PLATFORM 

COMPLIANCE 

BIAS 

g-DRIFT 

g2-DRIFT 

gA-DRIFT 

«C 

«D 

6B 

6E 

P 

J 

0.93 

0.96 

0.93 

0.99 

0.76 

0.81 

0.48 

0.18 

0.07 

0.22 

TABLE  VII 

EFFECT  OF  AZIMUTH  OFFSET  VARIATIONS  ON  ERROR  RECOVERY  -  FLY-2/GPS 


DIFFERENTIAL 
AZIMUTH  OFFSET 
|Aa<  -A<>2 1 

(deg) 

INDIVIDUAL  AZIMUTH 
OFFSETS  (deg) 

COMPOSITE 

COEFFICIENT 

RECOVERY 

GUIDANCE 

CEP 

RECOVERY 

A«i 

&<>2 

30 

0 

30 

0.29 

0.62 

-15 

15 

0.47 

0.65 

45 

0 

45 

0.32 

0.63 

-15 

30 

0.44 

0.72 

-22.5 

22.5 

0.49 

0.76 

TABLE  VIII 

SUMMARY  OF  GUIDANCE  ERROR  RECOVERY  RATIOS  AT  BOOST  BURN-OUT 


TRACKING 

SYSTEM 

GUIDANCE  RECOVERY  RATIOS 

DOWN-RANGE 

POSITION 

ERROR 

CROSS -TRACK 
POSITION 
ERROR 

DOWN-RANGE 

VELOCITY 

ERROR 

CROSS -TRACK 
VELOCITY 
ERROR 

INITIAL 

AZIMUTH 

ERROR 

FLY-1/R«d«r 

0.98 

0.56 

0.99 

0.63 

0.80 

FLY-2/Reder 

0.71 

0.50 

0.71 

0.60 

0.60 

FLY- 2/CPS 

0.01 

0.02 

0.02 

0.03 

0.36 

TABLE  IX 

SECONDARY  GUIDANCE  SYSTEM  PRIORITY  GYRO  ERROR 
SOURCE  COEFFICIENT  RECOVERY  RATIOS 


INSTRUMENTATION 

CONFIGURATION 

RECOVERY  RATIOS 

68 

6E 

P 

J 

FLY-l/Reder 

0.98 

0.96 

0.98 

0.99 

PLY-2/TUdar 

0.S4 

0.33 

0.22 

0.56 

FLY-2/GPS 

0.51 

0.25 

0.13 

0.37 
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TABLE  X 

FLIGHT  TEST  PARAMETERS 


PARAMETER 

- b - 

STM-13W 

PVM-18 

Launch  Date 

1 

31  January  1977 

31  January  1980 

Launch  Time 

05 :  <^5 : 00  PST 

05:40:00  PST 

Reentry  Angle 

i  Low 

Medium 

Primary  Azimuth  Offset 

1 

0  deg 

0  deg 

Secondary  Azimuth  Offset 

45  deg 

0  deg 

TABLE  XI 


GPS  SATELLITE/RECEIVER  GEOMETRY 


*At  time  of  launch. 
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VALIDATION  OF  FILTER/SMOOTHER  MODELS 


I .  INTRODUCTION 

A.  MOTIVATION  FOR  MODEL  VALIDATION 

The  importance  of  the  model  used  in  design  of  a  Kalman  filter/smoother  is  well- 
recognized.  In  practical  applications,  tradeoffs  inevitably  exist  between  complex  models 
which  represent  the  real  world  with  great  fidelity  and  simplified  approximate  models 
which  lead  to  less  costly  implementations.  To  the  system  analyst,  the  question  of  model 
validity  is  of  great  importance  for  two  reasons.  First ,  if  Kalman  filtering  results  are 
to  be  interpreted  with  confidence,  it  is  essential  that  the  model  be  a  valid  representa¬ 
tion  of  the  physical  system.  Second ,  and  perhaps  even  more  important,  the  model  often 
represents  a  baseline  design  of  the  system  with  associated  baseline  system  performance 
characteristics.  If  that  model  is  not  a  valid  representation  of  the  actual  system  being 
tested,  there  is  an  implication  that  the  system  does  not  match  its  baseline  design,  and 
therefore  may  not  meet  its  baseline  performance  characteristics. 

In  this  chapter,  a  procedure  for  model  validation  is  described.  The  procedure 
is  based  on  statistical  hypjthesis  tests  focused  on  the  question:  "Are  the  estimates 
generated  by  a  Kalman  smoother  from  system  test  data  consistent  with  the  system  model?" 

The  procedure  is  a  multiple-test  approach;  filter/smoother  estimates  from  several  system 
tests  are  combined  in  a  common  data  base  on  which  the  statistical  hypothesis  tests  are 
performed. 

The  model  validation  procedure  discussed  is  based  on  well-known  statistical  hy¬ 
pothesis  testing  methods  (1,21.  The  contributions  presented  are  first ,  a  problem  formu¬ 
lation  and  second ,  analysis  and  data  reduction  procedures  which  lead  to  efficient  appli¬ 
cation  of  the  statistical  hypothesis  tests.  Another  approach  to  the  model  validation 
problem  which  is  potentially  applicable  to  the  same  class  of  systems  is  based  on  maximum- 
likelihood  parameter  identification  procedures  |3,M.  In  each  of  those  methods,  Expecta¬ 
tion-Maximization  (E-M)  algorithms  (5j  are  used  to  obtain  maximum  likelihood  estimates 
of  statistical  parameters.  Those  estimates  are  then  available  for  use  in  the  calculation 
of  test  statistics  for  hypothesis  testing  similar  to  that  discussed  here. 

Another  approach  to  model  validation  is  based  on  direct  examination  of  residual 
sequences  resulting  from  an  ensemble  of  test  results  ( 6-8 | .  Procedures  based  on  this  ap¬ 
proach  are  designed  only  to  detect  the  presence  of  data/model  inconsistency;  unlike  the 
procedures  described  in  this  chapter,  they  do  not  attempt  to  Isolate  the  source  of  the 
inconsistency  to  specific  system  parameters.  Kashyap  and  Bao~T5T~<n seusa  the  model  vali¬ 
dation  problem  from  this  viewpoint  with  an  emphasis  on  much  smaller  systems  than  consid¬ 
ered  here  and  under  the  restrictive  assumption  that  all  tests  are  based  on  identical  sce¬ 
narios.  Goodrich  and  Caines  (7)  consider  the  statistical  parameter  identification  prob¬ 
lem  based  on  likelihood  functions  formed  dtrectly  from  the  innovations  (i.e.,  filter  re¬ 
siduals)  of  each  test.  Bara*  |8j  approaches  the  model  validation  problem  by  generating 
test  statistics  from  normalized  residual  sequences  from  each  test.  Although  these  latter 
approaches  have  considerable  potential  bccuase  of  their  generality,  they  are  computation¬ 
ally  very  costly,  and  do  not  lead  to  the  development  of  an  expanding  data  base  available 
for  use  by  the  test  deslgner/analyst . 

8.  THE  MODEL  VALIDATION  PROBLEM  FOR  LINEAR  DISCRETE-TIME  SYSTEMS 

The  models  used  in  the  design  of  post-test  data  processors  are  typically  devel¬ 
oped  via  a  two-atep  procedure.  First,  models  using  algebraic  and  differential  {or  dif¬ 
ference)  equations  are  developedTor~ each  component  and  error  source  of  the  complete  sys¬ 
tem  baaed  on  physical  understanding.  Second,  specific  numerical  values  for  the  various 
parameter*  of  these  models  are  determined~tlirough  laboratory  tests,  analysis  of  previous 
field  tests,  and  engineering  judgment.  This  procedure  leads  to  a  formulation  of  the 
model  validation  problem  based  on  the  model  structure  and  parameters  shown  in  Fig.  1. 

In  this  discrete-time  formulation,  the  structure  of  the  model,  including  the 
order  of  the  difference  equations  describing  the  dynamic  model,  is  assumed  to  b#  correct. 
The  various  model  parameters  shown  in  Fig.  I  are  less  certain  and  are  to  be  validated. 

The  vector  t  is  referred  to  as  the  initialization  veetor  for  the  model.  Use  of  a  sepa¬ 
rate  symbol~for  e  (which  is  exactly  x_  in  Fig.  1)  is  motiviated  by  the  multiple-phase 

models  discussed  in  Section  11  D.  Note  that  a  separate  block  has  been  shown  representing 
the  measurement  system  because  the  measurement  system  model  is  sn  essential  part  of  the 
oversll  model  validation  Question.  If  dynamic  states  are  used  in  the  measurement  system 
model,  they  must  be  included  in  the  g(  vector  representing  system  dynamics  in  Fig.  1. 

The  model  parameters  fall  into  two  categories:  Structural  parameters  which  de¬ 
fine  the  elements  of  matrices  and  H4  and  statistical  parameters  which  define  the  nor¬ 
mal  density  functions  which  model  c ,  v(  and  y( .  The  approach  described  here  is  directed 

only  at  b  and  l,  the  statistical  parameters  associated  with  t.  For  typical  systems,  these 
parameters  are~among  the  most  crucial  in  determining  overall  performance,  in  a  ballistic 
missile  system,  for  example,  non-eero  values  of  b  lead  directly  to  a  bias  »n  the  impact 
distribution;  similarly,  off-nominal  values  of  10)  lead  directly  to  an  off-nominal 

Circular-Error-Probable  (CEP)  for  the  system.  The  other  statistical  parameters  and  the 
structural  parameters  in  Fig.  1  should  not  be  dismissed  as  unimportant,  but  they  are 
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Figure  1  Model  Structure  and  Parameters 

generally  somewhat  better  known  tnao  b  and  S.  Exceptions  to  «hU.  and  an  area  of  current 
research,  arc  the  parameters  associated  with  Harkov  error  models  often  used  to  represent 
various  system  error  sources.  Approaches  to  validating  these  parameters  have  been  pre* 
seated  in  the  literature  |?,81  but  alsorithms  which  are  practical  for  use  in  large  system 
test  data  analysis  have  not  been  developed  as  yet. 

the  model  validation  problem  addressed  in  this  chapter  is  as  follows: 

Given  Models  as  defined  by  the  matrices: 

v  •}.  mJ*  of,  «*t 

k  *  1,  .  .  . ,  N  *  Humber  of  tests 

t  *  l,  .  .  . .  tf{k)  s  Humber  of  Kalman  updates  for  test  k 

*  ttk 

and  optimal  smoothed  estimates  ^  ,  of  the  initial  state  vector 

of  each  test,  x^ 

*V 

Evaluate 

Consistency  of  the  model  assumptions: 

1  >  0,  1  »  lfl 

with  the  estimates,  x*  .  assuming  that  the  remainder  of  the  model 

is  correct 

Only  the  model  initial-mean  validation  procedures  are  described  in  this  chapter. 
Validation  of  the  modeL  initial  covariance  matrix  is  based  on  the  same  input  data  and  in¬ 
volves  similar  but  not  Identical  algorithms.  A  summary  of  covariance  matrix  validation 
procedures  is  given  tn  Section  IV. 

C.  ACKNOWLEDGMENT 

Design  and  development  of  the  procedures  described  in  this  chapter  and  of  a  soft¬ 
ware  system  which  Implements  them  has  been  the  work  of  several  individuals  at  The  Analytic 
Sciences  Corporation.  The  original  concept  was  developed  J.L.  Center  and  P,d.  01 inski. 


12-31 


Design  of  the  bias  capability  procedures  was  primarily  by  J. E.  Sacks  and  covariance  ma¬ 
trix  validation  procedures  were  developed  by  F.K.  Sun.  The  software  system,  including 
the  data  base  design,  was  developed  by  S.L.  Rubin. 

II.  VALIDATION  OF  THE  MODEL  INITIAL  MEAN 

In  this  section,  details  of  the  model-mean  validation  procedures  are  presented. 
The  first  subsection  provides  a  description  of  a  "data  equation"  representation  of 
smoothed  estimates  from  a  Kalman  filter/smoother .  The  other  two  subsections  describe  al¬ 
gorithms  for  data  processing  and  capability  analysis  which  are  based  on  the  data  equation. 

A.  THE  DATA  EQUATION 

Efficient  computation  of  all  test  statistics  and  probabilities  described  in  this 
section  is  based  on  a  single  equation,  called  the  data  equation,  for  each  test  result. 

The  data  equation  defines  the  explicit  relationship"T»etween  smoothed  estimates  calculated 
by  a  Kalman  filter/smoother  algorithm  and  e,  the  initialization  vector  of  the  dynamic 
model  state.  Figure  2  illustrates  the  relationship  between  the  data  equation  and  the 
model  structure  of  Fig.  1.  The  smoothed  estimate  of  which  appears  on  the  left-hand 

side  of  the  data  equation  can  be  interpreted  as  a  noisy  measurement  of  c.  The  response 
matrix,  D,  describes  the  cross-coupling  which  exists  between  initialization  errors  and 
estimates,  and  the  noise  term,  e*.  represents  the  accumulated  effect  of  the  ass -aed  zero- 
mean  dynamic  system  and  measurement  system  noise  processes,  {w£}  and  { v} . 
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Figure  2  Data  Equation  Description  of  Complete  System  Structure 

AM  of  the  data  processing  procedures  which  are  used  for  validating  statistical 

parameter*  (IQ  and  b>  of  a  are  based  on  using  only  0,  and  R  as  input  data,  it  can 

ha  shown  that*  in  fact ,  1*  It  i  sufficient  statistic  for  2  and  h.  That  is.  one  could 
not  infer  any  more  shout  the  pr obab i u t ydens i i y  rune lion  8f  j  by  any  type  of  processing 

on  the  original  data  sequence,  iz^l ,  than  by  correctly  processing  §*  alone.  This  com¬ 
pression  of  the  thousands  of  individual  ateasurements  contained  in  typical  data  sequences 
into  a  single  vector  estimate  makes  the  validation  procedures  described  in  this  chapter 
attractive. 

In  the  following  paragraphs,  the  date  equation  for  each  test  k  Ck  a  1 . Si) 

is  described.  The  superscript  k  is  suppressed  t-4  avoid  unnecessary  notations!  complexity, 

but  ft  should  be  remembered  that,  in  general,  different  data  eot-Uon  matrices  <0k,  Pk) 
result  for  each  test. 

1.  l^jJSaibLlA^ 

Let  the  initialization  vector  in  Fig.  2  be  rewritten 

£  *  b  ♦  tr  (1) 

where 

b  =.E  If}  the  actual  initial  mean,  ox  1 

£r  -  «  (0,  t0) 

Where  XQ  is  assumed  known.  The  data  equation  in  Fig.  2  esa  be  rewritten 


-  0  !»  ♦  S 


(2) 
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where 


Cov  (e)  =  D  1Q  DT  +  Cov  (*')  (3) 

and  we  define 

B  =  Cov  (e)  =  Cov  (R8)  (4) 

—  — o 

For  an  optimal  filter  /smoother , 

Cov  (x  )  =  Cov  (x8  +■  x®) 

— o  — o  — o 

or  (5) 

2  =  R  +  Ps 

o 

where  Ps  is  the  error  covariance  of  the  fixed-point  smoothed  estimate  of  x.  Furthermore, 
for  an  optimal  filter/smoother,  and  for  t  =  x^, 

£  [*|  =T]  =  £  [<■>'♦£■>  iT].'»£o 

*5>t]  = » 

Therefore,  provided  the  inverse  exists, 

D  =  R  i”1  (6) 

C 

So,  given  the  a  priori  covariance,  IQ,  and  the  Kalman  smoother  error  covariance,  P  ,  one 

can  ideally  use  Eqs.  (5,  6)  to  find  matri^s  R  and  D  necessary  to  define  the  data  equa¬ 
tion,  Eq.  (2). 


Unfortunately,  for  many  sys»  models  of  interest,  observability  of  the  various 
components  of  xQ  varies  widely.  The  effect  is  that  R  is  a  very  poorly-conditioned  matrix 

and  computation  of  R  via 

R  =  20  PS  (7) 

is  very  inaccurate.  The  inaccuracy  is  due  both  to  the  finite  word  length  effect  on  the 

subtraction  and  to  round-off  error  during  computation  of  P8  in  typical  filter/smoother 
algorithms.  This  drawback,  combined  with  the  requirement  for  inverting  2Q  in  Eq.  (6), 

ha3  me Civ® tad  the  use  of  recursive  equations  for  the  direct  computation  of  the  D  and  R 
matrices.  These  equations  are  presented  in  the  appendix  along  with  an  outline  of  their 
derivation. 


2 •  The  Normalized  Data  Equation 

Although  data  processing  procedures  based  on  statistical  hypothesis  tests  could 
be  developed  directly  from  Eq.  (2),  there  are  two  practical  < easons  for  not  doing  this. 

First,  test  statistics  based  on  R8  in  Eq.  (2)  would  not  be  distributed  according  to  stan- 

dard  probability  distributions  for  which  efficient  computational  procedures  already  exist. 
Second,  as  mentioned  earlier,  limited  model  observability  results  in  a  poorly-conditioned 

R-matrix,  i.e,,  a  high  degree  of  linear  dependency  among  the  components  of  R^.  Both  of 

these  difficulties  can  be  overcome  by  a  normalization  procedure  which  will  result  in  a 
reduced  number  of  normalized  estimates,  z. ,  which  will  be  pairwise  uncorrelated.  Test 

1  2 

statistics  based  on  these  normalized  estimates  will  be  central-  or  non-central  x  random 
variables  under  the  various  hypotheses  considered  in  the  next  section. 

To  obtain  the  normalized  data  equation,  Eq.  (2)  is  premultiplied  by  a  matrix,  M, 
chosen  so  that  the  resulting  noise  term,  Me,  will  ha”e  identity  covariance.  If  we  let 


R  =  Rli  (R^)  (8) 

where  is  any  positive  semi-definite  square-root  of  R,  then  M  can  be  computed  from 


t9> 


Because  of  the  ill-conditioned  nature  of  typical  R  matrices,  M  will  usually  be  a  pseudo¬ 
inverse  of  bS,  One  procedure  which  has  proved  satisfactory  for  computing  M  is  based  on 
use  of  a  singular-value-dec^mposition  algorithm  to  obtain  matrices  U  and  S  such  that 

R  =  U  S  UT  (10) 


where 
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S  =  Diag  (s^,  s2,...,sn}  (11) 

T 

and  U  is  an  n*n  matrix  of  eigenvectors  of  R  K.  The  transformation,  M,  can  therefore  be 
computed  using  the  formula 

M  =  (sV  UT  (12) 

where  (  )”  denotes  a  pseudo-inverse  matrix  and 


1 

■fsl  1 

0 

(sV  = 

0  /  X 

mx(n-m) 

0 

*  1 

is  a  rank-m  pseudo- inverse  cf  S^.  Multiplying  Eq.  (2)  on  the  left  by  M  yields  the  "ol- 
lowing  normalized  data  equation  for  the  bias  problem, 

z  =  H  b  +  v  (14) 


where 


r  =  Hr 
—  — o 


(15) 


is  an  m  x  i  vector  of  normalized  estimates  and 

H  =  M  D  (16) 

is  an  m  *  n  normalized  response  matrix.  The  covariance  of  the  noise  term  in  Eq.  (14)  is 


Cov(v)  =  Cov  (Me)  =  M  8  H  1  v„  (17) 

—  —  to*m 

as  desired. 

B.  DATA  PROCESSING  PROCEDURES 

Data  processing  for  model-mean  validation  consist)  of  a  four-step  procedure  de¬ 
signed  co  "evaluate"  consistency.  The  four  steps  are:  model  acceptance,  error  detec¬ 
tion,  error  isolation,  and  parameter  estimation.  Each  of  the  steps  is  based  on  quanti¬ 
ties  derived  from  the  cumulative,  normalized  data  equation 

z  =  H  b  ♦  v  (18) 

where  e,  H,  and  v  now  represent  collections  of  elements  from  single-test  data  equations. 


(19) 


1 .  Hypothesis  Testing  Procedures 

Model  acccnt<n..c  is  designed  lo  test  the  validity  of  the  normalization  process. 

If  some  of  tfie "model  matrices  (♦»,  Ht>  Qt ,  Rvl)  do  not  accurately  model  the  system  which 

generated  the  data  processed  by  the  Kalman  filter,  then  each  v1*  in  Eq.  (19)  may  not  have 
identity  covariance.  This  would  cause  subsequent  st»tistical~hypotheaie  tests  to  be  unre- 

liable  because  of  deviations  from  the  assumed  x  (central  or  non-central)  distributions. 
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The  hypothesis  to  be  tested  is 
HAt  v  ~  N  (0,1) 


(20 


The  test  statistic  to  be  used  is 

Aa  =  (It  •  H  b»|2  *21) 

*» 

where  b  is  the  least 'squares  (also  maximum- likelihood  under  1<A)  estimate  of  b  based  on 


Eq.  (18).  The  hypothesis  test  is 
Re,‘ect 

AA  *  kA 
Accept 


(22) 
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where  the  threshold,  A.,  is  determined  from  a  specified  level-of-significance,  a,  such 
that  A 


Pr  {  Aa  >  XA  |  Ha  is  true  }  =  a 


(23) 


The  threshold,  AA,  is  determined  from  Eq.  (23)  using  the  fact  that  if  HA  is  true,  then 


A 


(24) 


where  degrees  of  freedom  p  =  m  -  Rank  H  and  m  is  the  dimension  of  z.  Rejection  of  HA 

means  that,  with  high  confidence,  there  exists  a  modeling  error  other  than  b  /  0,  and 
the  succeeding  analysis  procedures  may  yield  misleading  results.  Non-rejection  of  HA, 

of  course,  does  not  preclude  the  possibility  of  other  modeling  errors.  It  does  imply 
that  if  such  errors  exist,  they  have  not  caused  z  to  deviate  significantly  from  its  base¬ 
line  statistical  distribution. 


Error  detection  is  designed  to  detect  the  presence  of  an  error  of  the  type  con¬ 
sidered,  a  non-zero  mean  in  this  case.  The  hypothesis  to  be  tested  is  simply: 

Hp  :  b  *  0  (25) 


and  the  test  statistic  used  is  derived  from  the  same  least-squares  solution  used  in  the 
model  acceptance  test, 

Ad  =  | |  H  b  | |2  (26) 


The  bias  detection  hypothesis  test  becomes 


Reject 

> 

< 

Accept 


(2?) 


where  the  threshold,  A^,  is  again  based  on  a  specified  level-of-significance,  o, 

Pr  {  AD  >  Ajj  |  Hjj  is  true  }  =  o  (28) 


Under  hypothesis  Kp, 


A 


D 


P  -  Rank  H 


(29) 


2 

enabling  calculation  of  the  threshold,  Ay,  from  the  central  x  distribution, 
like  the  model  acceptance  test,  is  a  significance  test;  rejection  of  Hq  means 

high  confidence,  a  non-zero  bias  exists  in  the  system. 


This  test, 
that,  with 


Figure  3  is  helpful  in  interpreting  detection  and  Isolation  test  results.  The 
triangle  shows  the  relationship  between  the  data,  t,  its  projection  on  the  linear  space 
spanned  by  possible  bias  vectors  which  has  length  A^  (called  "explained  sum-of-squares" 

or  ESS),  and  the  residual  vector  which  baa  length  AA  (called  "residual  sum-of-squares" 

or  KSS).  For  the  isolation  tests  discussed  below,  the  projection  of  t  on  a  subspace  of 
that  spanned  by  all  possible  biases  is  considered. 


Another  interpretation  of  A^  is  as  a  Generalized  likelihood  Ratio  (GUI)  which 
could  be  used  as  a  CLR  test  statistic  to  select  one  of  the  two  alternative  hypotheses: 


Hjj  :  b  =  0  X2  Hj,  :  b  /  0. 


Figure  3  Decomposition  of  Data  Into  Explained  and  Residual  Portions 
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The  third  and  fourth  steps  of  the  consistency  evaluation  are  done  simultaneously 
following  detection  of  a  modeling  error.  In  the  error  isolation  step,  the  analyst  at¬ 
tempts  to  isolate  the  error  to  a  subset  of  the  components  of  b.  The  following  notation 
is  used  to  describe  the  isolation  procedure: 

B(J)  s  {  b  :  nxl  vector,  bj  arbitrary  if  jsJ,  bj  =  0  if  j  j  J  }  (30) 

where 

J  =  U  i  Si  <  J2  '*•  <  jq  -  n} 

is  an  index  set  with  q  <  n  elements.  Selection  of  the  index  sets,  J,  could  be  based  on 
engineering  judgment  or  on  automatic  set  selection  procedures. 

The  hypothesis  to  be  tested  for  each  J  can  be  written 

Hj  :  b  e  8(J)  (31) 

and  an  appropriate  test  statistic  is 

Aj  =  AD  -  II  H  bj  ||2  (32) 

where  bj  is  the  least-squares  (and  maximum  likelihood)  solution  of  the  over-determined 
set  of  equations 

Hj  bj  S  *  (33) 

In  Hj,  columns  of  H  whose  indices  are  not  in  J  are  replaced  by  zeros. 

The  form  of  the  isolation  test  is 
Reject 

Aj  >  Aj  (34) 

Accept 

where  the  threshold,  Aj,  is  determined  from 

Pr  l  Aj  >  Aj  |  Hj  is  true  )  *  o  (35) 

Under  hypothesis  Hj,  It  can  be  shown  that 

Aj  ~  Xp  p  »  Rank  H  -  Rank  Hj  (36) 

2 

so  that  the  threshold  computation  is  again  based  on  a  central  x  distrubtion. 

Each  isolation  test  statistic,  Aj,  can  be  interpreted  as  a  GLR.  fhe  two  alter¬ 
native  hypotheses  for  each  J  are: 

Hy  :  b  unconstrained 

Hj  :  b  e  B(J> 


The  fourth  component  of  the  evaluation  is  the  estimate,  bj,  generated  for  each 
of  the  sets  tested.  Engineering  judgment  is  of  utmost  importance  in  the  interpretation 
of  the  (Aj,  |>j)  pairs.  Aj  is  a  measure  of  how  much  of  the  ESS,  A^,  la  explained  by 
biases  in  B(J).  A  perfect  score,  Aj  =  0,  would  Indicate  "certain”  isolation  of  the  bias. 

In  actual  data  processing,  random  effects  are  aliased  by  various  bias  sources:  different 
bias  sources  may  have  similar  signatures  in  the  data  space.  As  a  result,  the  analyst 
may  "accept"  (i.e.,  fail  to  reject)  two  or  more  sets,  B(J),  as  explaining  the  data.  Se¬ 
lection  of  the  most  likely  bias  candidates  from  among  those  accepted  is  based  on  statis¬ 
tical  scoring  techniques  and,  most  Importantly,  good  engineering  judgment. 


Output  information  from  a  computer  program  which  implements  the  hypothesis  tests 
must  Include  test  statistics  and  thresholds  for  each  of  the  tests  along  with  the  least- 


squares  estimates, 


for  each  alternative  index  set,  J.  A  sophisticated  isolation 


algorithm  should  also  include  a  ranking  system  tor  determining  which  of  the  alternatives 
arc  the  best  candidates  for  explaining  the  data.  A  discussion  of  criteria  for  ranking 
the  various  alternatives  is  beyond  the  scope  of  this  chapter  (21* 


12-36 


C.  CAPABILITY  ANALYSIS 

In  this  subsection,  procedures  for  calculating  probabilities  associated  with  the 
detection  and  isolation  tests  described  above  are  presented.  These  probabilities  are  of 
great  importance  to  the  system  analyst.  First ,  during  the  planning  stage  of  system  tests 
and  test  instrumentation,  it  is  important  to  learn  as  much  as  possible  about  the  observa¬ 
bility  of  various  system  errors,  if  they  in  fact  exist,  under  the  proposed  test  program. 
Second ,  capability  analysis  based  on  these  probabilities  provides  a  guide  for  planning 
and  interpreting  the  data  processing  procedures  and  results. 

1 .  Scaling  the  Bias  Errors 


Capability  analysis  results  are  based  on  the  probability  density  functions  of 
the  test  statistics.  These  densities  are  completely  determined  by  the  original  model 
and  specfication  of  model  errors  which  are  assumed  to  exist.  The  procedure  to  be  used 
to  specify  error  sources  considered  in  capability  analysis  is  to  select  errors  which  are 
"significant"  according  to  a  well-defined  system  performance  criterion.  This  is  called 
"scaling  the  error  sources." 

For  model  mean  validation,  system  errors  are  non-zero  components  in  the  initial¬ 
ization  error  bias  vector,  b.  The  scaling  criterion  has  the  form: 


where 


||b||E2  =  bTEb 


(37) 


(38) 


N  =  number  of  tests  from  which  data  are  to  be  processed, 


and 


Jk  =  *t~  s*n  m) 

is  a  gradient  matrix  defining  the  sensitivity  of  a  sxl  output  vector,  r,  to  the  initiali¬ 
zation  error  vector,  6. 

2 .  Evaluating  Detection  and  Isolation  Probabilities 

Probabilities  associated  with  the  detection  and  isolation  tests  are  calculated 
from  the  known  probability  density  functions  for  test  statistics  A^  and  Aj.  Let  by  rep¬ 
resent  a  hypothesized  true  bins  in  the  a  priori  distribution  of  t.  Then  the  cumulative 
date  vector,  z,  in  Eq.  (18)  would  be 

z  a  Hby  ♦  v  (AO ) 

Therefore  z  would  be  distributed  normally  with  mean  Hby  and  covariance  1.  denoted  z  ~ 
N(Hby,  1).  Let  Che  least -squares  (and  maximum  likelihood)  estimate  of  by.  be  written 

b  o  H*  a  (41) 

where  H~  represents  a  pseudo- inverse  of  H.  Therefore,  the  detection  test  statistic  de¬ 
fined  in  Eq.  (26)  can  be  written 

ad  =  *T  [ll*T  <hTm>  h’]s  (42) 

Since  the  bracketed  matrix  is  idempotent,  and  z  is  normal  with  identity  covariance,  AQ 
la  a  non-central  random  variable,  denoted 

A0  -  x2  <P,«2> 


where 


p  =  degrees  of  freedom  =  Rank  H 

2  2 
d  =  non-centrality  parameter  =  IIHbyll 

The  probability  of  detecting  the  hypothesized  bias,  by,  is  given  by 
P0(bT)  a  Pr  (x2(p.d2>  >  A„) 


(43) 
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where  the  threshold,  An,  satisfies  Eq.  (28).  This  probability  can  be  evaluated  using 

u  2 

well-known  procedures  for  non-central  x  densities. 

For  the  error  probabilities  associated  with  isolation  tests,  the  situation  is 
slightly  more  complex,  but  again  results  in  evaluating  probabilities  from  a  non-central 
2 

X  density  function.  The  constrained  estimate,  bj,  can  be  denoted 

bj  =  H]  z  (44) 

Combining  Eq.  (32)  with  Eqs.  (42),  (44)  we  have 

Aj  s  zT  (H'T  (HTH)  H*  -  Hj'T  (HTH)  Hj]  z  (45) 

The  bracketed  matrix  is  again  idempotent  so  that 

Aj~X2(p'.«5)  (46) 


where 


and 


p*  =  rank  H  -  rank  Hj 
«j  “  Ild-HjHj)  HbT||2 


(47) 


For  a  specified  index  set  and  hypothesized  true  bias  bp,  the  probability  of  not  rejecting 
(i.e.,  accepting)  set  J  as  possibly  containing  the  bias  is 


Pj(bp)  *  Pr  {x2(p',6j>  <  Aj) 


(48) 


where  the  threshold,  Aj,  satisfies  Eq.  (35).  If  bp  is,  in  fact,  contained  in  set  B(J), 
then  a2  =  0,  and 

Pj  (bT)  =  1  -  «  (49) 


where  a  is  the  level  of  significance  used  to  select  the  threshold,  Aj. 

Although  Aj.  and  Aj  can  be  interpreted  as  generalized  likelihood  ratios,  they 
were  developed  from  a  geometrical  viewpoint.*  Aj  can  be  interpreted  as  the  square  of 

the  distance  from  the  point  (Hbp  +  v)  in  £-space  to  the  set  HB(J)  defined  by 

H8(  J)  »  (  t  :  C  ■  Hbj  .  bj  e  B(J)]  (50) 

Then,  Pj(bp)  can  be  Interpreted  as  the  probability  that  this  distance  is  less  than  a 
threshold  distance,  Aj,  which  defines  an  "acceptance  region"  around  HB(J). 

3 .  Output  Information  from  Capability  Analysis 

For  capability  analysis,  biases  bp  which  satisfy 

I lbTl |g  «  d  (51) 

for  a  specified  error  magnitude,  d,  are  considered.  There  are  many  ways  in  which  proba¬ 
bilities  associated  with  specified  alternative  sets,  B(J),  can  be  calculated  and  pre¬ 
sented.  One  which  has  been  successfully  implemented  is  to  consider  true  biases,  bp, 

which  are  in  ths  set 

B(l,d)  a  {bj.  s  bj.  eB(l)  and  ||bp|lE  =  d]  (52) 

where  I  Is  a  specified  index  set.  Than  maxima  and  minimum  pcobabllltiaa 

pmax,min^J,lt)  18  “ln  pj  (53> 

bT  tB( i ,d) 

can  ba  computed;  they  define  the  range  of  probabilities  of  accepting  eat  B(J)  aa  explain¬ 
ing  a  biaa  cauaad  by  b.cB(X,d) .  Exaaplaa  of  reaulta  of  this  type  era  presented  in  Sac- 
cion  III.  “T 


*beveloped  by  J.fc.  Sacks 
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D.  SOFTWARE  SYSTEM  IMPLEMENTATION 

This  section  describes  an  efficient  software  system  implementing  the  procedures 
presented  here.  Extension  of  the  procedures  to  multiple-phase  models  is  discussed  first 
since  this  extension  has  a  significant  impact  on  the  software  design.  The  remaining  sub¬ 
sections  summarize  major  design  considerations  and  organization  of  the  complete  software 
system. 

1.  Multiple-Phase  Models 

The  system  model  considered  in  the  previous  section  is  specified  by  the  system 
and  measurement  model  equations  in  Fig.  1.  For  some  systems,  hovever,  a  complete  model 
of  a  complex  system  can  be  most  efficiently  represented  by  two  or  more  sets  of  system 
and  model  equations,  each  set  representing  one  "phase"  of  system  operation.  The  relation¬ 
ships  between  variables  describing  system  operation  during  two  adjacent  phases  is  defined 
at  a  specific  value  of  the  independent  variable,  called  the  interface  time,  by  the  fol¬ 
lowing  equation: 


-jo  =  Aj.j-1  *j-l  ^j-l*  +  Ajo  -j 

where 


(54) 


XjQ  =  initial  condition  of  phase  j 
Xj_^  (Tj _^ )  =  state  of  phase  j-1  at  interface  time,  Tj_^ 

Aj  j-1’  Ajo  are  known  transformation  matrices 
e j  ~  N  (0,  =  the  initialization  error  for  phase  j. 

Equation  (54)  provides  the  motivation  for  distinguishing  between  e  and  x  in  the  previous 
section;  the  statistical  parameters  of  XjQ  vary  from  one  test  to  another  because  of  varia¬ 
tions  in  Aj  and/or  the  distribution  of  Xj_^  (Tj_^)  but  £j  ®odeled  by  a  probability 
distribution  which  can  reasonably  be  assumed  to  be  the  same  for  all  tests. 


Distinct  measurement  systems  will  generally  be  used  during  distinct  phases  and 
the  resulting  measurements  processed  in  distinct  filter/amoother  runs  resulting  in  a  col¬ 
lection  of  smoothed  estimates,  ,  j  =  1 . Np,  where  Np  is  the  number  of  phases  for 

which  estimates  are  available.  Therefore,  Np  data  equations  must  be  combined  to  form  a 

single,  normalized  data  equation  for  the  entire  test.  Another  important  difference  in 
the  multiple-phase  case  is  that  in  order  to  account  correctly  for  correlations  which  must 
exist  between  estimates  of  initial  conditions  in  different  phases,  smoothed  estimates 
(along  with  corresponding  D  and  R  matrix  elements)  of  a  subset  of  the  state  vector  of  the 
earlier  phase  at  interface  time  must  be  Included  in  the  data  equation. 


For  a  two-phase  model,  the  composite  data  equation  (before  normalization)  is 


(55) 


(56) 


and  xj  Is  the  subset  of  the  phase-one  state  vector  at  the  interface  time  corresponding  to 
non-zero  columns  of  Aj  j.  The  composite  covariance  matrix  is 


V1" 

“  R1 

(symmetric) 

R*2 

t>2A2,lRi 

R2 

(57) 


=  Cov  (a*1,  g*1)  . 


where 


(58) 
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Since 


Rj  is  a  sub-matrix  of  the  R^ -matrix 
in  Eq.  (56). 


(59) 


for  phase  1  based  on  the  augmented  estimate  defined 


The  fact  that  the  smoothed  estimate  of  x^  provides  critical  information  is  not 

surprising.  In  fact,  it  can  be  shown  (9]  that  in  order  to  obtain  sufficient  statistics 
for  estimating  statistical  parameters  in  a  two-phase  system,  a  third  estimate  (in  addi¬ 
tion  to  R^,  j  =  1,2)  is  necessary.  In  (9],  it  is  shown  that  (R^8* ,  R8*',  x^)  are  suffi¬ 
cient  statistics  for  the  parameter  estimation  and  validation  problem  of  interest  in  this 
chapter. 


2.  Important  Design  Considerations 


The  procedures  described  in  Section  II  are  based  on  the  normalized  data  equation 
elements  (z^,  H^)  for  each  test  result.  The  extent  to  which  calculations  of  (z^,  H^) 

should  be  formalized  depends  on  several  factors.  Some  of  the  most  important  are: 

i)  Size  (i.e.,  number  of  states)  and  observability  of  the  models 

ii)  Number  of  phases  in  the  model 

iii)  Number  of  tests  available  for  analysis 

For  highly  observable  (i.e.,  well-conditioned  R)  single-phase  models  and  a  small  number 
of  tests  (say  up  to  10),  little  formal  data  organization  is  necessary.  Data  equation 
matrices  (D,R)  could  be  computed  from  closed- form  formulas  (Eqs.  (5)  and  (6))  or  from 
the  recursive  formulas  (given  in  the  appendix)  implemented  concurrently  with  the  filter/ 
smoother  equations.  One  additional  program  would  be  required  to  compute  and  store  nor¬ 
malized  data  equation  elements,  (3^,  H^).  Subsequent  analysis  (data  processing  or  capa¬ 
bility  analysis)  could  then  be  performed  by  specially-developed  programs,  possibly  using 
subroutines  from  standard  regression  analysis  packages.  For  more  complex  cases  however, 
especially  when  multiple-phase  models  are  involved,  a  more  formal  approach  is  essential 
to  obtain  the  maximum  benefit  from  the  analysis  techniques. 

The  organization  of  the  software  system  presented  in  Fig.  4  includes  four  dis¬ 
tinct  program  segments  and  two  data  base  segments.  The  first  of  the  program  segments, 

The  Interface  Program,  is  optional  depending  on  the  choice  of  method  for  computing  D  and 
R  matrices.  If  closed- form  formulas  are  used,  or  if  the  recursive  equations  are  inte¬ 
grated  with  the  filter/smoother,  then  resulting  (D,R)  matrices  can  be  written  directly 
to  the  Input  Data  Base.  Alternatively,  the  Interface  Program,  which  implements  the  re¬ 
cursive  equations  for  D  and  R,  can  read  a  file  stored  by  the  f llter/smoothcr  which  com¬ 
pletely  describes  the  model  used  in  the  data  processing.  The  minimum  set  of  matrices 
which  must  be  stored  in  that  file  is  shown  in  Fig.  4.  The  interface  program  must  also 
compute  and  store  the  cumulative  state  transition  matrix  for  phase  j  of  test  k. 


•  HIM 


Figure  4  Software  System  Organisation 

The  second  program  segment,  Data  Normalization,  performs  two  functions.  First, 
for  multiple-phase  models,  it  assembles  the  composite  data  equation  matrices  and  the  er¬ 
ror  response  matrix,  J^,  for  each  test,  k  =  1,  .  .  .,  N.  Second,  for  all  models,  it  nor¬ 
malizes  the  composite  data  equation  elements  and  stores  the  resulting  (z^,  H^)  along  with 
on  the  Computational  Data  Base  (CDB).  All  subsequent  analyses  deal  only  with  data  on 
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the  CDB.  Calculations  performed  by  the  Interface  and  Phase  Combination  Programs  are  rela¬ 
tively  expensive  but  are  performed  only  once  for  each  phase  of  each  test.  Therefore,  re¬ 
peated  calculations  using  the  last  two  program  segments  can  be  performed  at  relatively 
low  cost.  The  CDB  contains  sufficient  statistics  for  the  model  validation  problem. 

The  Data  Processing  and  Capability  Analysis  Programs  contain  implementations  of 
the  statistical  hypothesis  tests  and  probability  computations  discussed  in  Section  II. 
These  two  segments  share  many  subroutines  and  are  complementary  in  that  one,  Capability 
Analysis,  assesses  the  capability  of  the  other.  Data  Processing,  to  solve  the  problem. 

User  inputs  and  possible  outputs  for  these  two  segments  are  summarized  in  Fig.  5.  Re¬ 
quired  inputs  for  the  Capability  Analysis  Program  do  not  include  normalized  data,  z^i 

that  is,  all  of  the  computations  necessary  to  perform  capability  analysis  can  be  done 
before  data  are  actually  collected.  This  program  can  therefore  be  used  as  a  powerful 
observability  analysis  tool  in  the  evaluation  of  proposed  test  programs.  In  the  Capa¬ 
bility  Analysis  Program,  details  of  how  the  hypothesized  true  biases,  b^.,  are  to  be  se¬ 
lected  are  not  indicated.  One  method  was  discussed  in  Section  II  B;  example  results  are 
included  in  the  next  section. 
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DATA  PROCESSING 
PROGRAM 

CAPABILITY  ANALYSIS 
PROGRAM 

USER 

INPUTS 

•  FOR  TESTS  TO  BE 

ANALYZED 

S  LEVEL  OF  SIGNIFICANCE.  a 

•  ALTERNATIVE  INDEX 

SETSjJ,.  i  =  1.  2....} 

S  ^  FOR  TESTS  TO  BE 

ANALYZED 

0  LEVELOFSIGNIFICANCE.  a 

S  ALTERNATIVE  INDEX 

SETSjJ,.  i  «  1.  2....} 

S  HYPOTHESIZED  TRUE 

BIASES,  |I>T(,  |  »  1.2..,,} 

typical 

OUTPUT 

OATA 

O  HYPOTHESIS  TEST 

RESULTS  teccsplsncs. 
detection,  Isotedon) 

S  RANKED  LIST  OF 

ALTERNATIVE  SETS 

JJ,I  -  1.2....} 

S  DETECTION  PROBABILITIES 

S  ISOLATION  PROBABILITIES 
«VDt> 

Figure  5  Inputs  and  Outputs  for  Analysis  Programs 
III.  KXAMPLEi  INITIAL  MEAN  VALIDATION  IN  A  TVO-PHASB  SYSTEM 

This  example  Is  based  on  a  simple  two-phase  model  of  a  ballistic  missile  system. 
The  dynamic  equations  represent  error  models  of  Inertial  Navigation  Systems  (INSs)  which 
might  be  used  in  a  mobile  missile  launcher  (prelaunch)  and  onboard  a  missile  (postlaunch). 
Numerical  values  used  are  for  Illustration  only  and  are  not  based  on  any  specific  weapon 
system.  The  parameters  to  be  validated  are  the  components  of  the  assumed  zero  mean  of 
the  state  initialisation  vector  for  each  phase 

kj  «  E  (Cjl  *  0j.  J  a  1,2.  <60) 

A.  BASELINE  MODELS 
1 .  Pre launch  (Phase  1)  Model 

The  14  states  In  this  model  are  summarized  in  Table  I  along  with  the  one-sigma 
uncertainty  of  the  initialisation  vector  and  time  constants  associated  with  the  Narkov 
error  models.  This  model  represents  errors  in  a  local-level,  velocity-damped  (damping 
constant  =  1.0),  inertial  navigation  ayatem  moving  in  a  path  tangential  to  the  earth's 
•urface  at  velocltlas  small  relative  to  earth  rotation  |10,11|.  Three  repreaentative 
error  sourcas  are  modeled:  gyro  drift  rates,  accsiarometer  sensing  errors  due  to  deflec¬ 
tion  of  the  local  vertical,  and  velocity  reference  errors. 

Measurements  processed  by  s  Kalman  f liter/smoother  for  atate  estimation  in  phase 
1  are  to  he  generated  hy  subtracting  INS-lndlcated  position  from  an  externally-derived 
position  measurement.  Measurement  noise  la  assumed  to  be  100  ft  (oae-oigme)  in  each  hori¬ 
zontal  direction  (north,  east). 
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TABLE  I 

PHASE  1  MODEL  STATE  VECTOR  AND  INITIALIZATION  ERROR  MODEL 


STATE 

NUMBER 

VARIABLE  NAME 

oio=lNITIAL 

UNCERTAINTY 

MARKOV 

TIME  CONSTANT 

1 

Latitude  Error 

0.5  nmi 

2 

Longitude  Error 

0.5  nmi 

3 

North  Velocity  Error 

0.2  kts 

4 

East  Velocity  Error 

0.2  kts 

5 

North  Platform  Tilt 

0.3  min 

6 

East  Platform  Tilt 

0.3  min 

7 

Azimuth  Misalignment 

2 . 0  mTn 

8 

North  Gyro  Drift-Rate 

0.1  mln/hr 

18  hrs 

9 

East  Gyro  Drift-Rate 

0,1  mln/hr 

18  hrs 

10 

Vertical  Gyro  Drift-Rate 

0.1  mln/hr 

18  hrs 

li 

North  Vertical  Deflection 

0.25  min 

50  hrs 

12 

East  Vertical  Deflection 

0.25  min 

50  hrs 

13 

North  Velocity  Reference  Error 

0.5  kts 

(Bias  Error) 

14 

East  Velocity  Reference  Error 

0.5  kts 

(Bias  Error) 

2.  Postlaunch  (Phase  2)  Model 

Table  II  describes  the  17  states  used  to  represent  a  space-stable  inertial  navi¬ 
gation  system  with  orthogonal  axes  parallel  to  an  earth-centered  coordinate  system.  The 
table  also  includes  one-sigma  uncertainties  of  the  components  of  e*.  Two  representative 
error  sources  are  modeled;  accelerometer  bias  errors  due  to  miacaiibration  and  uncompen¬ 
sated  gyro  drift-rate  errors.  The  latter  include  biases  in  each  coordinate  axis  (three 
gyros  assumed)  and  two  terms  representing  thrust -dependent  bias  errors  in  the  x  and  z 
directions. 


TABLE  II 

PHASE  2  MODEL  STATE  VECTOR  AND  INITIALIZATION  ERROR  MODEL 


STATE 

NUMBER 


VARIABLE  NAME 


^INITIAL 

UNCERTAINTY 


1 

2 

3 

4 

5 

6 

7 

8 
9 


Poaiclon  Error 


INS 

Velocity  Error ^ Error* 


Misalignment 


SO  ft 
100  ft 
100  ft 
0.5  ft/scc 
0.5  ft/sec 
0.5  ft/sec 
0.5  mTn 

0,2  m7n 
0.2  mTn 


10 

•x  1 

\ 

0.02  ft/sec2 

11 

12 

•> 

*. 1 

Accelerometer  Errors 

Bias  Error 

0.02  ft/sec* 
0.02  ft/sec2 

13 

' 

. 

^Source 

8  min/hr 

14 

15 

*  i 

s 

** 

Gyro  Drift-Rate  I 

Modal* 

8  mTn /hr 

8  mln/hr 

16 

Cfx 

I  g-Sensitive  Gyro 

10  mln/hr/g 

17 

*ft 

(  Drift-Rate  ) 

10  ain/fcr/g 
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Position  measurements  for  phase  2  are  assumed  to  be  similar  to  those  in  Phase  1 
so  that  z  consists  of  measurements  of  INS  position  error  along  each  coordinate  axis  with 
additive  noise  which  has  a  one-sigma  value  of  30  ft. 

3.  Phase  Interface  Model 


The  following  transformation  matrices  define  a  simple  model  of  the  process  by 
which  the  Phase  2  INS  is  initialized  by  the  Phase  1  INS  at  interface  time  just  prior  to 
launch.  The  complete  initial  state  vector  for  Phase  2  can  be  written 

X20  —  >'2  A j  x^(t^)  —2  (hi) 


where  the  A  matrices  are  defined  as  follows. 

A^  defines  a  7x1  output  vector  from  Phase  1, 

Eli  =  A1-1^CP  (62> 

where 

A1  =  (A11  °7x7)  (63) 

Aii  =  Diag  (1,  cosL,  1,  1,  1,  1,  -1) 

and  L  =  latitude  at  launch. 

Next,  ^2  contains  the  necessary  scale  factors  and  coordinate  transformations  to  map 
into  the  Phase  2  INS  errors: 


A2i  i  _  .  i 

A2  5 

S  A22  j  .  . 

17x7 

;  a23 

°8x7 

where 


/  0 

0  \ 

/  0 

0  \ 

/  0 

0 

-1\ 

1-sinp 

COSp  I 

*  ^59  c  ^3  ( 

11  1  \  co** 

COJJ)  .  ■  I 

[  -sinP 

cosp 

0 

\  cosg 

sinp/ 

\  C08p 

sinP 

0/ 

In  these  matrices,  P  represents  the  launch  direction  (azimuth)  and  C^,  C2  are  position 
and  velocity  unit  conversion  factors.  The  17  *  1  vector,  tj,  in  Eq.  (61)  represents  ini¬ 
tialization  errors  in  the  Phase  2  INS  introduced  during  the  transfer  alignment  process 
and  Phase  2  INS  lnstrciment  errors  (biases).  These  errors  are  assumed  to  be  uncorrelated 
with  *i(ti>.  The  mean  value  of  Zj  is  to  be  validated. 

ft.  DESCRIPTION  OF  THE  TESTS 

For  this  example,  10  test  scenarios  were  simulated  using  standard  covariance 
analysis  procedures.  Parameters  which  summarize  the  test  scenarios  are  tabulated  in 
Table  III.  The  10  testa  describe  a  mix  found  in  a  typical  test  program,  some  yielding 
much  better  observability  of  system  error  sources  than  others.  No  one  mission  would  pro¬ 
vide  a  clear  test  of  the  correctness  of  the  zero-mean  assumption  for  the  model's  Initial¬ 
ization  vectors.  Only  by  processing  data  from  an  ensemble  of  missions  can  a  determina¬ 
tion  be  made  with  a  high  degree  of  confidence. 


TABLE  111 

PARAMETERS  OF  THE  TEST  SCENARIOS 


TEST 

NUMBERS 

MODEL  PARAMETERS 

BASELINE  MODEL 
PREDICTED 
PERFORMANCE 

CEP  (feet) 

Phase  1 

Phase  2 

LATITUDE 

L 

LAUNCH 
AZIMUTH  » 

LAUNCH 
ELEVATION  0 

RANGE 

R(nmi) 

1,4 

30* 

135* 

45* 

850 

8400 

2 

38* 

135* 

30* 

2400 

13000 

3 

so* 

135* 

65* 

1500 

14200 

5.8 

0* 

135* 

45* 

850 

8800 

6 

0* 

135* 

30* 

2400 

15200 

7 

0* 

135* 

65* 

1500 

14200 

9,10 

30* 

180* 

45* 

1900 

8900 
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C.  CAPABILITY  ANALYSIS  RESULTS  FOR  THE  EXAMPLE  SYSTEM 
1.  Bias  Detection  Capability 

Table  IV  contains  bias  detection  capability  results  for  several  cases.  Five 
different  hypothesized  "true"  bias  sources  are  considered.  Each  bias  is  identified  by 
the  state  variables  in  the  left  hand  column  which  indicate  that  the  corresponding  element 
of  or  e 2  biased  by  che  indicated  magnitude.  The  a  priori  sigmas  are  from  Tables  1 

and  II.  For  the  two  cases  which  contain  more  than  one  biased  component,  two  probabil¬ 
ities  are  printed  corresponding  to  the  worst  (pD(mir.)^  And  ^est  ^PD(max)^  cases  a^ong  all 

those  biases  satisfying  the  error  constraint,  f|b^.||^  =  d.  This  error  is  the  rms  miss- 

distance  across  the  ensemble  of  10  tests  which  would  be  caused  by  each  hypothesized  bias. 
The  two  values  of  d  for  which  results  are  given  in  the  tables  (1000  and  2000  ft)  are 
relatively  small  for  the  example  system  which  has  an  average  CEP  across  the  10  tests  of 
11,000  ft  (see  Table  III).  Thus,  the  probabilities  presented  correspond  to  a  system  bias 
which  would  be  difficult  to  detect  in  the  impact  domain  defined  by  down-range  and  cross¬ 
range  error  based  on  the  10-test  ensemble. 

TABLE  IV 


BIAS  DETECTION  CAPABILITY  RESULTS 


TRUTH  SET 
CONTAINING 
BIAS  AND 

A  PRIORI 
SlGKA 

ERROR 
MAGNITUDE 
d  (ft) 

P 

(a) 

D(MIN) 
NUMBER  OF 
TESTS 

BIAS 

AT 

PD(H1N) 

— - - 

p  (a) 

*D(MAX) 

NUMBER  OF 

TESTS 

BIAS 

AT 

PD(MAX) 

5 

10 

20 

5 

xo 

20 

{6Vn},0.2  kts 

1000 

1.0 

1.0 

Ml 

3.29(b) 

2000 

m 

1.0 

i 

6.58(b) 

(AH] ,2.0  mTn 

1000 

a 

o 

0.11 

O' 

o 

rg 

(ONLY 

ONE  VALUE  OF  SINGLE- 

t 

COMPONENT  BIAS  YIELDS 

2000 

0.11 

o.u 

0.12 

0.419{c) 

REQUIRED  d) 

1PO0 

0.18 

0.29 

0.53 

0.086 

2000 

0.52 

0.86<d> 

1.0 

0.172 

,4AV0(N,E)J 

1000 

0.14 

0.19 

0.29 

wm 

0.26 

0.44 

0.77 

Q.235 

0.259 

0.2b  min 

2000 

0.29 

0.52<d) 

0.86 

0.78 

0.99 

1.0 

0.471 

m 

0,517 

{a  .a„,a.) 

1000 

a 

a 

0.11 

-0.0020<c) 

0.14 

0.18 

0.27 

-0.0072 

x  y  t 

0 

0 

0.02  ft/sec2 

-0.0019 

0.0075 

2000 

0.11 

0.12 

0.14 

-0.0040<c) 

0.27 

0.48 

0.82 

-0.014 

0 

0 

, . 

-0.0038 

0.015 

NOTES t 

(a)  Baaed  on  thresholds  for  level -of-aignificance  a  =  0.10;  Table  entries  =  a  indicate 
no  detectability  of  the  hypothesized  bias 


(b)  bt  >>  a i  Implies  low  system  sensitivity  to 

(c)  bj  <  *>£  Implies  high  system  sensitivity  to 

(d)  Simulated  data  processing  results  for  these  cases  are  discussed  in  text. 


For  each  esse  <t.e.,  each  bias  source  and  each  d  value)  three  probabilities  ste 
given  corresponding  to  the  number  of  test  results  assumed  to  be  available  for  processing. 
The  five-test  case  assumes  that  only  testa  1*4  and  9  are  available  and  the  20-test  case 
assumes  that  each  of  the  10  tests  described  in  Table  lit  is  conducted  twice. 

The  various  esses  in  Table  IV  can  be  grouped  into  three  broad  categories.  For 
some  cases,  indicated  by  note  (b)  in  the  figure,  the  bias  magnitude  required  to  produce 
a  significant  d  is  far  greater  than  the  modeled  prior  uncertainty.  Such  s  bias,  if  it 
existed  in  the  system,  would  represent  s  severe  system  anomaly  and  would  be  easily  de¬ 
tected  by  the  data  processing  algorithm  if  the  associated  state  variable  is  observable 
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in  the  classical  control/estimation  theory  sense.  For  the  north-velocity  bias  case 
shown,  the  smoothed  estimates  of  the  system  initial  velocity  for  the  10-test  ensemble 
would  clearly  indicate  the  presence  of  the  bias;  the  nodel  validation  data  processing 
software  would  not  be  necessary. 

A  second  category  is  typified  by  the  cases  indicated  by  note  (c)  in  Table  XV. 
Here  the  system  performance  measure  is  extremely  sensitive  to  the  hypothesised  bias  so 
that,  for  the  {^H}  case  for  example,  a  Bias  of  only  one-tenth  of  the  a  priori  uncertainty 
would  cause  a  1000  ft  ms  error.  For  these  cases,  even  perfect  measurement  of  the  states 
of  interest  would  have  great  difficulty  in  distinguishing  between  the  unbiased  a  priori 
model  and  the  actual,  slightly  biased  density  function. 

The  third  category  includes  the  rest  of  the  cases  in  the  table  and  the  majority 
of  cases  in  practical  applications.  For  these  cases,  system  sensitivity  is  moderate, 
relative  to  the  d  of  interest,  and  the  unique  capability  of  the  data  processing  algorithm 
based  on  data  from  multiple  tests  is  of  maximum  advantage.  The  steadily  increasing  prob¬ 
abilities  as  the  numbers  of  tests  increases  is  clearly  evident. 


Bias  Isolation  Capability 


Table  V  contains  bias  isolation  capability  results.  Three  of  the  hypothesized 
bias  sets  for  which  detection  capability  results  were  presented  in  Table  IV  are  used  here 
The  probability  of  accepting  each  of  six  alternative  sets  is  computed  for  each  case.  As 
with  the  detection  capability  results,  for  multiple-component  hypothesized  biases,  worst- 
case  *od  best-case  probabilities  are  computed  for  each  case.  The  ef¬ 

fect  of  increasing  d  (from  2000  to  4000  ft)  or  the  number  of  tests  (from  10  to  20)  is 
shown  for  each  truth/altemative  pair. 


TABLE  V 

BIAS  ISOLATION  CAPABILITY  RESULTS 


molts ! 

{#)  Based  on  threthold*  for  level-sf-signifitsnce  o  ■  0.10;  Table  entries  *  l-«  «  lb*  de¬ 
signed  probability  that  M  alternative  *el  coatslniag  the  truth  he  accepted. 

(b)  Probability  O.fO  «  l-«  indicate*  estrone  difficulty  of  diotiaguiahina  between  thi* 
truth/ alternative  pair. 


A  general  feature  of  the  results  is  that  for  alternative  sets  which  do  not  con¬ 
tain  the  hypothesized  bias,  the  probabilities  of  acceptance  (i.e.,  the  probabilities  of 
mis-isolation)  do  not  vary  much  from  one  alternative  set  to  another.  This  result  is 
typical  of  mis-isolation  probabilities  which  have  been  computed  for  many  weapon  system 
models.  Exceptions  to  this  general  feature  do  occur  however;  one  such  case  is  indicated 
by  note  (b)  in  the  figure.  That  result  (mis- isolation  probability  -  1  «*)  means  that 
biases  in  the  verticil  deflection  errors  and  the  velocity  reference  errors  in  a  local  - 
level  Inertial  navigation  system  have  almost  identical  signatures  in  the  dats  space.  It 
can  be  shown  that  by  using  engineering  Judgment  in  interpreting  isolation  test  results, 
incorrect  acceptance  of  as  *  bias  source  can  almost  certainly  be  avoided. 

0.  DATA  PROCESSING  RESULTS 

In  order  to  illustrate  applications  of  dats  processing  algorithms  to  the  ex¬ 
ample  two -phase  system,  two  sequences  of  simulated,  normalized,  biased  dula,  JzS ,  were 
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generated.  Biases  simulated  in  the  two  sequences  corresponded  to  two  cases  considered  in 
the  previous  bias  capability  analysis  results,  each  of  which  caused  d  *  2000  ft  of  ras 
error: 

Case  I  bT  :  {bg  *  E  1*^(0)}  =  0.172  ain/hr} 

Case  II  ^  :  {bu  *  E  t«AvD(N)(0>  =  -0.245  ain, 
b12  =  E  f«AVJ)<E)(0)  =  0.250  £in} 

If  b 

For  each  case,  the  sequence  {z  }  was  made  up  of  12  sets  of  10  z  (representing  results 
of  each  of  the  10  test  scenarios). 

1.  Case  I:  Single-Component  Bias 

This  case  is  based  on  a  single-coaponent  bias  which  is  highly  detectable  (Pq  = 
0.86)  and  easily  isolated  (Pj  ~  0.13  for  J  not  including  e^).  In  10  of  the  12  trials, 

the  hypothesis,  b  =  0,  was  rejected  at  the  a  =  0.1  level,  an  experimental  detection  rate 
of  0.83.  Isolation  results  for  the  six  alternaive  sets  considered  earlier  (Table  V)  are 
suaaarized  in  Table  VI.  Acceptance  rates  of  incorrect  alternative  sets  for  this  saall 
number  of  trials  are  higher  than  predicted,  but  for  each  set,  two  of  the  false  acceptances 
occurred  in  the  trials  in  which  the  detection  test  failed  to  reject  the  b  =  0  hypothesis. 
Thus,  use  of  engineering  judgment  in  these  cases  would  help  to  avoid  an  incorrect  conclu¬ 
sion;  after  failure  of  a  detection  test,  the  analyst  should  not  conclude  that  a  bias 
exists  in  any  of  the  (falsely)  isolated  sets  without  a  careful  examination  of  the  bias 

estimates ,  bj ,  produced  for  each  set. 


TABU  VI 

BIAS  ISOLATION  RESULTS  FOR  CASE  1 


ALTERNATIVE 

SET  U) 

NUMBER  OF  TRIALS 
FOR  WHICH  SET  J 

WAS  ACCEPTED  (Nj) 

ACCEPTANCE 

RATE 

(Nj/12) 

PREDICTED 

PROBABILITY 

<Pj) 

(tjj.  tg,  tg) 

12 

1.0 

(l*e) 

lftAV0(H,E)1 

3 

0.24 

0.13 

taV«EF(8,E)} 

3 

0.25 

0.13 

4 

0.33 

0.13 

1?) 

3 

0.25 

0.12 

3 

0.25 

Oil 

2  •  Case  II:  TVo-Component  >1  aj 

This  case  is  based  on  a  two-component  bias  which  is  only  marginally  detectable 
<P0  =  0.52)  and  very  easily  mis-isolated  (Pj  -  0.50  for  several  sets  which  do  not  include 

the  biased  states).  The  hypothesis  "b  *  0"  was  rejected  at  the  a  *  0.1  level  for  only 
three  of  the  12  trials  for  this  case;~thia  0.25  detection  rate  was  poorer  than  the  pre¬ 
dicted  value  0.52  but  within  reason  because  of  the  small  sample  sice. 

IV.  SUMMARY  AMD  EXTENSIONS 

A  formal,  organized  approach  to  validation  of  the  statistical  parameters  of  ini¬ 
tialization  errors  in  linear  dynamic  system  models  used  by  Kalman  filter/smoothers  has 
been  presented.  The  approach  for  the  mean-value  (bias)  parameters  has  been  discussed  in 
detail;  a  similar  approach  which  has  been  developed  for  initial  covariance  matrix  vali¬ 
dation  is  discussed  briefly  In  the  following  paragraph,  the  hiss  validation  procedures 
include  statistical  hypothesis  tests  for  detecting  and  isolating  bias  errors  based  on 
data  processing  results  and  capability  analysis  formulas  for  calculating  probabilities 
of  detecting  and  isolating  (or  mis-isolating)  hypothesised  model  bias  errors.  All  of 
the  analysis  procedures  are  based  on  a  data  equation  representation  of  the  overall  sys¬ 
tem  which  transforms  at;  initialization  vector,  t,  into  e  smoothed  estimate  vector,  i* 
(Fig.  2).  “ 


Analysis  procedures  for  the  covariance  problem  are  also  based  oo  the  data  equa¬ 
tion  of  Fig.  2,  but  normalization  for  this  problem  is  done  with  respect  to  the  covariance 
of  e*  (process  and  measurement  noise  effects)  rather  than  with  respect  to  the  covariance 

of  l£.  Hypothesis  tests  for  covariance  matrix  validation  are  based  on  quadratic  forms 

in  normal  random  variables.  Another  difference  in  the  covariance  problem  is  the  way  in 
which  errors  are  scaled  during  capability  analysis.  Since  e  model  error  for  this  problem 
Is  of  the  form  A1  ,  the  effect  of  this  error  oo  the  system  output  vector  Is  e  change. 


12-46 


Aif ,  in  the  covariance  matrix  of  the  system's  output.  A  variety  of  scalar  measures  might 
be  used  as  no.js  on  A^;  one  which  would  be  appropriate  for  weapon  systems  such  as  the 
example  in  Section  III  is  the  change  in  CEP  due  to  &ZQ. 

The  procedures  have  potential  application  beyond  the  basic  model  validation 
problem.  For  example,  if  the  model  is  viewed  as  describing  an  ideal  system  design,  the 
procedures  can  be  used  as  system  parameter  estimation  tools.  Since  parameter  estimates 
are  generated  from  sufficient  statistics  for  all  the  data,  the  algorithms  provide  an 
efficient  method  for  compressing  and  storing  the  data  generated  by  an  ensemble  of  tests. 

For  situations  in  which  both  bias  and  covariance  errors  exist  in  the  system  model,  the 
algorithm  can  be  used  recursively  to  successively  adjust  estimates  in  the  data  equations 
as  improved  estimates  of  model  parameters  are  generated. 

APPENDIX  A:  RECURSIVE  CALCULATION  OF  DATA  EQUATION  MATRICES 

In  this  appendix,  recursive  equations  are  presented  which  can  be  used  to  compute 
matrices  (D,R)  associated  with  the  smoothed  estimate  data  equation  for  a  single-phase 
model  as  discussed  in  Section  II  C.  An  outline  of  the  derivation  is  also  presented. 

D,R  FOR  A  SINGLE-PHASE  MODEL 

Let  the  linear  discrete-time  system  model  be  represented  for  each  A  =  0,  1,  . ., 

£f  by 

*£+l  =  +  *  Cov  <Kg>  =  Q*  (A-D 

x^  =  c  ~  N  (0,  X0) 


— £+1  =  H£+l  *£<-1  +  -£+l  •  Cov  ^i+l*  =  ^v£+l 


(A-2) 


Then  matrices  D^ ,  R^  such  that 

=  °£  ^o  +  £=0,1,... 

Re  *  Cov  (x£)  =  i>4  Zo  DAT  +  Cov  (e^) 


(A-3) 

(A-4) 


where  k^  is  the  optimal  filteted  estimate  at  time  £,  can  be  computed  from  the  formulas 

D,.1  *  W  *1  D«  *  K,*1  H..1  w  <*-5> 


• 0 


and 


x+1 


~  ^£  R£  *£T  +  K£+l  ''£+1  K£+1T 


Ro  =  0 


where 


W  =  0)  *  h  V 1 


^i+l  =  H£+l  P£+l/£  H£+l  +  ^v£+l 


(A-6) 

(A- 7) 
(A-8) 


(K£+i  is  the  Kalman  gain  and  is  one-step  prediction  error  covariance.) 

Equations  \A-5),  (A-C)  can  be  derived  by  an  induction  procedure.  The  zero  ini¬ 
tial  condition*  are  a  result  of  the  filter  initialization 


R  =  E  fx  )  =  0 
— o  — o 


(A-9) 


The  induction  procedure  is  based  on  ''he  Kalman  filter  update  formulas  and  uses  the  ortho¬ 
gonality  property  of  an  optimal  filter 


where 


Cov  vf+1)  *  0 


^£♦1  =  -£+l  '  H£+l  *£ 


(A-10) 


(A- 11) 


is  the  innovation  at  time  £+1.  The  factor  in  Eq.  (A-5)  results  from  toe  following 

representation  of  the  state  at  £>1, 


-£+l  3 


M  So  4  Vi 


(A-12) 


where  w£+^  is  the  cumulative  effect  of  all  process  noise. 

Now  let  Eqs.  (A-l),  (A-2)  be  replaced  by  an  augmented  state  model: 


♦  a  xa  +  wa 
ft  =e 


(A-13) 


where 


■=■0 


**2+1  — £+1  +  —2+1 


A  filtered  estimate  of  this  augmented-state  system  is  therefore 

«:  =  l-‘\ 


where  ft®  is  the  optimal  smoothed  estimate  of  x0. 

The  data  equation  for  the  smoothed  estimate  is 


(A- 14) 


(A-15) 


(A-16) 


-I  5  D1  +  -1  .  2  =  0,1, 


where 


ns  -  n8 

°2+l  ~  u2 

Ds  s  0 
o 


’Ki+1  H2  +  l  ^2  D2 


Corresponding  covariance  matrices,  are 
Rj  =  Cov  <*|)  ,  Rj  =  Cov  (Ra,  R|) 
which  satisfy  recursions 


(A-17) 


(A- 18) 


(A-19) 


*2+1  =  ♦*  Rl  *  *2+1  *2+1  *2+1 

R0  °  0 


(A-20) 


Rl+1  "  *1  *  *2+1  *2+1  *2%1 

R*  m  0 
o 


(A-21) 


Equations  (A-18)  -  (A-21)  are  derived  directly  from  Eqs.  (A-5),  (A-6)  by  replacing  each 
matrix  by  an  augmented  form,  including 


')-  -a- 


(A-22) 


and  then  equating  corresponding  partitions  of  the  two  equations. 

Summarizing:  for  a  single-phaee  system  in  which  the  smoothed  data  equation  ma¬ 
trices  (D*,  R*)  are  required,  the  necessary  ^recursive  equations  are  (A-5),  (A-18)  and 
(A-21).  Matrices  R  and  Rc  are  not  required  for  this  case.  The  matrices  denoted  (D,R) 

in  Section  II  are  the  smoothed  date  equation  matrices  (D?,  rJ)  in  this  appendix,  evalu¬ 
ated  after  the  last  update  at  time  tf.  * 
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SUMMARY 

This  chapter  develops  the  full  linear  model  describing  the  propagation  of  error  for  inertial  navigation 
systems  employing  the  local- level,  wander-azimuth  navigation  mechanization  equations.  The  model  applies 
to  Schuler- tuned,  space-stable  or  strapdown  inertial  system  instrumentations.  For  this  model,  alternative 
approximate  linear  models  are  developed  which  in  different  operational  applications  have  proved  adequate 
as  "design  models"  for  the  application  of  Kalman  estimation  theory. 

INTRODUCTION 

The  application  of  modern  linear  estimation  techniques  requires  an  "adequate"  model  of  the  system 
whose  states  are  to  be  estimated.  The  adequacy  of  a  particular  model  of  a  physical  system  is  usually 
determined  by  digital  computer  simulation.  In  these  simulations  as  accurate  a  model  as  is  obtainable  is 
employed  to  represent  the  physical  system  and  the  subset  of  its  stales  which  are  of  interest  in  the  applica¬ 
tion  are  estimated  using  a  Kalman  filter  based  on  a  "design  model" of  this  system.  The  adequacy  of  the 
design  model  is  ascertained  by  observing  how  well  the  state  estimates  track  the  actual  states  of  the  simula¬ 
ted  physical  system  using  the  root  mean  square  (rms)  difference  between  them  as  a  criterion.  Some 
designers  also  compare  the  standard  deviations  from  the  estimator  covariance  matrix  to  these  rms  values 
as  an  Indicator  of  design  adequacy.  A  more  absolute  judgement  of  estimator  performance  is  obtainable  by 
comparing  the  rms  estimate  errors  with  the  standard  deviations  from  the  covariance  matrix  for  the  estimator 
based  on  the  exact  model  of  the  simulated  system,  the  so-called  "real-world"  model  covariance  analysis 
approach.  To  economize  on  mechanization  requirements  it  is  usually  desirable  to  simplify  the  latter  complete 
estimator  design  model,  motivating  the  examination  of  various  simplified  candidate  design  models. 

Regardless  of  the  estimator  design  criterion  employed,  the  work  of  estimator  design  Involves  a  "tuning" 
process  in  which  states  are  added  or  deleted,  dynamic  intercouplings  are  changed  and  white  noise  compo¬ 
nents  are  altered.  The  subject  of  this  chapter  deals  with  an  aspect  of  this  lattor  process  for  the  case  of 
Inertial  navigation  systems.  Herein  it  is  shown  that  the  analytical  model  for  propagation  of  error  in  the 
navigation  variables  of  position,  velocity  and  attitude  have  alternative  forms  with  various  degrees  of  approx¬ 
imation.  The  model  differences  are  important  in  that  they  imply  different  real-time  digital  computer  mechan¬ 
isation  requirements  in  terms  of  memory  and  duty  cycle.  These  models  have  boon  employed  in  operational 
systems  and  have  been  proved  effective  for  differing  application  requirements. 

LOCAL-LEVEL  COORDINATE  SYSTEM  NAVIGATION  EQUATIONS 

The  process  of  terrestrial  navigation  using  inertial  equipment  Involves  the  measurement  of  force  by  a 
usually  orthogonal  triad  of  accelerometers  whose  orientation  relative  to  the  earth  is  eatablished  and  main¬ 
tained  by  three  usually  orthogonal  gyroscopic  axes.  Since  the  orientation  of  the  accelerometers  relative  to 
the  earth  la  known,  the  force  due  to  gravity  can  be  removed  from  thc-sc  measurements  using  an  analytical 
model,  to  obtain  the  acceleration  of  the  Inertial  aystem  center  of  mass  relative  to  inertial  space.  Correction 
of  these  measurements  obtained  or  expressed  In  some  reference  navigstion  coordinate  frame  for  Coriolis 
accelerations  due  to  the  effects  of  earth  and  reference  franc  rotation  rate  relative  to  inertial  apace,  yields 
the  rate  of  change  of  system  velocity  relative  to  the  earth  with  respect  to  the  reference  navigation  coordi¬ 
nate  frame.  Integration  of  these  variables  with  proper  tnlt  allaation ,  then  yields  the  velocity  of  the  inertial 
system  relative  to  the  earth.  Transformation  of  the  veloclf  components  to  an  earth  fixed  frame  and  sub¬ 
sequent  integration  yields  system  position  change  relative  to  the  earth  thus  accomplishing  the  navigation 
objective.  To  make  these  statements  mere  specific,  the  navigation  equations  that  are  mechanized  for  moat 
aircraft  inertial  navigation  systems  assume  as  the  navigstion  reference  coordinate  frame  (x.y.z),  a  local- 
level  aystem  with  "wander  azimuth  angle*  a,  aa  illustrated  in  Figure  1.  This  coordinate  frame  residea  at 
the  center  of  maes  of  the  inertia)  system  and  Is  maintained  In  the  local-level  orientation  as  the  system  is 
moved  relative  to  the  earth.  Alternative  mechanisations  for  the  behsvior  of  the  wander  angle  are  possible. 
For  example  if  «(t)  s  0  to r  all  t.  then  the  navigation  equations  correspond  to  the  north-slaved  mechaniza¬ 
tion  since  the  y-axis  is  always  directed  northward.  In  this  case  the  x  and  y  axes  are  coincident  with  the 
local  east  and  north  axes  while  the  s  axia  still  remains  coincident  with  the  local  vertical. 

COORDINATE  FRAMES  FOR  ERROR  MODEL  DEVELOPMENT 

The  local-level  coordinate  navigation  equations  that  are  implemented  In  the  real-time  digital  computer 
have  been  derived  In  detail  elsewhere!  21  and  are  aummarited  in  extended  fora  in  the  appendix  to  be 
readily  employed  below.  The  development  of  the  describing  error  equations  presented  here  is  more  con¬ 
ceptual  th’ft  mathematically  rigorous  to  simplify  presentation  of  the  materia).  The  development  is  facilitated 
by  the  introduction  of  two  additional  orthogonal  coordinate  systems  to  that  depicted  in  Figure  1.  The  frame 
shown  there  Is  hereafter  referred  to  as  the  reference  frame  and  is  local-level  at  the  true  position  of  the 
inertial  system  with  the  asimuth  angle,  a. 

PLATFORM  FRAME 

The  first  additional  frame  is  called  the  platform  frame  and  is  slightly  misaUgnsd  from  the  reference 
frame  via  email  attitude  error  angles  as  defined  by  the  skew-symmetric  transformation i 


where: 


where  the  angles  are  positive  counter-clockwise  about  their  respective  axes,  the  Variables  4X  y  representing 
tilt  of  the  platform  coordinates  and  the  azimuth  variable  •  representing  the  difference:  ’ 

♦,4« p  -  “  (2) 


Figure  1.  Illustration  of  the  Local-Level  Navigation  Reference  Coordinate  Frame  jx.y.s)  and  an  Earth- 
Fixed  (X,Y,Z)  Reference  Coordinate  Frame. 


SYSTEM  COMPUTED  LOCAL  GEODETIC  FRAME 


The  second  orthogonal  coordinate  frame  used  in  the  error  description  la  called  the  system  computed 
local  geodetic  frame  and  differs  in  angular  orientation  from  tha  reference  coordinate  frame  by  three  small, 
independent  n on-orthogonal  rotations  due  to  errors  in  system  computed  geodetic  position  and  the  wander 

angle: 


«♦ 


is  the  error  in  system  computed  geodetic  latitude  positive  clockwise  about  the  local  east 
axis.  Note  f  6g  is  positive  counter-clockwise  about  the  east  axis 


is  the  error  in  system  computed  longitude  positive  counterclockwise  about  tbe  earth's 
polar  axis.  Nota  this  error  can  be  projected  onto  the  local  north  and  vertical  axes 
knowing  tha  ay  slam  latitude  asi 


«*m4  tx  cos  a 

11  O) 

«*y  A  4X  sin  ♦ 

4  a  A  o  *  a  is  the  difference  between  the  system  computed  wander  angle  and  that  of  the  reference 

coordinate  system. 

The  three  sources  of  angular  rotation  can  bs  expressed  about  tha  reference  coordinate  axes  knowing  the 
wander  angle  mat 


Tha  notation  convention  in  this  chapter  normally  identifies  scalar  quantities  with  subscripts  except  where 
the  text  defines  vector  quantities.  Brackets  M.  are  employed  to  Identify  matrices  which  ere  ell  defined 
explicitly  in  the  text.  Variables  with  no  subscripts  are  normally  vectors  which  are  defined  explicitly  in  the 
text  and  where  relevant  tho  text  Indicates  tha  coordinate  system  in  which  tha  vector  la  assumed  to  be 
expressed. 


(4) 


4  flx  4  *  cos  <*+4  8^  sin  “ 
4  6y  4  4  8jtj  coe  a  ~  4  6g  sin  a 

4  8  ASX  sin  ♦  +  4  a 
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These  three  rotations  describe  completely  the  difference  in  orientation  of  a  coordinate  system  described  by 
the  direction  cosine  matrix  [D)cdefined  in  the  appendix,  based  upon  the  computed  latitude,  longitude  and 
wander  angle  {♦.A,  aj  of  a  local-level  coordinate  frame  relative  to  the  reference  local-level  coordinate  frame: 
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ALTERNATE  INSTRUMENT  MECHANIZATIONS  OF  INERTIAL  SYSTEMS 


(5) 


(6) 


Several  mechanisations  can  be  employed  for  obtaining  the  inertial  instrument  measurements.  Regard¬ 
less  of  the  instrumentation  approach  employed,  the  local-level  coordinate  system  formulation  of  the  naviga¬ 
tion  mechanisation  equations  can  be  employed  to  accomplish  the  navigation  function  as  long  as  the 
acceleromete-  measurements  can  be  expressed  along  the  local-level  coordinate  axes.  In  all  mechanisations 
the  attitude  error  4,  between  the  platform  frame  and  the  local-level  frame  is  minimised  prior  to  use  of  the 
inertial  system  for  navigation  by  a  process  called  alignment. 

SCHULER-TUNED  PLATFORM  MECHANIZATION 


The  Schuler-tuned  platform  mechanisation  of  the  inertial  instrumentation  has  been  employed  for  several 
decades.  In  this  approach  it  is  attempted  to  maintain  the  accelerometer  sensing  axes  coincident  with  the 
local-level  frame  by  appropriate  processing  of  the  system  gyros.  The  commonly  mounted  gyros  and 
accelerometers  comprise  what  is  called  the  stable  element  which  is  Isolated  from  the  angular  motion  of  the 
carrying  vehicle  by  a  gimbal  set.  Once  initial  alignment  of  the  accelerometer  sensing  axes  with  the  local- level 
frame  coordinates  has  been  achieved,  this  orientation  is  maintained  by  processing  the  gyros,  relative  to 
inertial  space,  by  the  system  computed  spatial  rate  ac .  of  the  local-level  frame.  This  term  is  the  sum  of 
the  system  computed  angular  rate  of  the  local-level  frame  relative  to  the  earth  oc,  and  the  system  computed 
rotation  rate  of  the  earth  relative  to  inertial  apace,  Q^.i 

"c  °  pc  +  flc 

The  ptatform  frame  in  this  mechanisation  is  the  orientation  defined  by  the  acceleromoter  sensing  axes  which 
are  Ideally  maintained  coincident  with  the  local-level  reference  frame.  Consequently  the  angular  rate  of 
the  platform  coordinate  frame  relative  to  the  instrument  sensing  axes  expressed  in  the  local-level  frame  v^., 

Is  mechanised  as  aero. 

STRAPDOWN  AND  SPACE-STABLE  INERTIAL  INSTRUMENT  MECHANIZATIONS 

More  generally  as  in  the  case  of  an  alternate  gimbaled  inertia)  system  or  s  strapdown  inertial  system 
mechanlastion,  the  platform  coordinate  frame  is  a  computed  orientation  relative  to  that  where  the  accelerom¬ 
eter  axes  actually  exist,  that  la  determined  using  the  Inertial  instrument  messurements.  In  the  error-less 
case  the  platform  frame  Is  sgtin  of  course  coincident  with  the  local-level  reference  frame.  In  a  strapdown 
system  where  there  is  no  gimbal  set  and  the  inertial  instruments  are  'strapped'  to  the  carrying  vehicle 
frame,  the  transformation  matrix  between  the  platform  frame  and  the  instrument  frame  Is  computed  using 
the  system  computed  spatial  rate  of  the  local-level  frame  uc,  minus  the  gyro  measurements  of  angular  rate 
of  the  vehicle  frame  relative  to  inertial  space  both  expressed  in  the  local- level  frame: 

vc  s  “c  '  lP,c  “g  (l) 

For  tn  alternate  gimbaled  inertial  system  such  as  space-stable,  where  the  Instruments  rsmsin  fixed  relative 
to  inertial  space,  the  platform  to  instrument  frame  transformation  matrix  la  obtained  as  the  system  computed 
spatial  angular  rate,  expressed  in  the  local-level  frame  *»  » 


since  the  accelerometer  axes  are  presumed  fixed  relative  to  inertial  space- 

In  all  the  above  mechanisations  the  linear  transformation  or  direction  cosine  matrix  from  the  accelerom¬ 
eter  coordinate  frame  |x,y,tlt  to  the  platform  frame  can  be  computed  once  initialisation  by  alignment  has 
been  achieved,  by  integration  of  the  matrix  differential  equation: 

|P!e  »  !v)c  tP)c  (10) 

where  the  anti-symmstric  matrix  of  relative  angular  rates  of  the  platform  frame  relative  to  the  instrument 
frame  in  platform  axea  lx  defined  aa: 
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Using  the  three  coordinate  systems  defined  above ,  we  may  now  develop  a  set  of  ton  linea-  differential 
equations  describing  the  propagation  of  error  for  the  local-level  coordinate  system  navigation  equations  and 
the  three  different  inertial,  instrument  implementations  just  discussed.  These  ten  equations  describe  the 
time  rate  of  change  of  velocity  difference  4V ,  attitude  error  +  ,  position  error  4  6,  and  elevation  error,  4h. 

ERROR  IN  THE  SYSTEM  COMPUTED  VELOCITY 

The  rate  of  change  of  velocity  difference  is  obtained  by  first  differencing  the  representation  of  the 
mechanised  version  of  this  equation  employing  actual  values  in  the  system  computer ,(11A)  of  the  appendix, 
with  the  representation  of  the  acceleration  equation  for  the  local- level  reference  navigation  frame,  (1A)  of 
the  appendix.  Since  the  accelerometer  measurements  are  expressed  in  the  platform  coordinate  frame,  their 
difference  from  the  accelerometer  measurements  expressed  in  the  reference  frar-te  due  to  the  effecte  of 
attitude  error  1st 

«a  4A  -  A  «  |*|  A  (12) 

The  system  computed  Coriolis  acceleration  components  are  all  determined  from  variable#  which  reside  in 
registers  In  the  system  computer.  These  system  computed  values  differ  from  the  values  of  the  seme  vari¬ 
ables  thst  correspond  to  the  reference  navigation  coordinate  frame  as  defined  below  t 
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which  upon  examination  of  Equations  («*/*,  12A)  Ignoring  the  small  effect*  of  error  in  the  computed  radii  of 
curvature,  yields t 
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where  to  first  order  R  can  be  assumed  to  be  a  nominal  redius  for  tha  earth. 


Further  to  first-order: 


which  with  (4)  can  be  written  as: 
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Combining  the  above  equations. 


4Q  =  }48]  Q  =  0  *  4  8 

the  difference  between  (11A)  and  (1A)  to  first-order  is: 
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a  (4)  A  +  ( <5  p  +  2«nl  V  -  (iV]  (p  +  2J1)  +  4y  +  V 
where  the  following  skew-symmetric  matrix  definitions  have  been  employed: 


and: 
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represents  the  error  in  the  computation  of  the  gravity  vector 
from  the  analytical  mode)  due  solely  to  error  in  system  computed 
geodetic  position  and  elevation.  If  an  ellipsoidal  equlpotentia) 
surface  is  assumed  for  the  grsvity  model  4  yx  *  <  Yy  =  0,  and 
4  Ya  1*  In  error  due  only  to  error  in  system  computed  latitude 
and  elevation. 

(22) 

represents  generalised  accelerometer  measurement  error  due  to  all 
instrument  related  error  sources  transformed  onto  the  reference 
local-level  coordinate  frame  plus  the  difference  between  actual 
gravity  and  that  represented  by  the  gravity  model.  Note  if  an 
ellipsoidal  squlpotsntisl  surfacs  is  assumed  for  the  gravity  model, 
error  axiits  along  the  level  axes  due  to  the  deflection  of  the 
vertical  to  the  geotd  relative  to  that  of  the  ellipsoid!  *).  The  error 
exiats  along  the  vertical  due  to  model  error  for  the  intensity  of 
the  grsvity. 


At  this  point  sorns  remarks  should  bs  mads  about  ths  meaning  of  the  various  quantities  defined  above. 
Generally  ths  definition  of  ’error*  that  Has  been  selected  stands  for  ths  slmpls  difference  between  the 
value  of  the  variable  aa  it  physically  axitta  in  ths  system  computer  and  ths  valua  of  ths  variable  as  Inttr- 
pretsd  in  ths  reference  local-level  navigation  coordinate  system  when  determined  without  error.  For  the 
case  of  earth  rate,  the  error  (18)  corresponds  to  ths  difference  between  the  earth  rate  determined  In  the 
computer  coordinates  and  that  for  ths  reference  coordinates.  For  velocity  error  (13,  20),  this  is  not  the 
case  as  by  construction  ths  velocity  error  which  arises  from  ths  integration  of  Equation  (20)  results  from 
ths  translomafion  error  0,  between  the  platform  and  reference  coordinate  systems,  the  errors  in  the  com¬ 
puted  Coriolis  acceleration  end  gravity  vector  components  sad  of  course  may  measurement  errors  associated 
with  the  sccelercmetere. 


If  ea  occurs  in  acme  applications,  it  is  desired  to  express  the  ancsiersUon  measurement  in  an  earth- 
fixed  coordinate  frame  prior  to  integration,  aa  (X,Y,Z)  shown  in  Figure  1,  then  the  total  transformation 
error  must  include  the  effects  of  the  position  error  48,  of  (4)  since  the  measurements  Ap,  must  be  trans¬ 
formed  through  the  system  computed  direction  cosine  matrix  (D)c  of  the  appendix,  to  the  earth-fixed  frame. 
In  this  csss  however,  the  mechanisation  equations  which  are  integrated  to  compute  the  system  vslor'ty  are 
different  from  (1A)  aa  ths  earth-fixed  frame  In  which  the  integration  is  performed  is  obviously  not  rotating 
with  respect  to  the  earth  as  la  the  local-level  navigation  f rasas.  Ths  correct  mechanisation  aquation  for  ths 
rets  of  change  of  inertial  system  velocity  relative  to  the  earth  with  respect  to  earth-fixed  coordinates  is,  in 


error-free  vector  form: 


where: 
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denotes  the  rate  of  change  of  velocity  relative  to  the 
earth  with  respect  to  the  earth-fixed  frame  which  when 
Integrated  in  the  earth- fixed  frame  yields  system 
velocity  with  respect  to  the  earth  in  the  earth-fixed 
frame 

is  the  specific  force  measured  by  the  accelerometers 
due  to  the  true  system  acceleration  and  the  modeled 
gravity  at  the  true  system  position,  expressed  in  the 
earth- fixed  frame 
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is  the  earth  rotation  rate  expressed  in  the  earth- fixed 
frame  of  Figure  1 
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is  the  projection  of  the  modeled  gravity  vector  at  the 
true  system  position  onto  the  earth-fixed  frame: 


One  mechanised  version  of  (23)  that  would  be  integrated  in  the  system  computer  is: 
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where  all  components  are  expressed  in  the  earth- fixed  coordinate  system  prior  to  integration.  Note  the 
earth  rate  components  are  known  exactly  in  the  earth- fixed  frame,  but  the  direction  and  magnitude  of  the 
gravity  vector  components  relative  to  these  coordinates  are  a  function  of  the  system  computed  latitude, 
longitude  and  elevation  and  hence  subject  to  the  errors  in  these  variables.  The  equation  for  the  difference 
between  (23)  and  (24)  is  then  to  first  order: 
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where  relative  to  the  local-level  reference  frame: 
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represents  angular  error  incurred  In  the  transformation  of  the  accelerometer  measurements  from  the  platform 
frame  to  an  earth-fixed  frame.  Here,  as  <9  is  small,  ws  havtt 


M 

V 

y  «  U  *  68) 

y 

UJ 

(27) 


Note  that  the  computer  frame  is  not  an  earth-fixed  frame  but  related  to  an  earth- fixed  frame  through  the 
computed  position  end  wander  angle  (4  .X.a)^.  Further: 
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ERROR  IN  THE  SYSTEM  COMPUTED  POSITION  AND  ELEVATION 


In  that  the  direction  coaine  Matrix  |D|  defines  the  system  computed  latitude,  longitude  and  wander 
angle,  the  perturbation  of  this  matrix  due  to  error  In  these  three  Independent  variables  ie  of  interest. 
Directly  from  the  definition  (5A),  the  following  perturbations  relative  to  (41,64,4a)  can  be  obtained 
which  upon  employing  the  definition  (5A,4)  yields  not  surprisingly  the  second  equivalence: 


6<X,x>  =  -6X  (ca  sX  +  sa  s4  c X) 

-6  a  (sa  cX  +  cas4sX)  =  4  8^  •  <X,y>  -48*  <X,z> 

-4  8  (sa  c4  sX) 

6  <X,y>  =  4  X  (sa  sX  •  ca  sJ  cl) 

+4a  (sa  84  sX-  cacX)  =58  •  <X,z>  -  48  •  <X,x> 

X  z 

-4*  (ca  c4  sX) 

6<X,z>  =  -5*  (c4  sX)  +  5X  (c4  cX)  =  56  •  <X,x>  -  48  •  <X,y> 

5  <Y,x>  =  "4a  (ca  c*)  -  44  (sa  84)  =  4  8^  *  <Y,y>  -  5  6^  •  <Y,z>  (29) 

4  <Y,y>  =  -4  a  (sa  c4)  -  5  4  (ca  84)  =  5  8^  •  <Y,z>  -  4  8z  >  <Y,z> 

6<Y,z>  =  5 4  c4  =  46  •  <Y,x>  -  4  6  •  <Y,y> 

y  x 

4  <Z,x>  =  4  X  (sa  s4  s  X  -  ca  sX) 

+4  a  (sa  sX  -  ca  s4  cX)  =  4  6^  •  <Z,y>  -  48  •  <Z,z> 

-5  4  (sa  c4  cX) 

4<Z,y>=  5X  (sa  cX  +  ca  s4  sX) 

+4  a  (ca  sX  +  sas4cX)  =5  8x*<Z,z>-58  •  <Z,x> 

-4X  (ca  c4  cX) 

4  <Z ,z>  =  -4X  (c4  sX)  -  4  4(s4  cX)  =  4  8y  •  <Z,x>  -  5  8^  «  <Z,y> 
where  (s,c)  denote  (sine,  cosine)  respectively.  Clearly: 

I4D]  4  (D)c  -  [D]  *»  (681  ID]  (30) 


=  tq.-  4D] 


Thus  as  stated  above,  we  have  confirmed  that  the  perturbation  vector  angle  6  8,  represents  a  rotation 
from  the  local-level  reference  frame  to  the  computed  geodetic  frame.  Proceeding  formally  we  may  differen¬ 
tiate  the  first-order  approximation  in  (30)  to  obtain: 

(to)  •  (4*8]  (DJ  ♦  (46)  (31) 


which  from  (4A): 

yields  for  (31): 


(i»  »  ip)  (D) 

I  to]  a  <(4i]  a  (46)  to))  (D) 


and  further  upon  differentiation  of  (30)  and  using  definitions  we  obtain  to  first-order: 

(to]  »  ?4p)  (D)  ♦  ( p  1  ( 6D]  4(D)C  -  (6) 


where: 


(32) 

(S3) 

(34) 


(4el  4(p1c  -  (pj  (35) 

which  using  (30)  yields i 

(to]  ■  «4p|  ♦  (p)  (161)  (0)  (35) 

Nots  Ip  denotes  tha  difference  in  the  angular  rate  of  the  system  computed  geodetic  frame  and  the  local - 
level  frsms  both  taken  with  respect  to  the  earth  frame,  whereas  6  4  denotes  the  angular  rets  of  change  of 
the  system  computed  geodetic  frame  taken  with  rsepoct  to  the  local-level  geodetic  frame.  From  Equations 
(31,  35)  since  ID)  la  Invertible,  we  obtain: 

(«•]  *  (*p|  ♦  Ip)  l«6]  -  166]  |p]  (37) 


or  in  acalar  form  upon  expansion: 


(38) 


er  in  vector  form: 


4  6 


x 


«px  +  P,«ey  -  Py«e8 

«Py  +  px«e,  -  p,«0x 
«p2  +  Py  «0X  -  px«6y 


46  = 


4  p  -  p  x  4  6 


(39) 


The  form  (19)  merely  indicates  that  the  rate  of  change  of  angular  error  of  the  computed  geodetic  frame 
relative  to  the  local- level  reference  frame  4  6,  is  simply  the  sum  of  the  rate  of  change  of  this  angular  error 
relative  to  the  earth  4p  or  4  8e,  and  the  Coriolis  effect  due  to  the  rotation  rate  between  the  computed 
geodetic  frame  and  the  earth.  The  second  term  in  (39)  can  also  be  viewed  as  the  error  in  resolving  the 
correct  relative  angular  rate  vector  p,  from  the  earth-fixed  frame  to  the  local-level  reference  frame  due  to 
the  error  4  8,  in  a  knowledge  of  its  orientation  when  the  system  computed  latitude,  longitude  and  wander- 
angle  are  employed. 


The  final  position  error  equation  is  that  for  the  elevation  of  the  system  relative  to  the  earth.  Eleva¬ 
tion  error  is  obtained  by  integrating  the  difference  between  the  system  computed  and  actual  system 
velocity  relative  to  the  earth  as  projected  onto  the  local  vertical  axis: 

4b  =  V  -  V  $  4V  (40) 

c  z  z  X 
c 

ERROR  IN  THE  PLATFORM  FRAME  ORIENTATION 


To  derive  the  dynamics  of  the  attitude  error  4,  of  the  platform  frame  relative  to  the  local-level  refer¬ 
ence  frame  we  can  proceed  in  a  manner  analogous  to  that  for  angular  position  kw'  above.  First  the  dif¬ 
ference  between  the  system  computed  platform  frame  to  instrument  frame  transformation  (P)c,  and  the 
local-level  to  instrument  frame  transformation  [Pi,  is  derived: 

[6P1  4  [Plc  -  (P)  =>  (4l  [PI  (41) 


where: 
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r  ^ 
X 
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»  (pi 
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B  (I  ♦  4l 
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(42) 
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p 

have  been  used.  Differentiation  of  (41)  yields: 

[6$)  «  ([♦)  ♦  !♦)  fv|)  (PI  (43) 

where  (4)  is  expressed  and  differentiated  with  respect  to  the  local-level  frame.  Further 

UP!  »  I  PL  -  (PI  «  tvl  IPL  -  !v)  (PI  (44) 

c  c  c 

aU4  vj  ♦  [vj  (#t )  (PI 


where: 

(4vJ  4  (v)e  -  | v  1  (45) 

is  the  difference  between  the  angular  rate  of  the  system  computed  platform  frame  with  respect  to  the 
instrument  frame  expressed  in  the  platform  frame  |vL,  and  the  angular  rate  of  the  local  level  frame  with 
respect  to  the  instrument  frame  ( v]  expressed  in  the  local-level  frame.  Equating  (43.  44)  using  the 
InvertibUlly  of  (P|  yields: 

(41  -  (4v|  ♦  tv)  (g)  -  (4|  (v)  (44) 


which  yields  the  vector  form: 

4  *  4  v  ♦  #  *  v  (47) 

It  le  of  Interest  to  specify  (47)  for  the  three  instrumentation  mechanisations  discussed  above:  Schuler- tuned, 
space-stable  and  strapdown. 

In  tha  case  of  s  Schuler-tuned  mechanisation,  the  platform  frame  Is  ths  instrument  frsme,  hence  v.  is 
taro.  However  the  instrument  frame  rotates  relative  to  tha  local-level  frame  a*  the  gyros  ere  processed  at 


(38) 


or  in  vector  form: 


4  6x  =  «Px  +  P,«*y  -  Pysei 
«Sy  =  «Py  +  Px«ei  - 

4  6s  =  «P2  +  Py  fi6x  -  Px4  6y 


4  8  =  4  p  -  p  *  4  9 


(39) 


The  form  (39)  merely  indicates  that  the  rate>  of  change  of  angular  error  of  the  computed  geodetic  frame 
relative  to  the  local-level  reference  frame  4  9,  is  simply  the  sum  of  the  rate  of  change  of  this  angular  error 
relative  to  the  earth  4  p  or  4  8e,  and  the  Coriolis  effect  due  to  the  rotation  rate  between  the  computed 
geodetic  frame  and  the  earth.  The  second  term  in  (39)  can  also  be  viewed  as  the  error  in  resolving  the 
correct  relative  angular  rate  vector  p,  from  the  earth- fixed  frame  to  the  local-level  reference  frame  due  to 
the  error  4  6,  in  a  knowledge  of  its  orientation  when  the  system  computed  latitude,  longitude  and  wander- 
angle  are  employed. 


The  final  position  error  equation  is  that  for  the  elevation  of  the  system  relative  to  the  earth.  Eleva¬ 
tion  error  is  obtained  by  integrating  the  difference  between  the  system  computed  and  actual  system 
velocity  relative  to  the  earth  as  projected  onto  tha  local  vertical  axis: 

$h  =  V  -  V  $  4V  (40) 

c  s  s  * 
c 

ERROR  IN  THE  PLATFORM  FRAME  ORIENTATION 


To  derive  the  dynamics  of  the  attitude  error  4,  of  the  platform  frame  relative  to  the  local-level  refer¬ 
ence  frame  we  can  proceed  in  a  manner  analogous  to  that  for  angular  position  error  above.  First  the  dif¬ 
ference  between  the  system  computed  platform  frame  to  Instrument  frame  transformation  (Plc,  and  the 
local-level  to  instrument  frame  transformation  (PI,  is  derived: 

UP]  4  (Plc  -  (P!  -  (4)  EP1  (41) 


where: 
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have  been  used.  Differentiation  of  (41)  yields: 

Ul'l  -  U4l  ♦  (41  tvl )  (PI 

where  (el  Is  expressed  and  differentiated  with  respect  to  the  local-level  frame-  Further 

UP!  ■  (P!c  •  (PI  *  U!c  (Plc  -  (v|  (P) 

°(Uv)  *  tvl  (411  (PI 


(42) 


(43) 

(44) 


where: 


Uv]  4  |v|c  -  (v|  (4$; 

U  the  difference  between  the  angular  rate  of  the  system  computed  platform  frame  with  respect  to  the 
Instrument  freme  expressed  In  the  platform  frame  ( v  1  c .  and  the  angular  rate  of  the  local  level  frame  with 
respect  to  the  Instrument  frame  (v|  expressed  in  the  local-level  frame.  Equating  (43,  44)  using  the 
InvertihlUty  of  (P|  yields: 


Ul  -  Uvl  ♦  (v|  (4)  -  (41  (v| 


(46) 


which  yields  the  vector  form: 

4“4va4»v  MT) 

It  Is  of  Interest  to  specify  (4?)  for  the  three  Instrumentation  mechanisations  discussed  above;  Schuler-tuned, 
space-stable  end  slrspdown. 

In  the  case  of  a  Schuler-tuned  mechanisation,  the  platform  frame  is  the  instrument  frame,  hence  v-  Is 
taro.  However  the  instrument  frame  rotates  relative  to  the  local-level  freme  es  the  gyroe  ere  processed  at 


the  computed  spatial  rate  uc,  applied  about  the  platform  frame  coordinates  as  opposed  to  the  local- level 
frame  coordinates.  The  instrument  frame  also  rotates  relative  to  the  local-level  frame  due  to  the  generalized 
drift  rate  of  the  gyros  e.  hencei 


4v  =  4u+4*<‘>+£  =  4 


(48) 


where  4  id  denotes  the  scalar  difference  of  the  components  of  the  vectors  u>c  and  w; 


4io4 


ai  —  u 

x  x 
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I  -111 

z  z 

c 


=  4  p  +  4  Q 


(49) 


and  f  x  u  expresses  the  rotation  rate  of  the  platform  frame  relative  to  the  local-level  frame  when  the 
local-level  spatial  angular  rate  components  are  applied  In  the  misaligned  platform  frame  even  if  they  are 
perfectly  known. 


For  the  space-stable  mechanization; 


v  A  “  (^®) 

c  c 

whereas  the  angular  rate  of  the  local-level  frame  relative  to  the  instrument  frame  is; 

v  a  in  -  c  (51) 

where  the  components  of  v  are  about  the  local-level  frame  coordinates.  Hence  for  the  space  stable  mechani¬ 
zation  using  (47,  50.  51),  we  have 


4  =  itii  a  *  w  *  c  (52) 

For  the  strapdown  mechanization; 

v  (S3) 

c  c  eg 

whereas  the  angular  rate  of  the  local-level  frame  relative  to  the  instrument  frame  is; 

v  =  w  -  (PI  4  (54) 


where  we  define  generalised  gyro  drift  rate  about  the  Instrument  exes  c  *  a*  the  difference  between  the 
actual  frame  rotation  rate  relative  to  Inertial  apace  *  and  the  gyro  output  measurement  a  • 


4  4  4. 

yielding  the  generalized  drift  rate  thou*,  local-level  coordinates  as; 

«  «  IH  *g 

Hence  differencing  {S3.  54)  ueing  (41)  and  (55,  56),  we  have  to  first  order; 

4  v  «  4«  -  |4)(Pl  4  ♦  z 


(55) 


(56) 


(57) 


Consequently  using  (47,  Si.  57)  we  have 

4  «  4«  -Ul  (PJ  4  *  «  *  4  «  U-  (PI  4} 


« 


which  sat 

(4)  IP)  4  «  -  *  .  (P)  4  (5«) 

yields; 

i  a  4  i  i  (59) 

to  describe  the  dynamics  of  the  attitude  error  for  the  strapdown  instrument  mechanization  -  #e  note  in 
summary  that  regardless  of  Uie  three  mechanisation*  employed,  the  dynamic*  of  the  attitude  error  propaga¬ 
tion  (49,  42.  59)  u*  identical  in  form.  Hots  however  the  generalized  drift  rat*  vector  z.  employed  in 
those  equadone  Is  obtained  by  transforming  the  drift  rale  about  Instrument  axe*  to  ib*  Incal-level  frame 
coordinate  ernes. 

ANGULAR  ROTATION  BETWEEN  THE  PLATFORM  AND  SYSTEM  COMPUTED  GEODETIC  FRAMES 

The  dynamics  of  the  angular  rotation  from  the  system  computed  geodetic  frame  to  the  platform  frame; 

♦  4  4  *  49  (W) 

relative  to  the  local-level  frame,  can  be  obtained  by  differencing  the  corresponding  individual  rates  ss 
derived  above  09,  59); 


or; 


*44-  44*  4p-4t«fl*4»w*«-4p  -  tt  •  p 


(61) 


(6?) 


The  advantage  of  (62)  is  that  it  is  simpler  to  propagate  than  the  attitude  error  expression  (59). 

Knowledge  of  *  and  4  6  allows  determination  of  ♦  at  any  instant  of  time  by  (60).  Noting  that  since  in  (62) 
the  time  rate  of  change  of  y  is  relative  to  the  local-level  frame  which  rotates  with  angular  rate  w  with 
respect  to  inertial  space  we  can  conclude: 

(63) 

where  denotes  the  time  rate  of  change  of  y  relative  to  inertial  space.  In  words,  the  rate  of  change  of 
y  viewed  from  inertial  space  is.simply  the  generalized  drift  rate  of  the  system  gyros  projected  onto  the 
inertial  frame  coordinate  axes*  . 

RELATIONSHIP  BETWEEN  COORDINATE  FRAMES 

The  correspondence  between  the  coordinate  frames  discussed  above  can  be  summarized  conveniently  in 
Figure  2  below.  Herein  the  arrows  Indicate  in  which  direction  the  denoted  transformation  is  used  in  obtain¬ 
ing  the  ensuing  coordinate  frame.  Since  all  transformations  are  orthogonal  however,  the  inverse  transfor¬ 
mation  is  realized  by  the  transpose. 


INSTRUMENT 

(ACCELEROMETER)  FRAME 


(P)1  ll  +*)(D] 


(x.y.zla 

PLATFORM  FRAME 

(*•*•*], 


(i 


EARTH-FIXED  FRAME 
[X.Y.Z] 


SYSTEM  COMPUTED 
GEODETIC  FRAME 


(x.y.i] 


(1  ♦  66] 


Figure  2.  Relationship  Between  Coordinate  Frames 
KALMAN  KILTER  MODELING  CONSIDERATIONS 

After  having  developed  the  fundamental  linear  differential  equations  describing  the  error  propagation 
for  Inertial  navigation  *y items,  we  are  now  in  a  position  to  diacuea  various  simplification#  which  havr  been 
made  to  implement  operational  Kalman  fitter  designs.  The  objective  of  such  simplifications  In  »  specific 
application  is  to  reduce  the  computer  memory  and  duty  cycle  requirements  without  incurring  unacceptable 
degradation  in  the  performance  of  the  operational  filter. 

Simplifications  that  are  investigated  by  Kalman  fitter  design  engineer*  generally  fall  into  two  categoric*. 
The  fitret  of  these  categories  deals  with  the  question  ss  to  how  many  of  the  modelable  Instrument  error 
states  present  in  the  error  projection  vectors  for  accelerometer  measurement  error  9 ,  and  generalized  gyro 
drift  rate  t.  need  be  included  as  states  in  the  Kalman  filter.  The  elemental  states  to  be  con-ldercd  include 
instrument  bias,  stale  factor  errors,  sensitive  axis  mechanical  misalignments,  sensitivity  to  products  of 
acceleration  and  angular  rates,  correlated  noise,  etc.  The  decision  as  to  which  of  these  stste*  are  incor¬ 
porated  in  the  Kalman  design  mode!  is  made  after  several  design  iterations  in  which  performance  with  and 
without  various  states  present  In  the  design  model  is  determined /filter  tuning*  to  accommodate  absence  of 
states  in  the  design  mod-’  is  performed  and  trade-offs  in  the  specification  of  error  parameters  values  for 
the  actual  system  have  bean  investigated.  Such  a  design  process  is  usually  of  lengthy  duration  and 
requires  the  use  of  highly-sophisticated  simulation  software  for  its  accomplishment.  These  issues  will  not 
concern  us  further  here. 

The  second  cstegory  of  simplifications  addresses  the  error  dynamics  of  the  navigation  system  errors 
(20,  39.  59)  and  it  whst  is  of  Interest  here.  Two  types  of  simplification  ere  of  Interest .  these  being 
reduction  o'  the  modeled  error  states  ss  above,  and  secondly  the  reduction  of  the  dynamic  coupling  between 
the  error  states  that  are  retained,  Simplifications  that  have  proved  to  be  especially  useful  in  operational 
filter  design  erst 

S  Vertical  axis  model  elimination 

e  Level  axes  Coriolis  acceleration  elimination 

s  Use  of  the  •*  equation',  (62) 
e  Alternative  definitions  of  azimuth  error 
and  are  discussed  is  detail  below  . 


VERTICAL  AXIS  MODEL  ELIMINATION 


The  subset  of  the  above  error  model  equations  which  describe  the  propagation  of  error  for  the 
axis  of  the  local-level  coordinate  frame  navigation  equations  is: 

4h  =  <5  V 

z 

-  y*  -  vs +  6C.~. +  +  vx 

where  the  error  in  the  system  computed  Coriolis  acceleration  component  along  the  vertical  axis  is: 

4C  4  <4p  +  248)  *  V  -  (4 p  +  248)  •  Vv  +  (p  +  28)  •  4V  -  (p  +  28)  •  4V 

z  yx  x  y  y  x  xy 

f  3] 

Due  to  the  dependence  of  the  normal  gravity  y,  on  the  elevation; 

6h  =~2yJ  ~  2fi2 

where  J  =  0.5  [M  *  +  N  *] ,  is  the  mean  curvature  of  the  ellipsoid  and  M  and  N  are  the  meridional  and 
normal  radii  of  curvature,  the  error  in  system  computed  elevation  leads  to  divergence  of  the  vertical  axis 
errors  ( 4  n,  4VZ) ,  Because  of  this  effect  the  system  elevation  computation  error  is  normally  bounded  in 
aircraft  applications  by  the  use  of  a  reference  source  of  elevation  as  a  barometric  altimeter  which  is  also 
used  in  a  servo  loop  arrangement  to  bound  the  vertical  velocity  error  and  bias  the  error  in  the  vertical 
acceleration  measurements.  This  is  done  even  when  the  level  axes  computations  are  uncorrected  with  other 
reference  sensor  data  and  are  operating  in  the  "free-inertial"  mode. 

Since  conventional  fixed-gain  error  control  mechanizations  using  reference  elevation  measurements  have 
obtained  adequate  vertical  axis  performance  in  most  applications,  this  channel  is  often  ignored  in  the  appli¬ 
cation  of  Kalman  filtering  to  inertial  systems*  Further  since  there  is  little  ross- coupling  from  the  vertical 
axis  to  the  level  axes,  there  is  little  need  to  model  the  vertical  axis  error  states  ia  obtaining  good  control 
over  the  level  axis  error  states.  In  this  regard  the  elevation  error  does  not  affect  at  all  the  propagation 
of  error  for  the  level  axes  whereas  the  vertical  velocity  error  affects  level  axis  error  propagation  only 
through  error  in  computation  of  the  level  axes  Coriolis  acceleration  components  as  discussed  below. 

LEVEL  AXES  CORIOLIS  ACCELERATION  ELIMINATION 


vertical 


(64) 


(65) 


(66) 


The  errors  in  the  system  computed  level  axes  Coriolis  acceleration  components  are: 

4C  4  (4  p  +  24  8)  •  V  -  ( 4  p  +  248)  *  V, 

x  i  y  y  * 

+  (p  +  28)  •  4 V  -  (p  +  28;  •  4V 

*  y  y  * 

(67) 

4C  4  (4  p  +  248)  <  V.  -  (6  P  +  248).  •  V, 

y  x  z  z  x 

+  (p  +  20)  •  4V  -  (p  +  28)  •  4 V 

X  Z  *  A 


In  moat  aircraft  applications  the  vertical  velocity  V.  is  nominally  *ero  except  at  a  few  times  during  flight. 
Consequently,  modeling  of  such  an  effect  would  only  be  considered  if  small  transient  effects  were  important. 
For  example  even  if  the  vertical  velocity  were  100  feet  per  second  (fpa),  system  computed  latitude  error 
were  a  large  10  nautical  miles  (nm)  thereby  inducing  an  error  in  computed  earth  rate  of  0.03  Sec /sec  at 
mid-latitudes  (45°)*  ,  and  system  velocity  error  were  a  large  10  foe  (O.i  SeS/sec),  the  acceleration  error 
for  the  Coriolis  components  associated  with  V  is  less  than  3  miliigals. 

Due  to  bounding  of  the  vortical  velocity  error  to  a  few  feet  per  second  via  the  verticul  axis  mechani¬ 
zation  as  discussed  above,  the  magnitude  of  the  acceleration  error  term  due  to  4Vg  is  substantially  less 
than  the  uncertainty  in  tne  gravity  model  terms  Yx,y,z*  Pre*ent  in  ‘he  full  velocity  error  equations  (20). 

For  example  for  a  1  fps  orro  and  mid-latitude  operation  of  a  vehicle  traveling  at  1000  fps  in  a  direction 
reinforcing  the  level  axis  earth  rate  terms,  the  Coriolis  acceleration  error  is  less  than  5  mllUgals  which  is 
small  compared  to  the  gravity  model  uncertainty  of  40  to  50  roilligals  on  a  world-wide  basis.  These  effects 
are  a  fortunate  coincidence  for  the  Kalman  filter  designer  as  he  can  normally  eliminate  these  dynamical 
vertical  axis  dependencies  from  his  design  model  along  with  the  vertical  axis  model  as  noted  above. 

This  type  of  magnitude  of  effect  analysis  can  be  extended  to  the  Coriolis  error  components  Involving  the 
vehicle  level  velocity  components  Vx  y,  and  the  error  in  their  system  computed  values.  One  finds  again 
for  mid-latitude  operation  with  vehicle  velocity  components  of  1000  fpa,  errors  of  10  fps  in  the  system  com¬ 
puted  velocity  components  and  10  nra  latitude  error* g the  magnitude  of  the  error  in  these  system  computed 
Coriolis  acceleration  terms  is  less  than  10  miliigals.  Consequently  for  most  applications  the  error  In  the 
system  computed  Coriolis  acceleration  components  can  be  ignored  on  a  magnitude  basis  relative  to  a  store 
dominating  source  of  "noise"  that  arises  from  the  uncertainty  in  the  gravity  model.  There  are  some 


*Exceptions  to  this  rule  can  occur  when  very  precise  measurements  of  elevation  are  available  which  If  their 
uae  is  to  be  optimal,  require*  full  modeling  of  tbn  vertical  axis  error  propagation.  In  some  cases  cor¬ 
rections  can  be  obtained  for  level  axis  tilts  due  to  their  acceleration  dependent  effect  on  vertical  velocity 
error  in  (64). 

**A** ***umed  as  sn  average  condition. 

***Here  we  have  presumed  use  of  non-singular  forms  of  the  navigation  aquations  where  p#  is  comparable  to 
the  earth  rate  component  about  the  vertical  axia. 


instances  however  where  the  errrr  in  these  latter  oystem  computed  Coriolis  terms  may  be  important  on  a 
dynamical  basis.  Inspection  of  free-inertial  error  propagation  curves  reveals  that  the  cross-coupling  of 
the  level  axis  velocity  errors  between  axes  induces  modulation  of  the  Schuler  error  oscillations  with  the 
long-term  Foucault  period  obtained  as: 


(68) 


z 

which  is  24  hours  at  the  pole,  33.85  hours  at  latitude  of  45°  and  infinite  at  the  equator.  Normally  however 
when  a  Kalman  filter  is  being  employed,  the  corrections  to  the  inertial  system  errors  using  other  reference 
sensors  are  obtained  at  substantially  higher  frequencies  than  involved  here  which  "quench"  such  long-term 
oscillations  and  make  then  irrelevant.  All  in  all  then  in  most  cases,  the  Kalman  filter  designer  discovers 
that  the  dynamical  coupling  of  errors  associated  with  the  error  in  the  system  computed  Coriolis  accelera¬ 
tion  error  components  may  be  deleted  from  his  filter  design  model. 


USE  OF  THE  "*  EQUATION" 


As  noted  previously,  the  "ifi  equation"  (62),  provides  an  alternative  way  of  representing  attitude  error 
that  is  simpler  than  the  attitude  error  propagation  equation  (59),  provided  position  error  is  propagated  via 
(39)  such  that  attitude  can  be  recovered  at  any  time  instant.  Hence  in  most  Kalman  filter  design  models 
for  inertial  systems  the  two  following  equations  are  employed,  prior  to  further  simplification  which  may  be 
possible  in  some  applications,  to  describe  position  and  attitude  error  propagation: 

4$  =  4p  +  «6xp  (69) 

=  \ji  x  u  +  e.  (70) 

where  attitude  error  can  be  computed  at  any  time  via: 

♦  =  y  +  46  (71) 


ALTERNATIVE  DEFINITIONS  OF  AZIMUTH  ERROR 


The  two  attitude  errors  ♦  define  the  tilt  of  the  platform  frame  relative  to  the  reference  frame  with 
4  defining  the  azimuth  misalignment.  Since  error  in  system  computed  position  is  defined  by  the  error  in 
latitude  4 ♦  ,  and  longitude  4X,  it  is  clear  by  (4)  that  the  two  level  position  error  variables  4  0X  „  are 
sufficient  to  define  position  error  with  4  8,  as  an  error  variable  to  define  the  azimuth  misalignment  of  the 
system  computed  geodetic  frame  due  to  error  in  the  computed  wander  angle  4  a  and  the  error  in  knowledge 
of  the  north  direction  due  to  an  error  in  knowledge  of  longitude.  Consequently  there  appears  to  be  a 
form  of  redundance  In  the  system  azimuth  misalignment  definition  which  if  properly  exploited  might  lead  to 
a  simplification  of  the  describing  error  model  equations  by  reducing  by  one  the  number  of  azimuth  error 
states  to  be  considered.  A  review  of  the  material  presented  above  reveals  that  the  behavior  of  the  wander- 
angle  of  the  reference  frame  a,  has  not  specifically  been  defined.  The  discussion  to  this  point  has  only 
assumed  that  whatever  this  behavior  is,  the  resulting  error  variables: 


♦.  *°p  '  0 


40  4  . :  -  o  ♦  4  X  sin  4 
«  c 


(72) 


remain  small  such  that  second  order  effects  can  be  ignored  preserving  the  linearity  of  the  error  model. 


In  the  following  discussion  we  confine  ourselves  to  non-singular  mechanizations  of  the  local-level, 
wander- azimuth  navigation  mechanizations  as  noted  at  (8A ,  9A)  of  the  appendix.  Without  loas  of  generality 
and  (or  ease  of  exposition,  we  consider  the  most  common  wander- azimuth  mechanization  wherein: 


which  yields  via  (13A) 

and  via  the  definition 


0 


i 
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V  U*  *e 


(73) 


(74) 


P,  «  *  «P,  (?$) 

Note  a  complete  definition  of  all  terms  In  the  error  equations  (20,  34,  62)  now  only  requires  specific  defini¬ 
tion  of  the  terms  pt,  4pt,  such  that  overall  linearity  ia  preserved.  This  specification  la  obtained  by  im¬ 
posing  two  constraints.  The  first  constraint  is  (16),  which  by  (73,  74)  yUtae: 

a  *  -p^  ten  ♦  -  6pf  (70) 

The  second  constraint  results  from  the  selection  of  aaimuth  behavior  for  the  error  model  in  accordance  with 
one  of  the  alternatives  discussed  below  which  then  obtains  the  specification  of  tp*- 

THE  CASE  OF  THE  EIGHT  STATE  LEVEL  AXES  ERROR  MODEL 


In  what  might  be  termed  the  'normal*  error  model  we  apply  the  constraint: 


which  yield*  the  result  via  (75)  that: 


B  0 


«Pt  •  9 


(77) 


(78) 


Clearly  (77,  78)  are  small  so  that  an  eight  state  linear  model  for  the  error  propagation  is  obtained  using 
(20,  39,  62,  77,  78).  The  full  Cat  of  eight  differential  equations  are  summarised  in  Table  1  where  the 
additional  simplifying  assumption  of  nominally  constant  altitude  of  flight: 

V  =  0 

z 

A2  =  Y  (79) 

has  also  been  made.  An  additional  property  of  this  error  model  not  fundamental  to  the  error  propagation 
itself  is  seen  to  be  via  (76): 

o  =  -pN  tan  6  (80) 

THE  CASE  4  6  5  0 
z 

In  many  applications  it  turns  out  as  a  practical  matter  that  the  "position"  azimuth  error  term  4  8  of 
the  "normal"  error  model  is  small  relative  to  the  platform  azimuth  misalignment  9  ,  and  can  simply  be* 
ignored  in  the  Kalman  filter  design.  This  amounts  to  the  presumption  that  the  fystem  computed  local-level 
geodetic  frame  obtains  the  same  azimuth  as  the  local-level  reference  frame  even  though  it  is  angularly- 
displaced  from  it. 

Thus  we  have: 


6  8  5  0 

z 

(81) 

yielding  via  (38),  since: 

66  =0 
z 

(82) 

that: 

6  p  =o  *48  -p  *66 
*  x  y  y  x 

Hence  from  (75): 

(83) 

=  ey  *  <flx'  *x  ‘  i9y 

(84) 

Clearly  again  since  (83,  84)  are  small,  a  linear  model  for  the  error  propagation  is  obtained  from  (20,  39,62, 
83,  84)  except  that  in  this  case  by  (81)  it  has  only  seven  instead  of  eight  states.  The  seven  differential 
equations  for  this  model,  Ignoring  second-order  effects,  are  summarised  in  Table  1.  Additional  properties 
of  this  error  model  by  (76),  (60),  (4)  are: 

a  »  -p„  tan  ♦  ♦  p  *68  *  p  *66 

N  y  x  x  y 

*  £  4 

Ti  T* 

(85) 

U  a  ■  61  sir  S 

THE  CASE  4#  *  0 

The  natural  alternative  to  ths  prior  case  is  that  where  the  azimuth  of  the  platform  frame  coincides 
with  that  of  the  reference  frame  such  that: 

♦»  i0 

(66) 

yielding  via  (59),  since: 

i,  »o 

(87) 

that: 

6  e  “  4  ♦  4y  *  **j(  "  *  My  *  ** 

which  by  (7,  17,  68,  62)  can  be  expressed  as: 

(86) 

4 P  «p  •  4 8  - p  >66  ■  i 

a  *x  y  y  *  Ts 

(89) 

Hence  from  (75): 

Cg  “  *  Oy  *  4  8^  *  p#  •  4  8^ 

(90) 

Clearly  again  since  (19,  90)  are  small,  a  linear  error  ejodel  for  the  error  propagation  is  obtained  front  (20, 
39,  62,  $9.  90)  where  in  this  case  only  seven  states  are  present  since  the  substitution: 

(91) 


ia  mads.  The  seven  differential  equations  for  this  model  Ignoring  second-order  effects,  ere  summarised  in 
Table  1.  Additions!  properties  of  this  model  by  (76),  (4)  are* 


A  COMPARISON  OF  THREE  LEVEL  AXES  LINEAR  ERROR  MODELS  FOR  THE  LOCAL-LEVEL,  WANDER- AZIMUTH  (p 


**«:<•«  (62)  4  8^,  =  48^  cos  a  -  5  8g  sin  a  (4) 

4t>*«0  (W>  4  6^  =  4  X  sin  ♦  +  4  a 

0  *  4  8  (19)  4  p  =  -  4V  / R 

*  -  5*  (60)  4  p*  =  4V  /R  (16 
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a  =  -  p„  tan  4>  +  +  p  *48  -  p  *40  (92) 

N  rz  y  x  x  y 

4  a  =  -  ip  -  4  sin  t 
rz 

One  advantage  of  this  design  model  relative  to  the  prior  approximation  can  be  appreciated  by  comparing 
the  velocity  and  position  error  equations  in  the  second  and  third  columns  of  Table  1.  This  comparison  indi¬ 
cates  that  the  effect  of  azimuth  misalignment  can  be  modeled  either  at  the  acceleration  level  (4  es  =  0)  or 
the  velocity  level  ( $2  =  0) .  The  latter  model  can  often  be  exploited  to  reduce  computer  duty-cycle  require¬ 
ments  in  that  the  coefficient  terms  (px,y)  being  in  effect  integrals  of  the  acceleration  terms  (AX(y) ,  are 
smoother  and  easier  to  deal  with  in  implementing  the  model  in  the  digital  computer. 

A  PARTICULAR  KALMAN  FILTER  DESIGN  MODEL 

If  we  combine  a  number  of  the  error  model  simplifications  discussed  above,  e.g.,  elimination  of  the 
vertical  axis  model  and  Coriolis  acceleration,  use  of  the  equation"  and  the  azimuth  error  definition 
42  =  0  ,  the  following  relatively  simple  seven  state  design  model  for  a  Kalman  filter  is  obtained: 
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This  model  has  proved  useful  in  practice. 
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APPENDIX  -  SUMMARY  OF  LOCAL-LEVEL  NAVIGATION  MECHANIZATION  EQUATIONS 


The  coordinate  frame  [x,y,z]  referred  to  herein  has  two  axes  [x,y]  in  the  level  plane  with  the  z 
axis  normal  to  the  earth's  surface  at  the  position  of  the  inertial  system.  The  rates  of  change  of  the 
Inertial  system  velocity  relative  to  the  earth  taken  relative  to  the  rotating  local-level  coordinate  frame  and 
expressed  along  these  axes  are; 


u 


V 


X 


=  A  +  C  - 


Y 


(1A) 


where: 


A  4 


is  the  specific  force  measured  by  the  accelerometers  due  to  the  true 
system  acceleration  and  modeled  gravity  components  expressed  along 
the  local-level  coordinate  axes.  These  measurements  can  be  obtained 
by  direct  instrumentation  wherein  the  accelerometers  are  maintained 
coincident  with  the  local-level  coordinate  system  as  in  a  Schuler-tuned 
mechanization  or  transformed  onto  these  coordinates  as  occurs  with 
strapdown  or  space-stable  inertial  systems. 


c  A 


[ <P  +  2fl)s 
t(p  +  2ft)x 
I(P  +  20)y 


Vy  -  (P  ♦  2fl)y 

V  -  (p  +  20), 

V  -  (p  +  20) 

x  x 


are  the  Coriolis  acceleration  components  which 
I  +  201  V  account  for  earth  rotation  0,  and  local-level 
p  i  '  navigation  frame  rotation  w  -  p  +  U,  relative 
to  inertial  space. 


P  A 


are  the  angular  rates  of  rotation  of  the  local-level  coordinates 
expressed  about  these  axes  which  result  as  the  system  moves 
relative  to  the  earth. 


0  A 


LV* 


are  the  angular  rates  of  rotation  of  the  earth  relative  to  inertial 
space  expressed  about  the  local-level  coordinate  axes  . 


Y  4 


y 


are  components  of  the  modeled  gravity  vector  of  the  earth.  Usually 
the  normal  gravity  field  is  assumed  which  corresponds  to  an  ellipsoidal 
equipotential  surface  as  the  reference  figure  of  the  earth  where 
Yj,  =«  y y  3  0  and  the  effect  of  earth  rotation  rate  is  included  in  the 
gravity  determination. 


|p  ♦  201  A 
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-(p  ♦  20)^ 

<o  ♦  20) 
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<0  ♦  20)t 
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-<p  ♦  2Q)x 


-<P  ♦  20)y 
<P  ♦  20)^ 
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is  a  skew- symmetric  matrix  expressing  the  sum 
of  earth  rate  and  local-level  frame  spatial  rate 
with  respect  to  inertial  space. 


The  relative  angular  rates  of  rotation  are  computed  after  the  accelerometer  measurements  are  corrected  for 
Coriolis  accelerations  and  the  norma)  gravity  and  integrated  to  obtain  the  computed  level  velocity  compo¬ 
nents  V  ,  as; 

*«y 


(2A) 


where; 

R  are  the  radii  of  curvature  of  an  ellipsoids!  equipotential  surface  assumed  as  the  figure  of 
'y  the  earth.  Note  different  datums  (different  ellipsoids  of  reference)  are  often  used  which 
better  Qt  the  geoid  in  the  different  local  areas  of  system  use. 


♦  ,*  are  the  geodetic  latitude  and  longitude  of  system  position 
a  is  ths  azimuth  of  the  y  axis  In  the  level  plane 

pu  •  (p  sin  a  ♦  p„  cm  a)  Is  the  relative  angular  rate  about  the  north  axis. 

n  *  .  y 

Note  a  a  0  In  the  expression  for  p  above  yields  ths  north-slaved  local-level  mechanization 
U  a(0)  -  0.  * 


The  earth  rate  components  are  computed  as: 


ra  *■ 
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“ft  cos  *  sin  o" 
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ft  cos  $  cos  a 
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L  ZJ 

(3A) 


The  relative  angular  rates  of  change  can  be  used  to  compute  the  change  In  inertial  system  geodetic 
position  and  the  azimuth  angle  a,  of  the  local-level  coordinate  system.  Normally  this  is  achieved  using  a 
direction  cosine  mechanization  of  the  form 


ID]  =  (pi  [D] 


(4A) 


‘x*1 

-x- 

y 

=  ID] 

Y 

Zm 

The  matrix  (D]  is  the  orthogonal  direction  cosine  transformation  between  a  set  of  earth-fixed  axes  [X , Y ,Z], 
usually  with  Y  coincident  with  the  terrestrial  pole  and  X,  Z  in  the  equatorial  plane,  and  the  local-level 
coordinate  system  as  illustrated  in  Figure  1.  In  this  case  the  local-level  coordinate  axes  are  realized  from 
the  earth-fixed  coordinates  by  performing  a  counter-clockwise  rotation  about  the  earth's  polar  axis  X,  a 
clockwise  rotation  ♦  ,  about  the  displaced  X-axis  in  the  equatorial  plane  and  a  final  counter-clockwise 
rotation  a,  about  the  then  vertical  axis  z.  More  explicitly: 


<X,x> 

<Y  ,x> 

<Z  ,x* 

ID)  4 

<X,y> 

<Y,y> 

<Z  ,y> 

<X,*> 

<Y,*> 

<Z,z> 

(5A) 


<X,x>  =  cos  a  cos  X  -  sin  a  sin  *  sin  X 

<X,y>  =  -  sin  a  cos  X  -  cos  a  sin  ♦  sin  X 

<X,s>  *  cos  ♦  sin  X 

<Y,x>  -  sin  a  cos  ♦ 

<Y  ,y>  e  cos  a  cos  ♦ 

«Y,*»  «•  sin  ♦ 

<Z,x>  =  -  cos  a  sin  X  -  sin  a  sin  ♦  cos  X 

<Z,y»  e  sin  a  sin  X  -  cos  a  sin  ♦  cos  X 

<Z,»  >  =  cos  ♦  cos  X 

Further: 


<6A) 


is  the  snU-symmetric  matrix  of  relative  angular  rates  of  change  the  navigation  coordinate  axes  relative  to 
the  earth-fixed  frame  expressed  in  the  navigation  coordinate  axes.  These  rates  result  as  the  inertial  sys¬ 
tem  position  changes  relative  to  the  earth.  More  explicitly  the  geodetic  position  and  wander  angle  rate#  of 
change  easy  be  expressed  ss: 


♦  «  -p£  »  (py  sin  o  >  cos  a) 

X  ®  pN  sec  |e]  (7A) 

a  a  Pt  ‘  0N  ten  !♦] 


As  s  note  of  interest  only  six  elements  of  the  direction  cosine  matrix  need  to  be  propagated  in  the  mechani¬ 
sation  of  the  navigation  equations.  Inspection  of  the  propagation  equation  (4A)  reveals  that  the  elements 
of  any  ct  ;umn  of  the  direction  cosine  matrix  is  propagated  using  only  the  other  two  elements  of  the  column 
and  the  appropriate  relative  angular  rate*.  Any  column  (row)  tpecifiet  an  earth-fixed  (local-level)  coordi¬ 
nate  axis  relative  to  the  local-level  (earth-fixed)  frame.  Two  such  columns  (rows)  are  sufficient  to  com¬ 
pletely  define  the  transformation  since  the  missing  axis  is  simply  the  vector  cross-product  of  the  other  two 
axes,  e.g,,  Z  ■  XxY.  Further  inspection  of  the  direction  cosines  (SA)  reveals  If.o.X)  can  be  determined 
fully  from  five  of  the  direction  cosine  elements. 


where  p„  varies  dependent  on  the  type  of  azimuth  mechanization  selected.  The  direction  cosine  mechani¬ 
zation  avoids  the  apparent  signularities  above  if  the  relative  angular  rates  Px,y,z>  remain  non-singular. 
The  level  components  are  non-singular  functions  of  the  system  velocity  relative  to  the  earth  via  (2A).  The 
azimuth  relative  rotation  rate  p.  is  usually  selected  to  be  a  non-singular  function  y ,  which  can  but  need 
not  be  a  function  of  the  computed  navigation  variables! 

P  =  y  (8A) 

z 

Note  for  non-singular  behavior  of  pt,  the  rate  of  azimuth  wander  angle  change  is  singular  as: 

a  =  —  Pjj  tan  ♦  +  y  (9A) 

The  final  navigation  equation  is  that  used  to  compute  the  system  position  change  along  the  local  vertical 
relative  to  the  reference  ellipsoid.  The  equation  that  is  integrated  to  obtain  elevation  change  is: 

h  =  V  (10A) 

2 

In  words,  the  time  rate  of  change  of  elevation  is  the  system  velocity  relative  to  the  earth  projected  along 
the  vertical  axis  of  the  local-level  reference  frame. 

ACTUAL  REPRESENTATION  OF  THE  LOCAL-LEVEL  NAVIGATION  MECHANIZATION  EQUATIONS 

The  correct  representation  of  the  rate  of  change  of  velocity  relative  to  the  earth  with  respect  to  the 
local- level  coordinate  frame,  that  is  actually  mechanized  in  the  system  computer  differs  from  (1A)  and  is 
written: 


where: 


and: 


(11A) 


are  the  accelerometer  measurements  as  they  are  made  along  the 
platform  coordinate  axes  as  introduced  in  the  text  of  the  chapter. 
Note  since  the  accelerometers  measure  the  sum  of  system  acceleration 
and  actual  gravity  (as  opposed  to  modeled  gravity)  and  further 
make  these  measurements  with  error  due  to  instrument  imperfections, 
the  vector  An  should  implicitly  include  such  effects.  We  choose 
here  however  to  represent  the  difference  between  actual  and  modeled 
gravity  and  instrument  measurement  error  both  in  the  vector  9 
of  the  text  and  let  An  represent  the  sum  of  system  acceleration  amd 
modeled  gravity  as  viewed  from  the  platform  coordinate  axes. 


are  the  components  of  the  modeled  gravity  that  are  computed  using 
system  computed  values  of  position  and  wander  angle  it  ,X,o|c. 


are  components  of  the  earth  rotation  rate  that  are  computed  using  the 
system  computed  values  of  position  and  wander  angle,  ( ♦  .aj^. 


(12A) 


are  the  ayetem  computed  values  of  tha  relative  angular  rate  using  the  system  computed  veluee  of  velocity 
V  ,  that  result  from  tha  Integration  of(liA),  the  redli  of  curvature  that  are  computed  using  ayetem  com¬ 
puted  value*  of  position  and  wander  angle  It, a)  and  is  the  aslmuth  relative  rotation  rate  that  is  not 
explicitly  specified  at  this  paint.  Note  however  the  tame  In  this  equation: 


♦  pw  un  ♦ 

”e  c 


are  all  obtained  freer  values  which  are  computed  in  the  system  computer.  The  reader  Is  cautioned  not  to 
aeeocUte  the  velocity  triplet  V  ,  with  the  computer  coordinate  system  Introduced  In  the  text  of  the  chapter 
hut  simply  consider  it  as  the  set  of  numbers  that  result  from  the  integration  of  Equation  (llA). 
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The  actual  direction  cosine  matrix  between  the  earth-fixed  axes  and  system  computed  geodetic  frame  is 
obtained  using  the  system  computed  relative  angular  rates  between  these  two  coordinate  frames  as; 

tblc  =  [pJc  ED]C  (14A) 

where: 


l»lc  A 


and  [0 1  is  the  computation  of  [D]  in  (SA)  above  using  [♦  , X, a]  . 
c  c 

Finally,  the  actual  representation  of  the  equation  which  is  integrated  in  the  system  computer  to  obtain 
elevation  is: 


(15A) 
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SUMMARY 

For  destinating  interception  trajectories  for  torpedo-ship  systems  it  is  evident  to  use  on  one  hand  optimal 
control  theory  and  on  the  other  hand  optimal  filtering  techniques.  At  first  a  simple  mathematical  model  is 
given  for  the  plane  torpedo  motion,  which  is  sufficient  to  investigate  principal  effects  of  controlling 
and  filtering.  For  known  target  trajectories,  then  in  a  first  main  part  of  the  paper  an  optimal  control  law 
is  derived  which  implies  the  minimization  of  a  linear  combination  of  interception  time  and  engine  energy. 
The  second  main  part  is  dedicated  to  the  filtering  problem  of  the  measurement  data.  Thereby  it  is  shown 
that  for  the  torpedo  a  continuously  working  Kalman  filter  can  be  used.  For  generating  the  target  trajectory 
it  is  more  advantageous  to  take  discrete  equations  in  relative  coordinates.  For  this  an  appropriate  algo¬ 
rithm  is  derived  which  bases  on  an  extended  Kalman  filter.  This  algorithm  is  used  for  smoothing  and  filte¬ 
ring  the  target  (ship)  data  and  furthermore  for  generating  the  target  trajectory.  As  the  central  point  of 
the  paper  is  not  the  mathematical  theory  but  the  engineering  application  for  all  theoretical  derivations, 
simulation  results  are  given,  which  are  obtained  by  calculations  on  a  hybrid  computer. 

1.  INTRODUCTION 

Besides  applications  in  aeronautics  and  astronautics  especially  in  naval  engineering  interception  problems 
often  occur  for  the  guidance  of  naval  vehicles  by  which  one  understands  in  general  the  "hard"  interaction 
of  two  moving  objects,  e.  g.  a  target  and  a  pursuer.  This  means  mathematically  that  at  a  prescribed  or  by 
an  algorithm  (guidance  law)  fixed  point  of  time  the  position  coordinates  of  the  two  moving  vehicles  have 
to  be  equal. 

A  well-known  interception  trajectory  is  for  example  the  dog  fight  curve,  where  a  pursuer  is  controlled  in 
such  a  way  that  hia  forward  axis  is  always  directed  on  the  target  tl).  This  classical  trajectory  has  the 
advantage  of  a  gor.eraUy  good  accuracy.  The  disadvantage  is  the  long  interception  time  especially  in  Guch 
cases,  where  the  velocities  of  the  target  and  the  pursuer  are  nearly  equal  1 2 J .  Referring  to  this  the 
different  so-called  interception  course  procedures  [3l  including  the  proportional  navigation  [u]  are  more 
favorable.  They  use  some  information  about  the  motion  of  the  target  in  advance  in  order  to  hit  the  target 
at  a  prescribed  collision  point.  Especially  at  the  end  of  the  motion,  theee  procedures  have  the  disadvan¬ 
tage  that  stability  problems  occur  in  the  guidance  law  which  deteriorate  the  accuracy  or  eut  off  the 
interception  completely  (Sj.  Modern  time  domain  procedures  of  the  control  theory,  as  for  instance  the 
quadratic  optimal  control,  are  used  in  only  a  few  cases  by  now  for  the  conception  of  guidance  laws  for 
interception  trajectories  t&),  [7],  The  reasons  for  this  lie  in  the  relatively  high  mathematical  expen¬ 
diture  which  is  troublesome  especially  for  the  solution  of  probleas  in  real-time. 

Depending  on  the  set  problem  tise-and  fuel-  or  energy-optimal  algorithm*  are  offering  for  modern  solution* 
of  interception  trajectories.  By  this,  approximately  the  spectrum  of  the  most  important  applications  for 
the  problem  torpedo-ship  is  covered.  Because  of  the  complexity  of  the  equations  of  motion,  especially  of 
the  pursuer,  the  guidance  laws  cannot  given  analytically  in  this  case.  If  one  wants  to  exclude  numerical 
procedures,  which  is  useful  for  basic  declarations  and  necessary  for  real-time  tasks  then  one  obviously 
has  to  simplify  the  mathematical  model  of  the  pursuer.  A  reasonable  restriction  is  the  consideration 
of  only  plane  motions,  such  as  they  occur  for  example  for  an  interception  torpedo-surface  ship. If  one  empha¬ 
sises  furthermore  energetic  points  of  view  then  one  gets  linear  equations  of  motion,  which  are  derived 
from  the  kinetics  of  mass  points  with  in  the  mean  only  little  alteration*  of  the  velocity  of  the  pursuer. 

On  this  model  it  is  easy  to  apply  the  optimal  control  theory. 

The  realisation  of  the  optimal  control  laws  assumes  ths  measurement  of  the  position  and  the  velocity 
coordinates  of  the  target  (ship)  and  ths  pursuer  (torpedo).  In  both  cases  atochsstic  disturbances  occur. 
Because  of  the  linear  mathematical  modal  of  the  pursuer  the  disturbances  for  this  one  can  be  eliminated 
in  a  simple  may  by  using  a  Kalman  filter.  In  the  cane  of  the  target  the  task  is  not  so  easy, ss  the 
target  trajectory  it  generally  not  known  and  haa  to  be  generated  from  the  measurement  data.  As  the 
measurement  equation  is  nonlinear  and  a  linearisation  about  a  nominal  trajectory  is  not  possible,  one  gets 
et  first  *  nonlinear  filter  problem  (8)  for  the  target  trajectory.  However,  this  problem  cen  be  attributed 
to  an  extended  Kalman  filter  so  that  altogethar  this  measurement  task  can  be  handled  by  the  linear 
theory,  too. 

2.  MATHEMATICAL  MODEL 


Starting  from  a  plana  motion  for  the  target  and  the  pursuer  and  assuming  the  validity  of  tha  kinetics  of 
mass  points  for  ths  pursuer  then  the  normal l ted  aquations  of  motion  for  the  torpedo  can  be  described  in 
accordance  with  Fig.  1  In  the  form 
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Thereby  x.  and  x.  are  the  position  coordinates  of  the  torpedo,  x,  and  x.  its  relative  velocities  in  rela¬ 
tion  to  its  flo*(  field,  x,  und  x,  the  components  T  ,  and  T  ,  of  the  thrust  T,  x_  the  direction  of  the 
thrust  in  the  coordinate  System  Cx,,x.)  and  Xg  thexSngularxvelocity  of  the  torpedo.  The  components  of  the 
flow  field  are  v  and  v. .  Further  the  coefficient  in  the  quadratic  drag  law  is  denoted  with  e,  and  the 
moment  of  inertia  of  the  torpedo  about  its  yaw  axis  is  denoted  with  y .  The  time  constant  of  the  engine 
is  6,  the  amplification  of  the  engine  is  k.  As  control  variables  are  introduced  u, ,  as  a  measure  for  the 
amount  of  the  thrust  T,  and  u.  as  the  rudder  angle.  The  time  is  denoted  with  t,  accordingly  to  that 
d/dt  =  (*). 

The  system  of  the  equations  (2.1)  is  nonlinear  and  additionally  time  varying  as  far  as  instationary  flow 
fields  are  permitted.  Altogether  it  is  not  accessible  to  an  analytic  treatment  within  the  scope  of  an 
optimization  problem.  Therefore,  in  the  following  the  simplifications 

1)  Y  "*  0 

2)  0  -*  0 

3)  vf  a  const. 

4)  x^  ♦  x*  w  1 

are  taken,  i.  e.  the  dynamics  of  the  angular  motion  of  the  torpedo  and  that  of  the  engine  are  neglected 
and  a  constant  flow  field  is  assumed.  Furthermore  the  system  (2.1)  is  considered  to  be  normalized  in 
euch  a  way  that  in  addition  to  v  alto  tha  relative  velocity  v  of  the  torpedo  can  be  regerded  approxi¬ 
mately  at  constant  in  its  absolute  value.  Under  these  assumptions  the  system  (2.1)  turns  in 


X1  ° 


*3  +  V1 

.  VV 

axio 

\  ♦  v2 

*  X2(V 

°  X20 

-  **Xj  -  u^costUj) 

•  *3<V 

8  *30 

-  c*x4  -  u^einUj) 

.  \<V 

8XU0  * 

(2.2) 


*2  = 

*3  S 
« 

8 


As  new  control  variables  now  u,  as  thrust  and  u.  as  its  direction  in  the  coordinate  system  (x  ,x,)  are 
introduced.  Teking  finally  is  is 


Uj^  a  u^coafuj) 


Uj  a  Uj.siniuj) 

the  linear  pursuer  dynamic  follows 


(2.3) 
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0  0 
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which  means  that  the  motioo  of  the  torpedo  is  describe*}  by  a  linear  system  of  the  font 
x  a  A\x  ♦  B*u  ♦  *  ,  x(t0>  a  x^  . 


(2.5) 
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At  first  it  is  assumed  for  the  target  that  its  dynamics  is  completely  known  for  t  ^  t  .  It  will  be 
described  by  the  position  coordinates  ' 


«1  3  «i(t) 

C2  =  C2(t) 

and  the  belonging  velocities 
43(t)  =  Ej_ 

3  h  • 

so  that  the  vector  of  the  relative  motion  of  target  (ship)  and  pursuer  (toi-pedo)  is  given  by 
e(t)  =  £(t)  -  x(t)  . 

3.  OPTIMIZATION  PROBLEM 


(2.6) 


(2.7) 


(2.8) 


In  the  following,  first  the  optimisation  problem  is  formulated,  then  it  is  solved  and  discussed  in 
detail . 

3.1  Formulation 

As  mentioned  preliminary,  for  an  interception  torpedo-ship  time-  and  energy-optimal  solutions  are  of 
interest.  Therefore  the  optimisation  problem  -  this  means  the  destination  of  the  control  variables 
tf.  and  u0  in  Eq.  (2.4)  and  the  determination  of  the  inter 
that 


the  quadratic  performance  index 

X 

J(u,T)  s  ~eT(T)>P*e(T)  ♦  ^  •  f  (  «♦< l-« ) •  C eT( t: ) *£•«( t )  ♦  u*(t)j)*dt 

*  t  s0  ~  ~  1 


interception  time  T  -  is  formulated  in  such  a  way 

(3.1) 


is  minimised.  Thereby  the  weighting  factor  4  considers  the  interception  time  T.  It  is  defined  for 

0  <  4  <  i  .  (3.2) 

Consequently  one  gets  for  4=0  pure  energy-optimal  and  for  4  *  l  pure  time-optimal  solutions.  Decidedly, 
the  letter  case  should  he  excluded  because  of  the  linearity  of  the  optimisation  problem.  The  weighting 
matrices  P  and  £  in  Eq.  (3.1)  ere  given  by 
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the 


whereby  with  P  the  interception  condition  s.(T)  =  t,(T),  xJT)  •  (_,(T)  and  with  £  a  weighting  of 
control  d«viaflone(t)  is  realised.  Consequently  it  is  i  0,  Pjj*’  0  and  q^  _»“o,  q?J  0. 

3.2  Solution 

For  the  solution  of  the  optimisation  problem  in  the  following  the  absolute  value  of  the  thrust  u,(t)  is 
considered  to  be  unconstraint.  9y  this  the  control  variables  tf.(t)  and  tiL(t)  are  unconstraint ,  too. 
Therefore  one  can  use  the  methods  of  the  calculus  of  variation!,  an  application  of  the  maximum  principle 
Is  superfluous.  Moreover,  by  this  the  linearity  of  the  optimisation  problem  Is  guaranteed. 

from  the  calculus  of  variations  (9)  one  gets  for  the  optimal  control  variables 
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with  f.  and  ♦  as  components  of  the  adjoint  vector  a  belonging  to  a.  With  Eq.  (5.4)  it  follows  from 

Eq.  <3T3> 


U1  9  “  *  ♦ett.T)' 

tan(Uj)  a  ♦w(t,T)/f3(t,T)  . 
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The  optimal  interception  time  T  can  be  computed  from 


i)  -  eT(T)-P-e(T)J  =  0  , 


with  H  as  HAMILTONIAN.  The  condition  (3.6)  is  equivalent  to  the  demand 

^  =  0.  (3.7) 

The  explicit  solution  of  the  optimization  problem  now  requires  the  destination  of  the  adjoint  vector 
jfr ( t ) .  If  this  is  done  then^the  optimal  control  variables  are  explicitly  known  from  Eq.  (3.4),  the 
optimal  interception  time  T  can  be  computed  from  Eq.  (3.6)  and  the  ooen-loop  control  system  (2.4),  (3.4) 
can  be  fed-back.  For  this  at  first  a  special  case. 

3.2.1  Special  Case  2=0 

In  the  special  case  £  *  0, where  no  weighting  of  the  control  deviation  o(t)  in  the  performance  index  (3.1) 
inppens , the  destination  of  jjKt)  and  by  this  the  solution  of  the  optimization  problem  is  very  simple.  One 
gets  for  the  optimal  thrust  and  for  the  optimal  direction  of  the  thrust  in  the  coordinate  system  (x^,Xj) 

,  (3.8) 

tan[Uj(T)]  =  tan[o(T)]  . 

Thereby  K  and  a  ere  constants  of  integration,  which  follow  from  the  transversal icy  condition 

♦j(T)  =  p.j-U.tTJ-x^T)},  i  =  1,2  (3.9) 


and  are  given  by 
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For  an  interception  'qual  for  both  position  coordinates,  j>  a  p„,  s  p  is  set.  The  auxiliary  functions 
k^(T), . . . ,kj( t ,T>  in  Eq.  (3.10),  which  are  introduced  as  aMreviitiono,  are  defined  by 

VT1 5  *10  •  vT  *  •  VT- 
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1 3.11) 


kj(t,T)  a  -  i  *  *»t  *  »  4  '  «  e  t*,'4 l-*?ht't*\») 


As  it  can  be  seen  fw#  Eq.  (.*,«)  the  optseal  thrust  is  an  *npr*«wo*i  function  with  u  CT,T)  *  0  and  its 
direction  is  constant,  wb*ch  mentis  that  the  control  procedure  works  with  an  opt  leal  deflection  angle,  as 
the  target  coordinates  (  (V)  and  L J T )  are  considered  as  completely  kncntn  before.  With  Eq.  (3.4)  and 
Eq.  (3.10)  for  the  trajectory  os'  tse  pus-suer  follows 


ai(t,r)  a  a  *  v  *t  *  -^-(l-e****)  »  t ,T) 

i'll -«)  * 

M  ,T)  a  *  V  -t  s  -SS-U-V***)  a  (t,T)  , 

*  10  J  *  <3*f 1*4)  ? 


(3.1J) 


which  results  in  tnv  special  case  of  vanishing  initial  velocities  *  s  a.  -  0  and  vanishing  the  flow 
field  :  *j  <  0  )ii  10  wo 
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The  optimal  trajectory  is  ttnrefsri  a  Straight  line.  Finally,  the  optimal  {.atandspt (on  ti«s  *  is  siianpotad 
fro* 


V .  H.  j-^1^-} 
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which  has  to  ba  done  nusarlcally . 


An  example  for  the  solution  derived  shows  Fig.  2  and  Fig.  3.  In  this  example  the  target  trajectory 


^(t)  S  (l+Vjj-t 

t2(t)  =  0 

for  !  <  0  and 

C1(t)  =  sin(t)  ♦  v^t 
C2(t)  =  cos(t)  -  1  +  v  .t 


for  t  >  3  is  assumed.  Thus  the  trajectory  is  a  circle  which  is  displaced  by  the  velocities  and  v  of 
the  flow  field.  For  the  pursuer  the  initial  conditions  x,  =  0,  x.  =  1,  x_  =  x^q  =  0  are  valid.  The  drag 
coefficient  is  c  -  1.  Furthermore  it  is  v  =  0,1,  v  =  0i5nd  p  =  io3.  Aim  of  the  Investigation  is  tj  look 
for  the  influence  of  the  weighting  i  4  which  is  responsible  for  the  optimal  interception  time  T. 

In  Fig.  2  the  optimal  interception  time  ?  mid  the  constants  of  integration  K  and  o  are  drawn  over  the 
parameter  4.  In  Fig.  3  the  target  trajectory  is  pointed  out  and  for  different  values  of  4  the  interception 
trajectories  are  given.  The  figuration  outlines  that  by  an  appropriate  choice  of  4  any  combinations  of 
time-  and  energy-optimal  pursuer  trajectories  can  be  realized. 


The  solution  so  far  includes  only  the  open-loop  control  problem.  If  a  closed-loop  control  is  needed, 
then  the  theory  given  in  (9)  yields  with  p  *  p2J  s  p 
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which  mea.ta  that  the  optimal  closed-loop  control  law  is  of  the  form 

u(t,T)  =  R(T-0-x(t)  -  fi(t.T)  .  (3. IS) 

So  it  consists  of  a  linear  feed-back  and  additionally  of  a  feed-forward  control.  The  time-varying 
coefficients  of  the  control  law  in  £q.  (3.15!  are  given  by 
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and  for  the  feed-forward  control  one  gets 

g.(t,t)  «  «“1/2(T-t)*tj(T)-vl-rJ/Si{T-U 


(J.19) 


which  follows  both  from  the  solution  of  a  KSCCATlwatrS*  differential  elation  (»'  The  central  law 
depends  or.  the  target  final  coordinates  f.(T)  and  MTS  bmcavse  of  she  feed -fore,-.  \  control  gft.TK 
Therefore  these  are  to  be  known.  The  coefficient#  of  the  controller  are  pu-e  *  'j.-e  functions,  which  can 
he  calculated  off-line.  For  p*»l  they  are  nearly  !  dependent  of  4,  which  ee»n#  that  the  weighting  of  the 
interception  tie*  ia  needed  only  for  destinatSfg  itself  from  (3.1).  exemplary  for  «  -  1  and  p  *  lO3  the 
Coefficients  of  the  controller  are  drawn  in  tit-  Jn  order  is  make  them  independent  of  the  inter¬ 
ception  time  T  and  the  parameter  4  ih»y  are  normalised  with  (1-4)  and  as  new  variable  tie  time  t  *  T-t  it 
introduced.  The  example  of  the  open-loop  control  in  fig.  b^and  fig.  9  is  again  taken  as  a  basis  for  the 
doted  loop  control  as  well.  In  fig.  5  the  optimal  thrust  u.(t!  i*  drawn  for  4  *  0,7  betfc  for  the  epen-aed 
closed  -  loop  control.  One  see*  the  formal  conformity  of  both1 curve*.  Only  el  the  end  of  the  trajectory 
little  deviations  occur «  which  base  on  the  interception  condition  which  is  with  p  =  10*  only  finite 
weighted.  Additionally,  In  fig.  5  the  optimal  thrust  for  one  with 

iv^t)  *  0,S*lo< t-0,S>-9(  t-l»5>«e<  t-J,S)-e{ t-3,5)) 

disturbed  motion  is  demon.  3y  this  disturbance)  which  includes  two  rectacgulaj-  pulses,  the  efficiency  of 
defensives  meaeuree  against  the  torpedo  can  be  simulated.  The  trajectories  of  the  problem  mentioned 
are  given  in  f  ig.  6.  Thereby  it  is  obvious  that  the  closed-loop  control  works  satisfying  also  is  tbs  case 
of  tbs  disturbed  motion. 


3.2,2  General  Case  2*0 


In  the  general  ca&a  Q  +  0  the  conditions  (3.4)  are  the  same  as  before.  Of  course  for  the  destination  of 
the  adjoint  functions  ^Tt)  and  f  (t)  now  distinctions  are  necessary,  as  the  optimization  problem  contains 
for  the  eigenvalues  X^  of  the  system  the  characteristic  equations 

X^-e2-X?+q.i  =  0  ,  i  s  1,2  (3.19) 

therefore  the  three  c;.ses 
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(3.20) 
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have  to  be  treated  separately.  As  the  procedure  is  identical  in  all  three  cases  nere  case  3)  is  taken, 
being  of  special  interest  in  practice. All  further  considerations  are  referred  to  this  case.  With  the 
abbreviations 


/l 

Yi =  sir 

•v/rKr-£]  ■ 


(3.21) 


which  are  the  absolute  values  of  the  real  and  imaginary  parts  of  the  eigenvalues  >.,  i  -  1,2,  and  the 
auxiliary  functions  given  in  Appendix  A  one  gets  x 


Wt’T)  1  -  [mii(t)  •  ^riTy-moi(t)]-,('io-(1-4)^ii-fi+2(t*T)  • 


oi 

The  constants  of  integration  i|^q,  i  =  1,2  are  related  to 
p.ku(T)  +  (l-i)-qu-k2i(T) 

1+rr«r1ii(T)+<1ii,:L2i(T) 


(3.22) 


(3.23) 


with  the  connection 


=  K*cos(a) 
4»2o  =  K*sin(a) 


(3.24) 


to  the  constants  K  and  a  used  before.  In  Eq.  (3.23)  it  is  assumed  again  p,,  *  p„-  *  p,  the  abbreviations 
k..  ,(T),  k,.(T),  l..(T)  and  12>(T)  are  assigned  in  Appendix  A.  In  Eq.  (3.22)  it  Is  now  important  that 
because  orxEq.  (AV4)  and  Eq.  CA.8)  the  solution  of  the  optimization  problem  depends  not  only  on  the 
final  state  (^(T),  £2(T))of  the  target  but  also  on  the  whole  target  trajectory  for  0  <  t  '  T  . 

With  Eq,  (3.22)  and  Eq.  (3.4)  it  is  now  possible  to  integrate  the  enuationu  of  motion  (2.4).  By  this 
the  trajectory  of  the  torpedo  is  known.  The  analytic  expressions  for  x,(t)  and  x_(t)  will  not  be  given 
here,  in  this  connection  it  is  referred  to  (lo).  In  order  to  show  the  influence  of  the  weighting 
matrix  £  it  is  referred  to  the  example  treated  before.  It  is  shown  in  Tig.  7  with  a  parameter  variation 
of  q,.  *  q„„  =  q,  the  time  of  interception  T  is  thereby  fixed.  It  i»  evident  from  the  graphs  that  with 
the  parameter  q  completely  different  interception  trajectories  can  be  realized.  This  is  important  in 
order  to  make  an  attack  to  the  ship  with  the  torpedo  for  instance  from  behind.  The  consideration  till 
now  are  only  valid  for  the  open- loop  control  problem.  If  one  wants  to  use  a  closed-loop  control,  which 
is  generally  necessary,  one  is  led  again  to  Eq.  (3.15).  The  coefficients  of  the  controller  are  now 
given  by 


d..(T-t)  d-.(T-t) 


ri+2(T-t) 


rd^T-tV 


•roi(T"t)  * 


a-«)*qu*L 


rC3rT-t)-d^T-tHdli(T-t).d?iiT-th 


d'i(T-t> 


(3.25) 


i  *  1,2 


The  abbreviation  r  .(T-t)  is  assigned  in  Appendix  B.  The  functions  c,,,...,d..  in  Eq.  (3.25)  follow  from 
Appendix  A,  if  one°repl«ces  there  the  argument  t  by  t-T.  In  contrasiJlo  Eq.  (j .17)  the  coefficient#  of 
the  controller  have  now  stationary  values  not  equal  to  zero,  which  can  be  eeen  from 


(3.26) 


r,  =  lim  r.(T-t)  =  ( 1-S) • VqTT 
1  T-*-  1  1X 

?i+2  =  lim  r>i+2(T-t)  =  (l-4M-etv42+2-V5^3  ,  i  =  1,2 

T-+« 

The  stationary  value  of  the  coefficient  r,  is  not  at  all  influenced  by  the  drag,  the  stationary  value 
of  the  coefficient  r,+2  only  little,  as  0j<  e  <_  1  can  be  assumed  from  physical  reasons  and  furthermore 
(3.20)  deals  vith  case  3).  Moreover  the  little  influence  of  e  on  r,  and  on  r^+2  is  not  only  valid  for 
the  stationary  values  but  in  general  also  for  0  ^  t  <  T  <  as  it  can  be  seen  from  a  numerical 

exploitation  of  Eq.  (3.25).  For  the  feed-forward- control- g^( t,T),  i  =  1,2  in  (3.15)  one  gets  now 


*i(t»T)  s^r«i(T)-Iri+2(T-t)-ai(T-t)1-vi  - 

-  qii-(ri(T-t)*iu(t,T)+ri+2(T-t)*i2.(t,T)+(l-«)-i4.(t,T)] 


(3.27) 


with  the  auxiliary  functions  h^(T-t)  and  O.(T-t)  given  in  Appendix  B.  The  integrals  i..(j=l,2,4;  1=1,2) 
are  thereby  explained  as  in  Appendix  A  but1now  with  t  as  lower  and  T  as  upper  integration  limit.  From 
Eq.  (3.27)  it  is  evident  that  for  the  feed-forward  control  in  analogy  to  the  open- loop  control  the 
whole  target  trajectory  has  to  be  known  for  0  <  t  <  T.  It  is  not  sufficient  to  know  only  the  final 
values  (^(T),  t2(T)). 

The  control  law  (3.15)  with  the  coefficients  (3.25)  ana  the  feed-forward  control  (3.27)  produces 
altogether  good  results.  It  has,  however  the  disadvantage  that  it  cannot  be  used  for  unknown  target 
trajectories.  Therefore  one  has  to  try  with  a  suboptimal  solution  to  eliminate  this  disadvantage.  For 
this  purpose  one  develops  the  integrals  (A. 4)  and  (A. 8)  given  in  Appendix  A  by  partial  integration  up  to 
the  accelerations  ^(T),  i*l,2.  The  feed-forward  control  (3.27)  then  goes  over  in 

gt(t,T)  =  ri(T-t)*Ci(t)+ri+2(T-t)-[5i+2(t)-v.]+Ri(t,T)  ,  (3.28) 

with  the  residual  function 


RjU.T)  =  U-«)*c*Ui+2(tMi+2(T))  -  O^T-thU^m-Vj]  ♦ 

♦  ri(T-t)-Iu(t,T)+r1+2(T-t).I21(t,T)-(l-4)-qu-Ilj.(t,T) 


(3.29) 


This  residual  function  depends  on  the  integrals  given  in  Appendix  B  and  on  an  additional  expression  re¬ 
sulting  from  the  hydrodynamic  drag,  Tor  ships  as  targets  it  is  sure  that  their  accelerations  are  small. 

So  far  it  is  allowed  to  neglect  these  Integral*  the  influence  of  which  for  t-»T  in  any  way  vanishes. 
Furthermore  it  can  he  proven  that  also  the  additive  expressions  of  higher  order  depending  on  i  are  small. 
By  this  it  is  possible  to  put 

(tjit.T)  *  0  (3.30) 

for  0  t  T  and  the  optimal  control  law  can  be  replaced  by  the  suboptimal  expression 

^ 3.3i) 

♦  +  •  i  5  1.3  . 

In  the  auboptimal  control  law  no  preliminary  knowledge  of  the  target  trajeetiry  is  necessary.  Only  the 
actual  target  coordinates  for  the  specific  time  t  are  needed. 

For  the  example  treated  before,  but  now  with  v.  s  0,  i  s  1,2  and  <  e  0,5,  Fig.  8  shows  the  difference 
between  the  ptirnttl  and  the  suboptimal  solution.  One  sees  that  the  optimal  solution  is  comparable  In 
«  certain  scope  with  the  proportional  navigation,  the  suboptimal  solution  with  a  dog-fight  curve. 

An  additional  example  is  given  in  >'ig,  9,  where  the  target  with  the  coordinates 


4^(0  =  (l+v.M 

tj(t)  *  0 


for  t  >  0  is  pursued.  Variation  parameter  is  again  q. . 
polnts~ouv  that  also  in  tha  suboptimal  case  different  pursuer  trajectories afi  realisable  by  an 
appropriate  choice  of  q,  which  is  extremely  important  for  tactical  considerations. 


and 


q«,  respectively.  The  figure 


4.  OPTIMAL  FILTERING 


In  the  further  consideration*  the  general  suboptimal  control  law  (3.31)  is  used.  In  this  control  law 
occur  tha  ata*»  variates  x,,...,x  of  tfrt  pursuer  (torpedo)  end  the  stats  variables  of  the 

target  (ship)  If  one  uses  in  accordance  to  Eq.  (2.6)  relative  coordinates  then  the  number  of  state 
variables  is  reduced  from  elgnt  to  four  but  in  this  case  no  inertial  declarations  for  the  coordinates 
are  possible.  It  is  ell  the  sene,  which  consideration  is  used,  in  any  case  the  state  variables  contained 
in  the  control  law  have  to  be  measured.  These  measurements  are  subject  to  stochastic  errors  so  that  one 


has  to  develop  appropriate  filter  algorithms. 

4.1  Kalman  Filter  for  Torpedo 


As  shown  in  Eq.  (2.4)  the  torpedo  can  be  described  by  linear  state  equations.  One  can  assume  that  in 
these  state  equations  no  stochastic  parts  occur  if  for  example  no  influences  of  wave  motions  of  the 
flow  field  are  considered.  With  a  proportional  linear  working  measurement  device  one  gets  the  measurement 
equation 


+  w(t) 


if  only  the  position  coordinates  and 


'l  0  0  o' 
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0  0  0  1 

X4 

(4.1) 


(4.2) 


if  also  the  velocity  coordinates  are  measured,  i.  e.  in  general  is 


i  s  £.’*  *  w(t)  .  (4.3) 

Assuming  that  w  in  Eq.  (4.3)  is  a  normal  distributed  white  noise  process  with  E[w(t)]  =  0,  then  for 
(2.4)  and  (4.l7  a  linear  Kalman  filter  can  bo  outlined.  In  accordance  with  the  theoretical  background 
given  in  [ll]  for  this  case 


K(t)*[^  -  C-xl  ♦  d  ,x(0)  1  ^  s  Etx(0>)  (4.4) 

is  valid. 

Here  is  for  abbreviation  d  a  B-h+v  the  deterministic  part  in  Eq.  (2.11).  The  Kalman  matrix  K(t)  now 
follows  frotr 

Wt)  s  P(t)'CTf 1  ,  (4.5) 

where  the  covariance  matrix  P(t)  obeys  the  R ICC AT I  matrix  differential  equation 


P  a  A-P  ♦  P-AT  -  P>CT-tfl.C>P  ,  P(0)  8  E((X  -x  Wx*l«  >TJ  • 

—  «•  -•  —  ••  '«  •*-  —  —  — H)  — O 

furthermore  it  is 

Elw(t)-w(ltt)l  :  R*4(  t ) 

for  the  noise  process  in  (4.3).  If  one  assumes  for  PCO)  and  R  diagonal  matrices  of  the  forts 

P(0)  *  dtag(o|t)  ,  i  s  1.....4 

R  s  dUg(r^)  ,  t  >  1 . * 

and  postulates  additionally  that  there  are  equal  stochastic  qualities  for  the  position  coordinates  on  one 
hand  a  ltd  for  the  velocity  coordinates  on  the  other  hand,  which  means 


(4.5) 


(4.7) 


(4.8) 


and 


°U  *  *22 
UJ3  *  °44 


rU  8  r72 
r33  *  **33 


(4. 9) 


(4.10) 


then  the  filter  can  be  given  analytically  in  a  simple  way.  One  gets 


*1 

’kn 

0 

l*kl3 

0 

\ 

kU-Vk13-y3 

*2 

*3 

e 

0 

*k31 

"k22 

0 

0 

^s,k33* 

l'k24 

0 

* 

*2 

*3 

t 

k23-vVy4 

k3l’yltk33’y3 

K 

0 

-k 

42 

0 

v 

(4.11) 


Thereby  is  -  because  of  Eq.  (4.9)  and  (4.10)  - 


kll  =  k22’  k13  =  k24 

k33  '  k44’  k31  =  k42  ’ 

The  explicit  expressions  of  the  elements  k^(  id, . . .  ,4;  j=l,...,4)  are  assigned  in  Appendix  C. 
If  the  velocities  x^  and  x4  are  not  measured  then 

vv° 

is  valid.  Prom  this  follows 


In  order  to  consider  this  case  the  auxiliary  variable  A  is  introduced  in  the  elements  k. .  in  Appendix  C. 
It  is  3 


A  =  1 


for  measuring  of  x3  and  x4  and 
A  =  0 


for  nonmeasuring  of  x,  and  x„.  By  this  the  stated  expressions  are  valid  for  both  cases.  In  the  elements 
of  the  Kalman  matrix  is  evident  that  these  are  not  explicitly  dependent  on  the  relations  (4.0)  but  depend 
only  on  the  ratios 


2 

P 


(4.13) 


This  simplifies  considerably  the  choice  of  appropriate  values  especially  for  £(0). 

In  order  to  have  an  idea  of  the  time  dependance  of  the  kjj( 1*1, . . .  ,4 ;  jd,...,4),  in  Fig.  10  and  Fig.  U 

the  amplification  k . . ( t )  Is  drawn  exemplarily  above  the  time.  Parameter  is  the  ratio  p2.  In  Fig.  10  there 

exists  no  measurement  of  x,  and  x.,  t.  e.  A  :  0,  in  Tig.  11  with  measurement  of  x,  and  x4  is  A  *  1 .  At 

first  It  is  evident  that  for  small  values  of  p2,  which  means  a  verj  good  measurement  with  little  variance, 

the  initial  amplification  is  very  high.  This  is  obvious.  The  filter  namely  considers  the  measurement  more 
than  the  initial  condition  0  »  which  is  only  known  with  a  certain  probability.  For  great  values  of  p2  the 
opposite  consideration  is  valid.  Furthermore  it  is  evident  in  comparison  of  rig.  10  with  rig.  11  between 

each  other  that  in  the  case  6  *  0  the  amplifications  are  greater  than  in  the  case  A  -  1.  This  is  obvious, 

too,  as  in  the  first  case,  because  of  the  missing  measurement  of  velocity,  the  whole  information  is 
included  in  the  measurement  data  of  the  position  coordinates  (x,,x2>,  with  the  help  of  which  the  velocity 
coordinates  v.  and  x  have  to  be  generated.  The  efficiency  of  tne  filter  can  Lc  seen  in  Fig.  12.  In  this 
figure  it  is  referred  to  the  example  treated  in  chapter  3.  It  is  an  interception  considered  with  4  =  0 
for  a  known  target  trajectory.  The  measurement  data  y,,...,y4  are  disturbed  with  a  known  measurement 
noise  of  the  navigation  instruments.  These  disturbed  state  variables  yield  the  optimal  control  u^( c ) 
in  Fig.  12a.  Following  to  this  the  measurement  data  y,»...,y4  are  fed  to  the  Kalman  filter,  this  gene¬ 
rates  x,,...,x4  and  by  these  the  optimal  control  is  realised.  This  shows  Fig.  13b.  One  sees  clearly  that 

the  filter  adapts  to  the  mean  value  E(w<t})  2  £  of  the  disturbances  and  gives  altogether  a  vary  good 
result. 


4.2  Paesivc  Tracking 


For  the  destination  of  the  target  data  needed  in  the  control  law,  i.  e.  the  state  variables 
t.(t),...,C4(t)  one  can  go  two  different  ways.  On  one  hand  the  variables  C.(t),..,,C4(t)  can  be  deter¬ 
minated  directly  by  active  bearing  measurements  (active  tracking).  On  the  other  hand  it  is  also  possible 
to  use  only  passive  bearing  measurements  (passive  tracking).  From  this  one  gets  the  relative  count i nates 
e.(t)..,.,e4(t).  As  the  state  vector  x  is  known  from  chapter  4,i  also  in  this  case  the  inertial  coordina¬ 
te*  CA(t) . C4(t)  can  be  stated  by  Eq.  (2.0).  In  the  further  considerations  this  way  is  gone. 


For  the  passive  target  tracking  it  is  useful  to  change  from  a  continuous  consideration  to  a  discrete  one. 
This  ha*  advantages  for  the  algorithms  to  be  developed.  For  this  it  is  necessary  to  write  the  equations 
of  motion  (3.4)  of  the  torpedo  in  a  discrete  fora.  With  the  sampling  time  that  mean* 
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The  dynamics  of  the  torpedo  is  consequently  described  by  the  general  difference  equation 

x[(ktl)*T  ]  =  £(T  ) »x(k»T  )  +B(T  )*u(k‘T  )  i  v(T  )  . 

— *  O  **-0  —  o  —  o  —  o  —  o 

For  the  target  (ship)  the  considerations  have  to  be  improved  in  some  details. 


(4.14b) 


(4.15) 


Case  1:  Assuming  the  ship  is  moving  with  constant  velocity,  the  discrete  aquations  of  motion  for  the 
target  are  given  by 
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,  k  =  0,1,2,..., 


(4.16) 


where  now  £_(t)  and  £  (t)  are  relative  velocities  in  accordance  with  x,(t)  and  x.(t).  Inserting 
Eq.  (4.14)  and  (4.16)  in  Eq.  (2.8)  one  gets  as  relative  motion  between  the  torpedo  and  the  ship 
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(4.17) 


which  is  identical  with  the  control  deviation  e(t)  in  Eq.  (2.8).  In  Eq.  (4,17)  i(k-TQ)  ia  a  known  contx-ol 
vector  given  explicitly  by  ~ 
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Consequently,  the  relative  motion  is  described  by  the  general  linear  difference  equation 
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Case  2:  The  assumption  of  constant  ship  velocity  is  in  some  cases  not  fulfilled.  Indeed  it  is  sufficient 
to  restrict  for  variable  velocity  on  linear  changes.  With  this  essumption  Eq.  (4.16)  turnso  in 

..  .  *  2  .  .  .  _ 
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ooo  0  1  o 


0  0  0  O  0 


<i(k*v 

Mk-T0i 

Vk*V 

c4(k.ro) 

V'V 

Vk*V 


(4.20) 


In  Eq .  (4.20)  the  additional  state  variables  5-  and  Sg  are  constant  parameters  to  be  identified.  The 
difference  equation  for  the  relative  vector  e(t),  which  is  now  of  the  dimension  (6x1)  follows  from 
Eq.  (4.20)  in  the  same  way  as  in  the  case  before.  F(T  )  is  the  system  matrix  from  (4.20)  and  the 

control  vector  z(k*T  )  derives  from  (4.18). 

—  o 

As  measurement  equation  for  the  generation  of  the  target  trajectory,  which  means  the  destination  of  the 
vector  £(  k  •  )  only  the  bearing  angle 


+  n(k-To) 


(4.21) 


is  available  [8].  Thereby  n(k‘To)  is  a  white  noise  process  with  normal  distribution 
E[n(k-TQ)]  =  0 


(j-T  )*n(k-T  )]  =  \ 

o  o  [0  )  j  ♦  x  , 


(4.22) 


k  =  0,1,2,... 


The  measurement  equation  (4.21)  is  nonlinear.  From  this  equation  in  connection  with  Eq.  (4.19)  the 
vector  e(k-TQ)  has  to  be  determined.  If  one  excludes  pathological  cases  [8]  this  can  be  done  in  a  deter¬ 
ministic  way,  i.  e.  n(k-T  )  =  0  for  Eq.  (4.17)  with  four,  and  for  Eq.  (4.20)  with  six  measurements. 
Necessary  condition  for  tRe  observability  is  thereby  a  sufficient  course  changing  of  the  pursuer  in  order 
to  have  enough  information  on  the  target  for  solving  Eq.  (4.19).  In  the  stochastic  case  as  well  at  least 
four  measurements  are  needed  to  solve  (4.17)  and  six  to  solve  (4.20),  respectively,  but  in  general  it  is 
better  to  have  more  in  order  to  carry  out  a  data  preprocessing.  The  stochastic  measurements  of  Eq.  (4.21) 
have  to  be  prepared  by  a  dynamic  filter.  Following  to  the  considerations  in  [8]  and  [12]  one  then  obtains 
an  extended  Kalman  filter  belonging  to  (4.19)  and  given  in  Appendix  D.  This  filter  has  been  derived  for 
the  measurement  Eq.  (4.17),  for  Eq.  (4.20)  it  has  to  be  extended  correspondingly.  The  filter  algorithm 
itself  follows  from  a  linearization  of  the  measurement  equation  about  the  momentary  estimation  vector 
£(k+l|k).  Now,  simulations  show  that  the  derived  algorithm  has  an  inefficient  convergence  behaviour, 
especially  in  respect  of  the  covariance  matrix  P(klk).^This  bases  on  the  dependence  of  the  linearized 
measurement  vector  o(k)  on  the  estimated  state  vector  e( kl Vr— 1) ,  by  which  a  feed-back  occurs  in  the  calcu¬ 
lation  of  the  amplification  vector  k(k),  which  represents  here  the  Kalman  matrix.  A  decoupling  of  state 
estimation  and  covariance  matrix  calculation  can  be  reached  in  a  simple  way,  if  one  submits  the  measure¬ 
ment  equation  to  a  pseudo-linearization  {8],  [12].  Then  one  gets  the  following  algorithm  with  better 
convergence  behaviour: 


Prediction  Phase: 


e(k+llk)  *  F(k+l,k).e(k,k)  t  z(k) 

P(ktllk)  *  F(k+l,k)*P(k!k)*FT(ktl,k)  ,  k=  0,1,2,... 


Measurement  Phase: 


cT(k+l)  s  (cosB(k+l)  ,  -  einB(ktl),  0,  0) 


Correction  Phase: 


k(k+l)  *  P(knlk)-c(k+l)‘|cT(ktl)*P(ktllk)«c(ktl>to2(ktl)| 
eCk+lIktl)  =  i(kUlk)-k(ktl>.cT(ktl)-i(k>llk) 

P(ktllk+l)  e  £<k+llk)-6(ktl).cT<k+l)*£(kntk> 


(4.23) 


(4.24) 


(4.25) 


Initial  Condition: 


t(0i0)  o  o 

P(0l0)  e  diag(o2)  . 


(4.26) 


In  thia  algorithm,  as  well  as  in  the  extended  Kalman  filter  In  Appendix  D,  the  argument  T  it  omitted. 
The  variance  o*  in  Eq.  (4.26)  can  ba  set  in  general  equal  to  one.  ° 


The  algorithm  given  by  (4,23)  to  (4.26)  is  now  used  to  generate  the  target  trajectory 


CjU)  s  1-t 
(2(t)  i  o 

for  t  >  0.  The  pursuer  (torpedo)  thereby  drives  the  following  identification  trajectory  with  the 
initial  conditions  x^  *  1,13)  xJQ  =  4, SO: 

0  <  t  <  1:  Xx(t)  *  J)  a  const. 

X2(t)  s  -  • 


l^t  <2:  xx(t) 
x2(t) 

2  _<  t  _<  3:  Xl(t) 

x2(t) 

3  <  t  <  4:  x^t) 

x2(t) 

4  _i  t  _<  5:  x^t) 

Xj(t) 


l*(t-l)+x10 
x2(l)  =  const. 

Xl(2)  =  const. 

-  l-(t-2)+x2(l) 

l-(t-3)+x1(2) 
x2(3)  =  const. 

4 )  =  const . 

-  l-(t-4)+x3(3) 


Consequently  the  pursuer  dynamics  is  assumed  as  an  ideal  rectangular  trajectory  leading  to  certain 
simplifications  in  Eq.  (4.18).  As  sampling  time  T  =  1/12  is  chosen,  i.  e.  there  are  12  scanning  steps 
within  each  course  correction  of  the  pursuer.  By  ?his  we  get  the  possibility  of  a  preprocessing  in  the 
algorithm  (4.23)  to  (4.26).  The  results  of  the  simulation  are  given  in  Fig.  13  to  Fig.  15  for  the 
standard  deviations  of  the  measurement  noise  of  a  =  (0,5°;  1  ;  2°).  In  Fig.  13  the  relative  distance 
error  AR  =  (R-R)/R  is  drawn  in  per  cent  with  R  =  (e^+ef)1'2.  Fig.  14  shows  the  course  error  AS  o  9-0,  and 
in  Fig.  15  the  velocity  error  Av  =  v  -  v  is  assigned  with  v  =  (Cj+C?)1/2.  The  geometric  configuration 
of  the  parameters  R,0  and  v  is  Ixplafned  ?n  Fig.  13  again.  8 


From  all  three  figures  it  is  evident  that  the  filter  is  not  working  until  the  first  course  correction 
of  the  torpedo.  This  bases  on  the  fact  -  as  mentioned  above  -  that  the  problem  is  not  observable 
before  [8j.  After  the  first  course  correction  the  filter  gives  usable  results  although  it  cannot  be 
overlooked  that  the  filter  would  diverge  without  a  further  course  correction.  This  bases  especially  on 
the  parallel  course  of  torpedo  and  ship  during  the  second  motion  interval  1  <  t  <  2.  This  divergence  can 
be  removed  by  the  second  course  correction  of  the  torpedo.  After  this  the  stationary  accuracy  is  reached 
in  the  relative  distance  and  in  the  velocity.  The  accuracy  can  be  raised  in  the  course  by  another  course 
correction. 


5.  CONTROL  AND  FILTERING 


It  is  obious  to  combine  the  results  of  chapter  3  and  chapter  4  in  order  to  carry  out  a  closed  loop 
control-filter  procedure .  This  is  done  by  a  hybrid  simulation  on  the  basis  of  the  simulation  plan  of 
Fig.  16. 

On  the  top  of  the  figure  the  dynamics  of  the  torpedo  is  given  as  derived  in  chapter  2  inclusive  the  Kalman 
filter  of  chapter  4.1.  Below  the  target  filter  is  outlined  and  the  nonlinear  measurement  equation  for  the 
relative  target  data  and  the  controllers  are  given.  The  two  systems  -  torpedo  (pursuer)  and  ship  (target)  - 
are  connected  by  a  sample  and  hold  circuit,  which  carries  over  the  continuous  system  of  the  torpedo  in  a 
discrete  one,  as  it  in  needed  for  the  target  filter. 

In  a  first  phase  the  target  trajectory  has  to  be  generated.  In  this  identification  phase  the  switch 
between  the  two  controllers  is  turned  to  the  left  (1)  and  the  system  is  working  in  an  open  loop.  If  the 
torpedo  has  enough  information  about  the  target,  the  identification  phase  is  finished,  i.  e.  the  switch 
between  the  two  controllers  is  turned  to  the  right  (2)  and  the  closed  loop  control  begin*.  During  this 
phase  the  target  filter  is  working  in  parallel.  If  an  additional  time-optimisation  it  netded  as  outlined 
in  chaptar  3,  the  controller  which  in  hardware  is  a  microprocessor  or  a  minicomputer,  respectively, 
carries  out  a  target  trajectory  prediction  and  destinatea  T  from  Eq.  (3.6).  In  this  paper,  however,  this 
is  not  a  central  point. 

As  en  example  of  the  simulation  procedure  the  results  of  Fig,  17  are  given.  Referring  to  the  target 
trajectory  of  chapter  3,2,2,  where  the  target  la  moving  on  the  x.-t.-axie  with  the  normalised  constant 
velocity  "one"  the  pursuer  is  started  with  the  initial  condition!  x*(0)  =  1,12;  k,(0)  *  4, so,  the 
variance  of  the  measurement  device  is  c2  s  i°.  in  the  identification  phase  0  <  t  <  3  the  target  trajectory 
it  gentrated.  In  order  to  look  for  geometric  influences  two  different  identification  trajectories  are 
chosen.  At  the  time  t=3  the  control  phase  begins,  it  is  finished  at  t&9.  Tht  controlled  trajectories  are 
comparabla  with  the  results  in  Fig.  9.  The  obtained  accuracy  can  ba  considered  es  satisfied.  In  the 
x,-C,-axis  one  gets  an  absolute  error  of  Ae.  a  -  o»15  for  the  upper,  and  of  Ae,  n  -  0,275  for  tha  lower 
pdreuer  trajectory,  in  the  xJ-C„-axii  the  corresponding  values  are  Ae,  1  -  0,025  and  As,  *  0,03, 
respect ivtly.  *  ‘  *  * 

6.  EXTENSIONS 

In  chapter  2  a  control  law  ia  dtrived  which  it  well  appropriate  to  any  target  trajectories.  In  chapter  4 
end  chapter  $  the  considerations  are  focussed  on  targets  with  constant  velocities.  An  extension  for 
linear  changat  in  tha  target  velocity  1.  e.  constant  acceleration,  U  given  by  Eq.  (4.20).  In  order  to 
prove  the  efficiency  of  the  algorithms  derived  Including  this  equation  tha  example  of  chapter  4.2  le 
taken  but  now  for 

Cx(t)  s  o.S-t  ♦  0,125-t* 

tj(t)  i  0  . 


Consequently,  the  ship  now  has  an  acceleration  of  5^  =  0,25  in  the  direction  of  £,.  The  identification 
course  of  the  torpedo  is  very  similar  to  that  given  in  chapter  4.2.  The  results  or  this  example  are 
assigned  in  Fig.  18.  Without  any  detailed  discussion  it  is  obvious  that  the  target  filter  works  in  this 
case  efficiently,  too. 

Future  efforts  of  the  authors  will  be  concentrated  especially  on  targets  with  any  velocity  of  time. 
Furthermore,  the  theoretical  considerations  will  be  concentrated  in  general  on  the  problem  of  the  best 
identification  trajectory.  This  may  be  done  by  an  intensive  investigation  of  the  observability  matrix. 

7.  CONCLUSION 

The  paper  presented  deals  with  control  and  filter  problems  for  the  interception  of  torpedo-ship  situations. 
Central  points  are  engineering  applications  and  not  primary  theoretical  aspects.  The  restriction  on  plane 
motions  is  not  decisive,  it  is  only  done  for  formulas  limitations.  All  analytic  results  are  prepared 
such,  that  they  can  be  realized  in  a  simple  manner  numerically  by  a  microprocessor  or  a  minicomputer, 
respectively.  Some  of  the  calculations  thereby  can  be  done  off-line  which  is  appropriate  for  real  time 
situations . 
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APPENDIX  A 
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Auxiliary  Functions  for  the  Open-loop  Control  Problem  in  the  Case  £  ♦  0. 
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Y,-(3u?-y,)  u.'(3y?-iDj) 

(t)  - j — j—  ch(Yi,t)-sin(mi-t)  + - — j — j—  sh(Y.  •t)*cos(u>i»t) 


2Yi-»i-(Yi+«Ji) 


2Yi-“i-(Yi+“i) 


(t)  =  — i— •  sh(Y{  •t)‘sin(u.  *t) 
*y  j  ^  1  1 


(t>  - 


2Yi*«1>(Yi+«i) 


■j — j— ch(Yi‘t)*sin(<Di‘t)  - 


2Yi‘“i‘<7i+"i) 


5 — j— shiY^ti'cosCw^t) 


m21(t)  =  c2i(t)  +  s*Cjj(t) 
mu(t)  -  c^it)  +  e>m2i(t) 

moi(t)  =  V(t)  +  eMBli(t> 


C3i<T> 


+  I m  ,(t)*(c  ,(T)-1)  -»  ,(T)-(c  .(t) 


oi'w'"-oi'  w’*#  '  "oi'  w  vvoi'  W'*'J  '  q^~rtTVi 


[ 

•[V’>-^T7T'V'>j 


-1)1 . — L. 

J  <»<<•■ 


1^(0  s  /  ,  i  S  1,2 


a  CJt(t>  *  t*Ojj(t) 
U^it)  a  ca(t)  -  fdjjit) 

dol(U  S  e0i(t>  ‘  t,dU(t) 


r  ranlTi  0ii'Trcn'TM 

ku(T)  a  [^uTo^ty.«--TfTj  •  *io  * Ir^rfT *  ‘S^lfcTfyl  *  (xu?.0tvi)  * 

♦  bgm  •  •  r^rrr]  •  -  v  b&n "  ^>1  •  *  5>r 


rd,,(T) 


e,,<T)-c,.<T), 


k2t(T)  e  kgm ♦ «« *  *  *Jo  ♦  l«^m ♦  <«  *  r;tTfv-.;;mj  *  (*um 

r«3«<T>  (c3i(T>‘q^**oi(T,,1  ri3l(T) 

♦  fern  *  (coi(T,‘I)  * . o^rfr^Tfr-]  — v  [s^m  •  <i  r  .^ma^ro] 


fd„(T) 


0*V  * 


_  hlCT),u(T)  csl(T) 

hi1”  ■ 

. 'll1”-!!1” 

hi”  ooirfi-.01(fi 


(A.l) 
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(A. 3) 
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1li*t)  =  ^  03i(t‘T)‘tl(T)"dt 
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i3i(t)  =  /  coi(t-T)*Ci(T),dT  ,  i  =  1,2  . 


APPENDIX  B 


Auxiliary  Functions  for  the  Closed-loop  Control  Problem  in  the  Case  £  ♦  0. 

roi(T_t>  slJ^^{p*[coi(T-t)*doi(T-t)+%i,c2i(T-t>-d2i(T-t)]  - 
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APPENDIX'  C 

Eltaeot*  of  the  KaImo  Matrix  K(t)  for  the  Torpedo  inertial  filter. 

l  3 
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k22(t)  =  ku(t) 
k24(t)  =  kx3^^ 
k42(t)  1  k31^^ 
k44^  3  k33^^ 


(C.3) 


APPENDIX  D 

Extended  Kalman  Filter  for  the  Relative  Notion  of  Torpedo-Ship. 


Prediction  Phase: 


eCk+ilk)  =  F(k+l,k)e(klk)  ♦  tU) 


Ptktllk)  s  F(ktl,k).p(klk)-FT(ktl,k)  ,  k*  0,1,2,... 
Keasureoent  Phase: 
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Correction  Phase: 
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Initial  Condition: 


£(010)  s  dia*(e3)  ;  o3  > >  l 
3(010)  =  0  . 
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Fig.  11  Amplification  k.,(t)  of  Kalman  matrix 
(with  velocity  Aiasureaenta) 


Fig.  15  Target  filter:  velocity  error 


Fig.  17  Interception  trajectories  with  identification  and  control 


(T» 


♦ 


c.c.2 

♦ 


•  •• 


c.c.3 


;c.c.4 


i 

t 

i 

i 

i 

i 

i 

i 

i 

i 


u 


rrr 


♦  ♦  ♦  ♦  ♦  ♦ 


► 


ter  for  an  Interoeption  with  variable  target  veloaily 


0-1 


SEPARATED-BIAS  ESTIMATION  AND 
SOME  APPLICATIONS 
by 

Bernard  Friedland 

The  Singer  Company,  Kearfott  Division 
1150  McBride  Avenue 
Little  Falls,  New  Jersey 
07424 
U.S.A. 


SUMMARY 

A  number  of  applications  of  Kalman-Bucy  filtering  require  the  estimation  of  unknown  constants  (biases) 
as  well  as  dynamic  state  variables.  In  1969,  a  method  of  separating  the  estimation  of  the  bias  vector  from 
the  estimation  of  the  dynamic  state  vector  was  presented  as  an  alternative  tq  state  vector  augmentation 
which  often  had  numerical  conditioning  difficulties.  The  optimum  estimate  x  of  the  dynamic  state  was  shown 
to  be  the  sum  of  a  biao-free  estimate  x  (i.e.,  the  estimate  that  would  be  obtained  if  the  constant  bias 
vector  b  were  zero)  and  a  correction  term  Vfc  ,  where  b  is  the  optimum  estimate  of  the  bias  and  V 
is  a  matrix  determined  from  the  other  matrices  of  the  problem.  Moreover,  the  bias  estimate  b  can  be 
obtained  from  the  residuals  (innovations)  of  the  bias-free  estimator.  This  paper  reviews  the  general  theory, 
using  a  new  derivation,  and  summarizes  some  of  the  extensions  that  various  investigators  have  contributed 
during  the  past  decade.  Several  applications,  including  calibration  aud  failure  detection  and  identification, 
are  discussed. 


1.  INTRODUCTION 

No  contribution  since  World  War  II  has  influenced  system  science  more  than  the  recursive  filtering 
theory  of  Kalman  and  Bucy  [1,2].  The  theoretical  significance  and  practical  utility  of  this  work  became 
widely  recognized  within  a  few  short  years  of  its  advent  in  the  early  1960's.  Dozens  of  papers  soon 
appeared  which  presented  alternate  derivations  and  interpretations  and  demonstrated  potential  applications 
in  various  fields — most  notably  aerospace  but  also  in  industrial  process  control  and  even  in  the  field  of 
econometrics. 

All  this  activity  made  the  benefits  of  Kalman-Bucy  filtering  quickly  evident  to  a  wide  audience.  And 
it  exposed  some  of  the  limitations  chat  were  not  obvious  at  first.  One  of  these  limitations  was  the  tendency 
of  the  calculated  quantities  (particularly  the  "covariance  matrix")  to  become  ill-conditioned,  with  the  elaspo 
of  time,  in  processes  of  high  dimension.  (Considerable  research  has  been  devoted  to  general  methods  of 
improving  numerical  conditioning  of  the  required  calculations— -and  continues  to  the  present  time — but  this 
general  subject  is  beyond  the  scope  of  this  paper.) 

Even  when  ill-conditioning  did  not  cause  serious  problems,  the  implementation  of  the  Kalman  filter  in 
many  Instances  created  a  severe  burden  for  the  typical  airborne  computer  of  the  early  sixties  and  motivated 
a  quest  for  ways  to  reduce  the  computational  requirements,  even  at  the  expense  of  a  sacrifice  in  the 
theoretically  attainable  performance. 

The  problems  of  computer  loading  and  prospective  111-conditioning  had  to  be  faced  in  one  of  the  early 
proposed  applications  of  Kalman  filtering:  mixing  of  navigation  aid  data  with  inertial  data  in  alded-inertlal 
navigation  systems.  In  this  application  [3],  most  of  the  variables  to  bo  estimated  are  constants  (biases, 
drift  rates,  scale  factor  errors,  misalignment  angles,  etc.).  The  customary  treatment  of  these  unknown  con¬ 
stants  as  state  variables  results  in  state  vectors  of  high  dimension.  Around  1969  we  reasoned  that  it  should 
be  possible  to  exploit  the  fact  that  muny,  if  not  most,  of  the  state  variables  are  constants  to  reduce  the 
complexity  of  the  filter,  and  thereby  to  alleviate  the  computational  burden  and  to  minimize  the  possibility 
of  ill-conditioning.  We  initiated  an  analysis  which  culminated  in  our  paper  [4]  in  which  the  estimation  of 
the  constant  or  "bias"  parameters  was  separated  from  the  estimation  of  the  dynamic  state  variables. 

We  showed  that  it  is  possible  to  obtain  an  optimum  estimate  $  of  the  dynamic  state  using  a  filter 
having  the  structure  shown  in  Fig.  1,  and  consisting  of  a  bias-f roe-state  estimator,  a  bias-estimator,  and 
a  bias-correction  matrix  V  .  Mathematically  the  optimum  state  estimate  X  is  the  sum  of  the  blas-free- 
state  estimate  x  and  a  correction  term  vt>  ,  where  £  is  the  optimum  estimate  of  the  bias,  l.o., 

x  -  x  +  v6  (1) 

The  bias-free  state  estimate  x  is  obtained  by  processing  the  observations  in  a  Kalman  filter  designed 
under  the  assumption  that  the  bias  vector  b  is  identically  zero.  In  the  standard  implementation  of  the 
bias-free  filter,  the  difference 

F-y-y  (2) 

between  the  actual  observation  and  the  estimate  thereof  is  produced.  This  difference  signal  known  as  the 
"residual"  or  nowadays  by  the  more  popular  term  "Innovation"  is  the  input  to  a  second  filter  which  can  be 
called  the  "bias-estimator"  because  Its  output  is  the  optimum  estimate  b  of  the  unknown  bias  vector  b  . 

The  bias  estimate  t>  ,  multiplied  by  the  correction  matrix  V  is  finally  added  to  x  ,  in  accordance  with 
Eq. (1)  to  yield  the  desired  optimum  state  estimate. 

We  expected  to  find  that  the  bias-separated  filter  implementation  would  require  fewer  numerical  operations 
than  the  augmented-state  implementation.  But  we  found,  to  our  disappointment,  that  the  number  of  operations 
for  each  Implementation  were  comparable,  (The  bias-separated  Implementation  did  open  some  new  possibilities 
for  approximations  that  may  not  have  been  evident  in  the  augmented-state  implementation.) 

But  the  advantages  of  the  bias-separated  implementation  for  avoiding  numerical  111-condltlonlng  are 


obvious.  In  the  augmented-#tat«  implementation  the  overall  process  is  of  order  N  +  K  ,  where  N  is  the 
number  of  dynamic  state  variables  (i.e.t  the  dimension  of  x  )  and  K  is  the  nuaber  of  biases  (the  dimension 
of  b  )  and  the  8  +  1  variables  are  all  coupled,  in  the  filter  and  in  the  covariance  matrix  propagation. 

In  the  bias-separated  implementation  the  maximum  dimension  one  needs  to  be  concerned  with  the  larger  of  N 
or  K  ,  and  errors  in  the  estimation  of  the  bias  do  not  contaminate  the  estimation  of  the  bias-free  estimate 
x  of  the  dynamic  state. 

The  strategy  we  employed  in  1969  for  deriving  the  bias-separated  filter  was  motivated  by  a  similar 
strategy  we  had  then  been  using' in  our  study  of  quasi-optimum  control.  Investigators  unfamiliar  with  that 
technique  found  the  results  to  be  lacking  in  motivation.  Several  authors  (Agee  and  Turner  {5J .  Mendel  and 
Washburn  16,7),  and  Bierman  [8])  have  contributed  to  correcting  this  deficiency  by  providing  alternate 
derivations  and  Interpretations. 

The  bias-free  residuals  (or  innovations)  occurred  naturally  in  the  derivation  of  (4)  but  their  signif¬ 
icance  was  not  fully  appreciated  at  the  time:  the  Interpretations  resulting  from  the  work  of  Kallath  et  al 
[9,10]  had  only  just  begun  to  emerge.  In  retrospect  it  is  evident  that  the  bias  separation  method  Introduced 
in  (4)  exemplifies  one  of  the  applications  that  can  be  made  of  the  residuals  of  a  Kalman  filter.  Failure 
detection  and  dlagnoals  [11]  exemplifies  another  application  that  can  be  made  of  the  residuals.  We  shall 
return  to  this  application  subsequently. 


2.  REVIEW  OF  THEORY 


It  has  bean  remarked  above  that  the  bias-separated  structure  of  Fig.  1  cm  be  derived  in  a  number  of 
different  ways.  Since  new  methods  of  derivation  can  afford  new  Insights,  we  offer  here  still  another 
derivation  based  on  the  theory  of  linear  observers.  Sines  the  latter  theory  does  not  depend  on  properties 
of  stochastic  processes,  the  derivation  shows  that  the  structural  properties  of  the  separated-blas  estimation 
algorithm  transcends  the  stochastic  process  underpinnings. 


For  simplicity  we  consider  only  a  continuous- time  process 

x  *  Ax  +  Bb  ♦  u 


with  observations  given  by 


O) 


y  »  Cx  ♦  Db  «■  v  (4) 

where  b  is  a  constant  (but  is  known)  vactor  (called  the  "bias'*)  and  u  and  v  white  nolee  processes  having 
known  apactral  danslty  matt leas,  Q  and  R  reapactively. 


In  accordanca  with  well-known  theory  [12 J,  an  observer  for  the  process  Cqa.(l)  -  (4)  la  defined  by 

x  •  Ax  ♦  3b  ♦  K^ty  -  Cx  -  t)6)  (J) 

s 

b  -  K^(y  -  Cx  -  06)  (6) 

These  relations  ere  depicted  in  Fig.  2s.  The  gain  matrix 


.fV 


(7) 


is  chosen  to  make  the  observer  asymptotically  stable.  If  the  gain  matrix  la  chosen  optimally,  the  observer 
Bqs.(S)  -  (6)  la  the  Kalman  filter;  otherwise  the  observer  has  only  the  property  that  the  error 


«  • 
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Figure  2.  Two  Forms  of  Observers  for  Linear  Systems  with  Bias. 


Now  consider  the  possibility  of  expressing  the  observer  Eq.(5)  and  Eq.(6)  in  the  bias-separated  fora 
shown  in  Fig. 2b,  l.e. , 


x  -  x  +  Vb 

(8) 

where 

x  is  the  state  of  the  bias-free  observer,  given  by 

x  -  Ax  +  Kr 

(9) 

A 

and  b 

is  the  biss  estimate  given  by 

b  -  E^(r  -  Hb) 

(10) 

where 

r  is  the  bias- free  residual,  given  by 

r  •  y  -  Cx 

(11) 

Note  the  following: 


•  The_bies«free  observer  Eq.(9)  retains  the  general  fora  of  an  observer  end  is  hence  asymptotically 
stable  if  K  ,  as  yet  unspecified,  la  appropriately  chosen. 

a  The  bias  estimator  Eq.(10)  is  also  in  the  fora  of  an  observer  except  that  its  input  la  the  bias  free 
residual  r  (not  y  ).  Moreover,  the  bias  filter  gain  K.  is  the  asae  aatrlx  that  appears  In  Eq.(6).  In 
principle,  the  results  to  be  obtained  could  be  generaliteo  sooewhat  by  permitting  E.  in  Eq.(6)  and  K.  in 
Eq. (10)  to  be  different  aatrlcea.  We  will  not  do  this,  however 


Our  objective  in  the  analysis  that  follows  is  to  find  relationships  between  the  aatrlcea  V  ,  K  and  H 
that  must  hold  in  order  Eqa.(B)  -  (11)  be  equivalent  to  Eqa.(3)  -  (6).  To  thla  end,  subatltute  Eq.(8)  loco 
Eq.(6)  to  obtain 

b  •  K^y  -  C(x  +  Vb)  -  DbJ 
*  E^T  -  (CV  ♦  D)b) 

which  la  of  the  fora  of  Eq. (10)  provided  that 

H  -  CV  ♦  0  (12) 


Thla  ia  one  of  the  relatlona  we  ere  seeking. 

Also,  substitute  Eq . (8)  into  Eq.(3)  to  obtain 

x  +  Vb*Vb-A(x  +  Vb)  +  8b'f  E^Or  -  Ub) 

-  Ax  ♦  tj  *  (t  ♦  AV  -  r^H)^  (13) 

lut,  froa  Eq.(9)  and  Eq.(10),  the  left-hand  aide  of  Eq,(l3)  is 

a  +  V^  +  Vb-AxeKre-  VE^lr  -  Kb)  +  Vb 
The  Ax  ten  cancels  on  both  ai(*aa  of  Eq. (11)  leaving 

(E  ♦  1^)7  +  (V  -  VK^Hlb  -  t7  ♦  (>  ♦  AV  -  KH)S 

Thus  Eq. (S)  la  aatiaflsd  for  all  r  and  b  provided  that 

*  "  ‘  v*b  (U) 

V  -  AV  +  VK^H  +  »  -  E^H 

-  (A  -  EC) V  +  B  -  ED 


(») 


Thus  the  augmented-state  observer  of  Fig. 2a,  with  gain  matrices  K  and  K.  obtained  by  any  method 
whatsoever,  can  be  transformed  into  the  bias-separated  form  of  Fig. 2b,  provided  the  matrices  H  ,  V  ,  and 
K  satisfy  the  two  algebraic  equations  (12)  and  (14)  and  the  matrix  differential  equation  (15),  which  becomes 
the  matrix  Rlccatl  equation 

V  -  (A  -  KC)V  +  Vjy)  +  VKjCV  +  B  -  KD  (16) 

upon  substitution  of  Eq.(14). 

This  derivation  is  strictly  algebraic  and  does  not  require  that  any  significance  be  attached  to  the 
matrices  that  appear  in  the  respective  relations,  but  only  that  a  matrix  V  which  satisfies  Eq.(16)  can  be 
found.  (The  general  conditions  on  A  ,  B  ,  C  ,  0  ,  K  and  K,  that  guarantee  the  existence  of  a  solution 
to  Eq.(16)  have,  to  our  knowledge,  not  been  explored.*  In  particular,  there  is  no  requirement  that  K  and 
K.  be  optimum  for  the  noise  u  and  v  .  And,  irrespective,  of  the  optimality  of  these  gains,  the  steady 
state  errors  in  the  estimation  of  x  and  b  will  tend  to  aero  if  these  gains  result  in  a  stable  observer. 

An  alternate  demonstration  of  this  property  was  given  in  [13]. 

If  the  observer  gains  K  and  are  optimum  for  the  noises  u  and  v  in  Eq.(3)  and  Eq.(4),  however, 
then  the  bias-separated  filter  is  also  optimum.  And  it  is  then  possible  to  provide  interpretations  of  the 
matrices  K  and  V  .  In  particular,  as  shown  in  [4],  K  is  the  optimum  gain  for  the  bias-free  filter,  i.e., 
for  estimating  the  state  x  when  b  is  known  to  be  identically  zero,  l.e., 

K  -  PC'R-1  (17) 

where  P  »  AP  +  PA'  -  PC'R_lCP  +  Q  (18) 


with  Q  and  R  being  the  spectral  density  matrices  of  u  and  v  ,  respectively.  Moreover,  the  matrix  V 
can  be  interpreted  as  the  ratio  of  the  cross-covariance  matrix  of  x  and  b  to  the  covariance  matrix  of  b  . 
Specifically,  if 


P^  -  R[(x  -  i)(b  -  b)  *  J 
Pfa  -  E((b  -  b)(b  -  b)*] 


then,  as  shown  in  [4], 


V  ■  p  .?  * 
xb  b 


(19) 


(20) 


This  helps  to  provide  an  intuitive  interpretation  of  the  bias  correction  equation  (8),  in  which  V  is 
seen  to  be  the  gain  matrix  for  correcting  the  bias.  By  Eq.(2Q)  thia  matrix  is  proportional  to  the  cross- 
correlation  between  the  error  in  estimating  the  state  and  the  error  in  estimating  the  bias.  If  the  influence 
of  the  latter  on  the  former  la  relatively  weak,  as  measured  by  a  small  cross-correlation  matrix  P  .  ,  it  is 
only  reasonable  to  expect  Chat  the  correction  to  the  bias-free  estimate  x  ,  whan  knowledge  of  b  *li  obtained, 
would  likewise  be  small.  Likewise,  if  the  cross-correlation  between  the  bias  estimate  and  the  state  estimate 
is  strong,  we  should  expect  a  largo  bias  correction.  In  addition,  we  would  expect  the  magnitude  of  the  bias 
correction  to  be  Inversely  proportional  to  the  uncertainty  in  the  estimate,  and  this  explains  the  presence 
of  PbA  in  Bq. (20). 


Another  useful  relationship  involving  V  ,  as  given  in  [4],  is 


F  •  P  VPfeV'  (21) 

A  _ 

where  P  is  the  covariance  matrix  of  the  estimate  of  x  in  the  presence  of  blaa,  end  P  lg  the  covarlence 
in  the  abeeece  of  blaa.  Since  VMV*  le  e  poetive-(eemi)  definite  matrix,  it  le  clear  t^at  P  le  largar  than 
P  ,  which  la  of  course  to  be  expected.  But  Bq.(21)  quantifies  ths  dlffsrsncs  bstwesn  P  snd  P  .  In 
particular  if  VP.  V'  is  small  rslstlve  to  P  ,  thsn  ths  inerssas  in  trror  due  to  b  is  correspondingly 

small  and  b  is  not  significant  is  ths  estimation  of  x  .  Since  it  is  posslbls  to  include  ell  the  blee 
verlebles  except  one,  eey  b.  ,  la  the  etete  x  end  epply  the  reeult  of  Eq.(21)  to  b,  alone,  this  provides 
t  wsy  of  sssssslng  ths  effect  of  each  component  b,  of  the  bise  vactqr  b  on_the  estimate  of  x  .  Those 
components  which  do  not  contribute  significantly  to  the  Increese  in  P  over  P  are  candidates  for  omission 
in  a  euboptlaal  Implementation. 

A  useful  Interpretation  of  the  bias  estimation  equation  (10)  can  be  had  by  consldarlng  the  problem  of 
estimating  an  unknown  constant  b  observed  through  noise,  l.s. , 


with  observation  r  given  by 


b  -  0 


r  •  Hb  +  C 


(22) 

(23) 


where  (  is  white  nolee,  having  a  spectral  density  matrix  R  .  Dirtct  application  of  basic  Kalman  flltar 
theory  shown  that  tha  optimum  estimator  la  exactly  in  the  fora  of  Eq.(10)  with  tha  gain  matrix  given  by 

-  PbH't'X  (24) 

with  P,5  being  tha  solution  of  the  variance  aquation 

Pb  -  -  PbH'R'lHPb  (25) 

It  le  shown  in  [4]  th*l  these  ere  precisely  the  relatione  that  are  eetisfled  by  K.  and  F.  .  Hence  the 
operation  of  the  bias  estimator  Ej.  (10)  can  ba  interpreted  as  the  extraction  of  a  constant  observed  i )  white 
notes,  with  tho  residual  vector  r  being  the  observation.  If  the  bias  b  le  known  to  be  isro,  then  by 


Eq.(23)  r  is  zero-mean  white  noise  with  the  sane  spectral  density  as  the  original  observation  noise.  This 
K  confirms  a  well-known  property  of  the  residuals  in  a  bias-free  filter.  But  Eq.(23)  also  explains  the  effect 

I'  of  a  nonzero  bias  b  in  the  original^  dynamic  equations  on  the  residual  of  a  Kalman  filter  designed  for  zero 

bias.  In  particular,  the  residual  r  is  not  a  zero  mean  process  but  rather  has  a  mean  given  by  Hb  with 
|  the  "equivalent  observation"  matrix  H  given  by  Eq.(12).  As  one  might  have  expected  the  observation  bias 

I  matrix  D  appears  directly  in  H  .  But  the  dynamic  bias  matrix  B  appears  in  H  only  through  its  influence 

I  on  V  .  Moreover,  since  D  also  appears  in  the  equation  for  V  it  is  not  entirely  accounted  for  by  the 

^  D  term  in  Eq.(12). 

i 

'  The  interpretation  of  Eq,(10)  as  the  extraction  of  a  constant  observed  in  white  noise,  was  first 

j  advanced  by  Mendel  and  Washburn  [6,7]  (see  also  [13]).  This  interpretation  is  particularly  useful  in 

|  failure  detection  applications  as  discussed  below. 

s  For  simplicity,  the  above  review  was  given  in  terms  of  a  continuous-time  process.  There  are  exactly 

analogous  results  for  discrete-time  systems  which  are  presented  in  suouary  form  in  Appendix  A. 


3.  EXTENSIONS  OF  THEORY 

Alternate  Derivations — As  often  happens  with  theoretical  results,  not  everyone  was  pleased  with  the  method 
used  to  derive  the  bias  separated  filter,  and  several  investigators  contributed  alternate  derivations  which 
may  provide  added  insight  or  suggest  further  extensions. 

In  1971,  Lin  and  Sage  [14]  reported  on  their  approach  to  bias  estimation  using  maximum  likelihood 
methods  and  which  entailed  solution  of  a  two-point  boundary-value  problem.  They  obtained  results  which  were 
subsequently  shown  by  Godbole  [IS]  to  be  identical  to  the  original  results  in  [4].  As  Godbole  pointed  out, 
this  fact  was  hardly  surprising,  since  it  had  been  known  for  several  years  that  the  Kalman  filter  la  a 
recursive  implementation  of  the  solution  of  the  two-point  boundary  value  problem. 

In  1972,  Agee  and  Turner  [5]  derived  equations  for  the  (discrete-time)  bias-separated  filter  by  starting 
with  the  correction  equation  of  the  form 

x  »  x  +  vb 


and,  by  a  method  somewhat  similar  to  the  method  used  in  Section  2,  determined  the  conditions  under  which  the 
decoupling  is  possible.  One  of  their  conclusions  is  that  the  partitioning  la  only  possible  when  the  bias  is 
not  a  random  process.  In  other  words,  although  the  bias  need  not  be  a  constant,  but  rather  may  be  given  by 

b  -  2b  (26) 

it  would  not  be  permissible  to  Include  a  noise  input  on  the  right-hand  aide  of  Eq.(26)>  Hence  any  attempt  at 
axtsnding  the  result  to  tha  case  in  which  the  bias  la  a  random  process  mutt  of  necessity  leed  to  a  sub- 
optimum  filter.  It  ia  noted,  however,  that  the  derivation  in  Section  2,  it  Independent  of  tha  manner  in 
which  the  gains  X  and  X.  are  obtained  for  the  augnented-atate  filter.  The  augmented  state  filter  (Fig. 2a) 
and  the  corresponding  gains  X  and  X.  ,  generally  ecu  be  found  even  if  tha  biaa  b  is  e  random  proceaa. 
Thus  it  would  aaam  that  tha  restriction  that  b  not  ba  a  random  procaa#  is  somehow  superfluous.  Tha  Ages 
and  Turner  result  of  [3]  thus  suggests  either  that  the  blea-free  filter  of  Flg.2b  that  produces  x  it  not 
the  optimum  filter  for  the  proceaa  with  b  s  0  ,  or  that  e  solution  to_Eq. (16)  for  tha  correction  matrix  V 
cannot  be  found.  It  le  interesting,  but  of  no  real  importance,  that  x  la  tha  estimate  of  x  in  tha 
absence  of  bias.  If  this  wart  not  the  caaa,  but  if  we  could  solve  for  V  ,  X  and  H  ,  than  the  biaa- 
aepereted  structure  of  Flg.2b  could  still  be  used. 

Also  in  1972,  Tscker  et  el,  in  studying  control  of  interconnected  energy  eytteme  [16]  (apparently 
Independently)  discovered  the  bias-separation  result  within  tha  f timework  of  linear  optimum  control  theory. 

In  tha  early  1970' a  the  square-root  method  of  implementing  the  optimum  recursive  filter  had  been  gaining 
in  popularity  as  another  method  of  overcoming  problem*  of  numerical  conditioning.  In  1973,  Blarman,  an 
active  Investigator  in  this  fiald  suggested  (  g]  that  “tha  [ square-root  information  filtar]  8XXF  la  a  natural 
method  of  dealing  with  blasts,"  and  davalopad  a  derivation  using  this  methodology.  In  the  course  of  thl* 
development,  several  additional  results  and  lncarpretationa  emerged.  He  showed  tha  relationship  between  the 
correction  matrix  V  and  tha  "estimation  sensitivity"  and  "consider  covariance"  matrices  of  Importance  in 
orbit  determination.  Ha  also  pointed  out  how  tha  bias-separation  method  can  ba  used  to  compute  smoothing 
eolutlooe. 


A  very  rigorous  development  of  tha  results  of  (4)  for  both  continuous- time  and  discrete  time  systems 
was  published  in  1978  by  Handel  and  Washburn  [6,7]  baaed  on  Washburn's  1977  doctoral  dissertation.  The 
development  assumed  tha  bias  separated  form  of  Sq.(6)  end.  Ilka  Ages  and  Turner,  found  the  conditions  under 
which  Iq.(8)  is  valid.  In  tits  courts  of  this  development  they  demonstrated  that  tha  estimation  of  tha  bias 
using  ehs  residuals  (Innovations)  of  tha  blea-free  filter  is  equivalent  to  extraction  of  a  constant  observed 
in  whits  noise,  and  used  this  proparty  to  apply  well-known  algorithms  In  which  the  bias  vector  change* 
dimension  with  time. 

Extension  to  Tlme-Varrina  lias— The  original  development  of  tha  blaa-sspsratsd  algorithm,  aa  given  in  [4], 
was  confined  to  a  constant  biaa,  l.a.,  b  •  0  ,  but  it  was  remarked  at  that  time  that  tha  extension  to  a 
time-varying  bias  would  ba  fairly  staple.  The  explicit  extension  was  first  presented  by  Tacker  and  Las 
(171  In  1972.  Blarman  subsequently  pointed  out  [IB]  that  tha  reaulta  of  Tacker  and  Lea  could  ba  obtained 
more  directly  by  noting  that  If  6  •  Zb  ,  than  b(t)  “  S(t,0)bQ  where  9(t,0)  la  tha  state  transition 
matrix  corresponding  to  Z  and  hence  the  time-varying  biaa  estimation  problem  can  ba  replaced  by  tha 
problem  of  estimating  tha  initial  state  bQ  of  b(t)  . 

Extension  to  Monli«gr  fiTIf  1  "r  dynamic  systems  of  practical  interest  art  linear)  Kalmen-Bocy  filtering 
ia  often  used  to  eat lasts  tha  stats  of  such  nonlinear  ays tame,  nevertheless.  Tha  standard  technique  used 
for  nonlinear  systems  la  tha  "extended  Kalman  filter,"  (EXF)  in  which  tha  actual  nonlinear  aquations  are 
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used  in  computing  the  residuals  and  for  the  dynamic  model,  but  in  which  the  correction  due  to  the  residual 
la  linear.  Specifically,  for  a  continuous-time  process 


with  observetion s  given  by 

s  -  f(z)  +  u 

(27) 

y  -  g(*)  +  v 

(28) 

in  which  u  end  v  are  white  noise  processus 

,  the  EKF  equations  are 

ft  -  f (ft)  +  Kr 

(29) 

with 

f  -  y  -  g(2) 

(30) 

The  gain  matrix  K  is  computed  from  the 

covariance  matrix  P  ,  i.e., 

K  -  PC'B'1 

(31) 

with 

P  -  FP  +  PP’  -  PG'R'XGP  +  Q 

(32) 

in  which  F  and  C  are  Jacobian  matrices  of  f  and  g  ,  respectively,  evaluated  along  the  estimated 
trajectory,  i.e.. 


The  covariance  matrix  P  la  computed,  aloof  with  ft  ,  as  part  of  the  EXF  algorithm. 


In  many  practical  applications  the  EXF  algorithm  outlined  above  works  quite  well.  In  these  applications 
it  would  be  worthwhile  to  develop  a  bias-separated  form  of  the  EK ¥  for  the  case  in  which  the  state  a 
includes  dynamic  variables  and  biases,  i.e.,  in  systems  in  which  the  state  vector  *  can  be  partitioned 
into  a  dynamic  state  x  and  a  bias  b  : 


s  - 


and  hence 

f(i)  «  f(x,b)  ,  g(a)  •  g(x,b) 

The  direct  extenelon  of  the  eeperated-blae  form  to  the  EXF  ie  not  ea  obvious  as  it  might  appear  at  first 
glance,  owing  to  the  nonlinear  nature  of  f(  )  and  g(  >  .  In  particular,  the  Jacobian  matrices  that  enter 
into  the  covariance  matrix  P  of  Eq.(32)  are  evaluated  at  the  optimum  eatlmato  of  x  and  b  ,  so  that  the 
representation  of  the  variance  equation  in  the  manner  of  Eq.(21)  which  leads  to  the  espereted-blae  form  may 
not  be  valid,  and  this  lands  to  tone  difficulty  in  determining  the  gelns  X  end  to  be  used  in  the  bi as¬ 
st  periled  form.  It  may  be  argued,  with  Agee  end  Turner  {5),  that  the  EKF  le  noi  optimum  anyway,  and  hence 
any  reasonable  choice  of  galoe  might  be  acceptable.  Adopting  this  viewpoint,  however,  does  mot  and  the 
setter.  In  the  bias  separated  structure,  for  example,  the  blaa  free  reel dual  is 

r  •  y  -  Cx 

The  counterpart  of  this  in  the  nonlinear  equation  la 

r  •  y  -  g(x,0) 

In  other  worda  the  etete  used  to  compute  the  expected  observation  la  *  not  ft  .  This  la  counter  to  the 
epirlt  of  the  EXP  in  which  the  linearisation  is  always  made  about  the  complete  state  estimate.  If  the  effect 
of  the  blaa  on  the  estimated  state  ie  small.  It  may  not  matter  too  much  whether  the  linearisation  is  shout 
&  or  qbout  x  .  When  the  blss  Is  slgnlflcsnt,  however,  the  difference  between  ft  and  x  may  he  enough  to 
affect  the  results  significantly. 

The  generalisation  of  tha  blaa-sapsratloo  algorithm  to  nonlinear  dynamics  was  coostdersd  by  Si oh a  sod 
Mahalaoabla  (191  In  1972.  They  applied  tha  results  of  1 4 1  to  partitioning  the  dynamic  stats  and  bias 
estimation  equations.  They  considered  both  the  discrete-time  and  the  continuous-time  csss,  but  did  not 
elucidate  tha  problem  of  where  to  evaluate  the  requisite  partial  derivatives. 

Tha  special  case  In  which  the  hies  b  enters  linearly  into  the  dynamics  and  observations  wee  studied 

carefully  by  Handel  (20]  in  1976.  Considering  only  tha  discrete-time  problem,  he  showed  that  the  separated- 
blsa  algorithm  fits  into  tha  1XF  algorithm,  exempt  that  tha  matrix  V  in  tha  correction  equation  must  ba 
recomputed  after  the  bias-free  state  update  la  nrdsr  to  implement  the  EKF  algorithm  exactly.  He  does  not 
address  Che  issue  of  whether  the  additional  computer  tine  needed  to  compute  V  twice  per  time -update  step 
la  Justifiable  in  view  of  tha  likelihood  that  tha  EXF  is  not  optimal. 

A  method  of  possibly  overcoming  any  problem*  that  might  arise  because  of  nonlinear  affects  on  the 

difference  between  ft  apd  x  is  to  reset  tha  blaa  computation  from  time  to  tins,  i.e. ,  by  setting  x  to 
ft  and,  simultaneously  b  to  ssro.  This  operation  merely  resets  tha  means.  It  would  be  tan  roper  to  reset 
the  covariance  me tries*,  since  the  uncertainty  in  the  blaa  le  not  changed  by  resetting  tbs  mean.  The 
resetting  operation  can  be  performed  whenever  a  suitable  (ad  hoc)  tint  reveals  that  the  difference  between 
ft  end  a  has  a  significant  nonlinear  effect. 
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filter.  Ia  the  case  of  •  continuous- tine  process  with  discrete-tine  observations,  the  state  estimate 
between  observations  is  determined  by 

Vp 

\izs 


(34) 


where 
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where  P  is  the  covariance  matrix.  The  state  estimate  is  updated  at  Instants  t^  of  observation  using 

UtJ  -  i(t4>  +  Kty^)  -  g<i(tJL))]  (3S) 

where  x(t.)  is  the  optimum  estimate  of  the  state  Juqt  prior  to  the  observation,  obtained  by  integrating 
Eq.(34)  over  the  interval  ( 3  starting  with  *(t,  ,).  The  presence  of  the  term  l/2(3f/3*)'P  in 
Eq.(34)  distinguishes  the  second-order  filter  from  the  Elf. 


In  1974,  Shreve  and  Bedrick  (21)  showed  that  the  covariance  matrix  can  be  propagated  in  the  separated 
form,  as  given  in  Appendix  A,  but  that  the  fundamental  state  separation  equation,  Eq.(l),  does  not  generally 
apply  unless  the  observation  equations  are  linear. 

Extension  to  Noise  on  Bias — By  the  manner  in  which  the  basic  theory  was  developed  in  (4)  it  was  apparent 
that  it  would  be  difficult  to  extend  the  bias  separation  result  to  the  case  in  which  b  is  a  random 
process,  i.e.. 


b  -  5 

where  (  is  white  noise.  The  difficulty  was  confirmed  by  the  aforementioned  1972  analysis  of  Agee  and 
Turner  (3). 

Since  an  absolutely  constant  bias  is  s  mathematical  idealisation — no  physical  bias  is  perfectly  constant 
for  all  time — an  extension  of  the  bias-aeparatlon  algorithm,  even  ss  an  approximation,  would  he  highly 
desirable.  The  need  for  such  an  extension  was  recognised  by  Tanaka  (22)  who,  In  1973,  developed  an  algorithm 
for  diacrete-time  systems  which  retains  some  of  the  features  of  the  original  algorithm  but  does  not  completely 
decouple  the  bias  calculations  from  the  calculation  of  the  dynamic  state.  The  poealhllity  of  dropping  terns 
in  Tanaka's  algorithm  when  the  nolee  ou  the  bias  la  small  might  merit  attention,  but  does  not  seem  to  have 
bean  explored. 

Note  recently,  Washburn  and  Kendo 1  (23)  have  generalised  the  results  of  (4)  in  several  directions. 

They  considered  the  general  process 


X  •  Ax  ♦  c&s  •  u 

x  • Ct  *  tiht  ♦  w 


(14) 


with  y  -  Kx  ♦  cHs  ♦  v 

which,  as  e*0  reduces  to  two  uncoupled  systems  in  which  the  suhstate  t  ia  not  observed.  They  treated 
this  problem  by  assuming  the  desired  optimum  estimate  x  to  he  of  the  form 

x  «  Cx  *  V*  ♦  t  03) 


This  is  e  generalisation  of  the  basic  separated  estimation  formula,  Eq.(l),  owing  to  the  appeerenca  of  the 
matrix  C  and  the  correction  term  c  •  For  the  general  case  of  £q.(34),  the  generalised  separated 
estimation  formula  has  scarcely  any  advantage  over  the  cugaented-etate  form  that  would  arise  directly  from 
Eq. (34) ,  When  t  is  small,  however,  they  fcave  shown,  using  perturbation  methods,  that  a  subnptimsl  estimator 
of  tht  form  of  Cq.U)  (i.e,,  with  Ol  and  t*0  )  can  be  foimd.  These  results  were  illustrated  In  1 23 )  by  e 
simple  example  which,  however,  is  not  very  convincing:  perhaps  it  does  not  show  the  results  to  their  best 
sdvmotege.  Another  shortcoming  of  the  Washburn-Mendel  analysis  is  that  too  many  terms  are  deleted  when 
t*0  .  There  ie  no  need  to  include  t  in  front  of  I  and  H  in  Eq. (31)  sine*  existing  theory  already 
permits  the  treatment  of  these  terms  without  the  need  for  approximation,  nevertheless,  this  analysts 
suggests  the  possibility  of  using  perturbation  methods  as  s  general  method  of  extending  tits  results  of  the 
separated -bias  estimation  algorithm. 


4.  am,  i  an  cm  to  failuie  &mcn«s  axo  estimation 


The  difference  between  the  actual  observation  end  its  optimum  estimate ,  known  as  the  re- tdual  r  or 
innovation  process,  has  known  statistical  properties:  namely 

(1)  E(rJ  *  0  (aero  mean) 

(ii)  I(r(t)r'(T)J  •  Wd(t-t)  (white  noise  of  spectral  density  V) 

Deviations  of  the  empirical  atatiatics  of  tba  process  ts  obtained  from  operating  data,  may  serve  as 
sn  indicator  that  ths  actual  process  is  not  the  seme  ss  the  process  for  which  the  optimum  filter  wee  deelgmed. 
If  such  deviations  start  email  but  suddenly  become  Urge,  this  could  be  svtdemca  that  a  change  (I.e.,  a 
failure)  haa  taken  place  in  the  system.  The  general  use  of  residuals  for  fault  detection  sod  Isolation  was 
suggested  in  1971  by  Nthra  sad  Feschoo  (11).  A  number  of  Investigator*  subsequently  took  this  suggest loo 
end  developed  techniques  for  estimating  changes  in  bias,  and  hence  for  detecting  and  correcting  of  eyetem 
failures  that  nay  be  attributed  to  such  changes. 


In  the  context  of  fellure  detection.  It  le  useful  to  drew  e  distinction  between  determining  whether 
or  not  s  blss  Is  present  and  eatlaating  l(s  site  (i.e.,  aegnltude  and  direction)  given  (or  assuming)  that 
It  is  present.  Since  an  estimated  bias  b  of  xero  Is  equivalent  to  no  bias,  It  Is  reasonable  to  believe 
that  the  separated-blas  estimation  algorithm  can  be  used  to  advantage  for  failure  detection  and  estimation. 
In  1977  Bellingham  and  Lees  have  reported  using  essentially  this  procedure  for  detection  of  malfunctions  In 
chemical  process  control  systems.  The  basic  technique  and  some  simulation  results  are  given  In  [24]  and 
and  some  experimental  results  are  presented  la  1 25 1  * 

A  significant  limitation  of  the  separated  bias  algorithm,  as  developed  In  [4],  is  that  It  rests  on  the 
assumption  that  the  bias  la  constant  from  the  start  of  estimation  forever  after.  The  consequence  of  this 
assumption  Is  that  the  bias  estimator  gain  matrix  tends  to  zero  (as  1/t)  and  hence  the  capability  of  the 
estimator  to  track  a  bias  change  that  occurs  after  the  estimator  is  turned  on  diminishes  with  time.  Since 
the  theory  was  developed  on  the  assumption  that  the  bias  does  not  change  there  Is  no  reason  to  expect  that 
the  bias  estimator  would  have  such  a  capability,  but  it  does  Impose  a  practical  limitation  upon  using  the 
basic  algorithm  without  modification.  One  obvious  ad-hoc  modification  would  be  simply  to  prevent  the  bias 
estimator  gain  from  going  to  aero,  by  assuming,  for  example,  that  the  uncertainty  In  the  bias  cannot  be 
reduced  beyond  an  arbitrarily  specified  level.  This  assumption  prevents  the  blss  estimator  gain  from  being 
reduced  to  zero.  As  has  been  shown  In  [13]  and  [26]  constant  bias  is  estimated  with  zero  steady  state 
error.  This  means  that  even  if  the  bias  changes  after  the  estimator  Is  turned  on.  If  it  remains  constant 
after  the  change,  it  will  be  estimated  without  error.  In  fact,  if  the  bias  is  piecewlae-constant,  but  the 
Intervals  between  transitions  are  widely  spaced  in  time,  the  bias  filter  will  track  the  piecewlae-constant 
bias  moderately  well. 

But  preventing  the  bias  estimator  gain  from  going  to  zero  is  an  ad-hoc  remedy  and  it  ought  to  be 
poaalbla  to  achieve  better  performance  by  uaing  more  aophisticated  statistical  procedures. 

One  procedure  for  failure  detection  which  has  recently  been  receiving  a  great  deal  of  attention  is  the 
“generalised  likelihood  ratio  (CLR)  technique”  developed  over  the  past  two  decades  on  the  basis  of  the 
pioneering  work  of  Abraham  Wald  on  sequential  estimation,  the  direct  application  of  the  Cl*  technique  for 
estimating  jumps  (Including,  but  in  principle  not  limited  to,  changes  in  bias)  in  dynamic  syatema  was 
proposed  by  Ulllsky  end  Jones  (27)  in  1976.  They  showed  that  the  residual  In  caae  of  bias  change  can  be 
expreaaed  as 


r  •  Cv  -  v  (36) 

where  v  is  the  bias  change  to  be  estimated,  w  is  aero  mean  white  noise,  and  C  is  a  matrix  calculated 
in  "he  CL*  algorithm.  And,  reasoning  from  (36)  they  established  a  correction  equation  of  the  form 

x  •  *  ♦  (S-F)v  (37) 

where  X  N  the  eat (mate  of  x  under  the  hypothesis  of  ao  failure 

v  la  the  CL*  estimate  of  v 
«  ,  *  are  matrices  defined  by  toe  C'_a  elgorlthm. 

The  CL*  eeUmet*  v  is  obtained  by  processing  tbs  sequence  of  residuals,  as  explained  In  [27). 

These  results  apparent!*  wars  obta.aed  Independent  of  the  prior  results  on  Mae  estimation.  It  became 
clear  subsequently,  however,  that  the  matrix  C  in  Sq. (36)  U  equivalent  to  M  in  lq. (23)  shove  and  S-F  1$ 
equivalent  to  V  In  feq.(l).  Hence  s  close  ralstivnehi-  between  the  CL*  method  and  the  sarlter  bias 
estimstlm  mst^d  cap  be  sse«  i  both  methods  no*  the  residual*  of  th-.  bias- 't*#  ftltsr  to  obtain  an  estimate 
of  the  blss  (it  or  v  )  and  bqth  use. the  resulting  estimate  to  correct  the  Usa-fres  estimate.  The  procedure* 
used  to  obtain  the  estimate  b  or  v  are  different,  of  course.  cviu&  to.qh*  assumption  that  b  le  a  constant 
for  all  ting,  the  ettlMtiun  equation  for  b  1a  linear,  and  fairly  simple,  whereas  the  CL*  algorithm  for 
obtaining  v  la  nonlinear  sad  quite  complex.  In  circles  in  which  tbs  CL*  algorithm  is  popular,  it  Is 
generally  felt  that  the  added  complexity  of  tits  algorithm  gives  it  advantages  in  terms  of  superior  performance. 

Later  In  1976,  Chian  and  Adams  (2*)  published  Mother  failure  detection  method  which  also  usee  the 
residuals  of  the  bias-free  estimator,  this  technique  employs  lie  ssqusetUl  probability  ratio  test  (SntT), 
which  also  derives  from  the  original  work  of  \t\ld  but  differs  in  eose  of  its  details  from  the  CL*  method  of 
vtlllsky  and  Jones.  CV’en  and  Adams  observed  several  deficiencies  of  the  5F*T,  namely  that  the  latter  did 
not  account  for  the  likelihood  cf  no  folium  when  the  estimator  is  first  tuned  on,  end  consequsmtly  chat 
a  failure,  it  it  were  to  occur,  would  occur  later,  they  proposed  an  ad  hoc  correction  to  the  S?*T  to  overcome 
these  deficiencies,  end  demot.sirated  the  applicability  of  the  method  to  inertial  navigation  system. 

tn  1979  Chang  and  Duma  [  2f)  returned  to  the  CL*  approach  of  WllUky  end  Jonas,  thsy  presented  a  recursive 
algorithm  for  iaplemamtlag  tha  CL*  calculations  and  showed  that  this  algorithm  can  be  interpreted  as  being 
those  of  a  Kalman  filter  for  estimating  s  constant  bias,  starting  nt  the  time  or  occurrence  of  tha  )oqi.  The 
relationship  between  the  bias  separation  method  and  tha  CL*  approach  was  thus  made  quite  explicit.  (This 
relationship  was  recently  considered  further  by  Caglnyea  )  30)). 

toolbar  approach  to  tha  problem  of  detecting  and  eatlaating  tha  occurrence  and  magnitudes  of  transitions 
la  a  plccswUs  constant  signal  was  introduced  by  Frledlend  la  1979  [31,32).  Tbs  underlying  ids*  is  to 
model  changes  la  tha  bins  ns  the  result  of  a  highly  namgsussian  noise  input,  i.e.. 


where  v  has  a  probability  density  function  that  Includes  s  delta  function  at  tha  origin  to  account  for 
tha  finite  probability  that  no  transition  la  b  takes  place  at  any  given  instant.  Using  tvallabU  theory 
for  maxima  likelihood  estimation  with  momgsusslsn  noise,  Frledlend  developed  an  approximate  recursive 
algorithm  for  obtaining  a  maximum  likelihood  estimate  of  b  .  Then,  assuming  that  the  interpretation  of 

H  m.  f bar  hi  a*  u  MulvaUnt  to  extraction  of  s  constant  burled  in  whits  noise)  remains 


valid  when  the  bias  is  not  constant  but  only  piecewise  constant,  Frledland  and  Grabousky  (  33]  developed  a 
recursive  algorithm  for  detection  of  failures  in  dynamic  systems.  This  algorithm  seems  computationally 
comparable  to  the  Chang-Dunn  recursive  implementation  of  the  GLR  algorithm,  but  a  detailed  comparison  of 
the  two,  which  would  reveal  any  significant  differences,  remains  to  be  performed. 

5.  ADDITIONAL  APPLICATIONS 

Trajectory  Estimation — Our  work  on  separated  bias  estimation  was  motivated  by  our  observation  that  there  are 
many  problems  in  which  there  are  relatively  few  dynamic  state  variables  but  a  large  number  of  parameters  to 
be  estimated  and  that  separation  of  the  parameter  estimation  from  the  estimation  of  the  dynamic  state  variables 
would  be  computationally  advantageous.  This  was  borne  out  by  the  experience  of  Agee  and  Turner  [5]  wtv  used 
the  method  in  the  program  they  developed  for  their  BIT  (Best  Estimate  of  Trajectory)  computer  pi  They 

found,  for  example,  that  although  there  might  be  only  nine  dynamic  state  variables,  there  might  be  s*  many 
as  sixty-six  constant  parameters  in  an  accurate  model  of  the  sensors  used  on  the  White  Sauds  Missile  Range 
where  the  program  would  be  employed.  They  examined  several  other  techniques  but  discarded  them  because  of 
numerical  difficulties  or  because  they  did  not  produce  satisfactory  estimates  of  the  bias.  They  reported 
using  the  algorithm,  with  the  modifications  discussed  earlier,  with  both  simulated  and  actual  trajectory 
data. 


We  used  the  bias  separation  technique  in  our  own  work  on  orbit  sad  satellite  mass  determination  [34] 
which  was  reported  in  1970.  Although  there  were  not  a  large  number  of  parameteis  to  be  estimated,  the 
method  was  beneficial  because  our  progrssa  was  to  be  davsioped  by  modifying  an  existing  program  in  which  no 
provision  was  made  for  estimating  these  and  which  was  written  in  a  manner  that  did  not  readily 

lend  Itself  to  increasing  the  dimension  of  the  state  vector  to  accoonodate  the  additional  state  variables. 
But  it  was  fairly  easy  to  add  the  bias  estimation  capability  to  tbe  existing  program. 

Aided- Inertial  Navigation— When  salaen  filtering  technique  is  applied  to  mixing  inertial  data  with  other 

navigation  data,  it  is~fcupd  that  many  of  the  variables  to  be  estimated  are  actually  biases.  These  Include 
gyro  end  accsiurottstyr  hie*  (“drift  rates")  scale  factor  uncertainties,  misalignment  angles,  and  uncertain 
parameters  is*  ih?  lysvijatiou  aid  such  as  floppier  scale  factor  and  boresight  errors.  The  bias  separation 
techniq^r  ieecvthsd  herein  would  be  appropriate  for  this  application.  Its  use  in  this  context  la  suggested 
by  Fsitiili  1 15 j  and  alluded  to  by  Seat;  et  al  !  3$ )  in  connection  with  testing  of  inertial  navigation  systems 
and  components. 


Calibration— -Also  in  connection  with  inertial  navigation,  as  well  as  in  other  applications,  it  is  necessary 
to  detersise  the  bias  vector  b  in  a  system  defined  by  Eqs.O)  and  (4).  An  optimum  estimate  of  the  state 
x  ia  often  not  required  in  such  eeeee.  It  is  clear  that  the  strutter;  of  Fig.  1  can  be  used  tu  obtain  an 
optimum  estimate  b  ,  and  the  correction  term  Vh  can  be  omitted  if  x  la  not  required.  This  application 
was  described  by  Frisdlaad  |ji)  tn  1977  in  which  the  method  wag  illustrated  by  an  example  of  the  calibration 
(f.e.,  determination  of  drift  retes  and  g-de pendent  error  coefficient#)  ot  a  two-axis  gyro. 

A  by-product  of  the  analyst*  of  (}7J  vea  an  alternate  representation  of  the  bias  estimation  equation, 

Fq . (10) ,  U  was  shown  that  the  bias  b  can  be  expreesed  as  the  product  of  the  bias  covariance  matrix  P^ 
aa  obtained  in  £q.(2$)  with  a  vector  q  which  ia  obtained  by  integrating  the  weighted  residuals,  In 
mathematical  terms ,  it  wma  shown  that 


b  *  V 

i») 

where 

*  -  a*s*lr 

09) 

TMa  form  of  the  bias  estimation  equation  is  readily  verified  by  substituting  the  espee# sioo  for  &  and 
its  derivative  into  Eq.(lO),  raking  into  account  that  P^  is  given  by  Eq.(15). 

This  form  of  the  calibration  equation  Is  advantageous  when  an  estimate  of  b  is  needed  only  at  the 
end  of  a  fixed  calibration  interval.  In  this  case  it  is  necessary  only  to  determine  the  vector  q  ,  by 
numerical  integration  of  the  weighted  residual  H'g  r.  Then  at  the  end  of  the  predetermined  calibration 
time  T  the  estimate  U  obtained  by  multiplying  q{T  )  by  the  eoverienee  matris  F^fT  )  .  It  is  thus 
not  necessary  to  store  F.  for  other  instants  of  time.  This  results  ltj  a  considerable  reduction  in 
computet  storage  requirements  over  what  might  be  required  to  compute  b  using  Sq.(lO). 

From  tq.(IS)  It  is  readily  determined  that 

Fb(t)  (P‘l<0)  ♦  iu>rl 


Lit) 

»-l. 


where 

I  H'itJB^SiirJdt 

0 

In  the  absent*  of  e  priori  information  on  b  P.  *(0)  •  0  and  F.(t)  •  L  't)  .  ttenc*  L  must  have  aa 
inverse  for  some  value  of  t  to  order  that  b  can  be  determined,  te  most  canes  it  la  mecessary  to 
introduce  motion  into  the  dynamic  system,  l.e. ,  to  make  C(t)  and  D(t)  time-varying  so  as  to  produce  a 
matris  B(t)  ,  defined  by  Eq.(l2)  which  results  in  a  nonaingular  1.  matrix  for  some  value  of  time.  It  ia 
argued  in  |  37}  that  w.  optimal  choice  of  8  would  be  such  that  L(t)  ia  a  diagonal  matrix  which  ia  a 
gaaerailtad  orthogonality  requirement  an  the  elements  of  the  matrix  8  . 


Cio*«d-Looo  Control— It  la  of tae  necessary  to  deal  with  biases  in  closed- loop  systems.  Fortunately  the 
bias -separation  technique  ie  applicable  to  dosed  loop  systems  as  well  aa  open-loop  rysteac.  This  was 
demonstrated  by  Tacks r,  Las,  et  al  (16,17)  in  1977,  la  compaction  with  their  studies  of  interconnected 
electric  power  systems.  Another  use  of  the  aepareted-blaa  eatinatiom  technique  io  etsanctisq  with 
decent  rail  and  Control  of  power  systems  was  reported  by  Vsinkateewerlu  mad  MsLalanabi*  (  38)  In  1977.  The 
use  of  thin  technique  by  Bellingham  amd  Lacs  (26,71)  in  cnoaoctlea  with  closed-loop  control  of  chemical 
processes  was  discussed  above. 
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CONCLUSIONS 

The  pervasiveness  of  Kalaan-Bucy  filtering  theory  in  so  many  aspects  of  modern  control  and  estimation 
has  motivated  a  great  deal  of  research  directed  toward  making  the  theory  easier  to  use.  The  bias  separation 
method  reviewed  in  this  paper  is  a  result  of  this  type  of  research.  One  would  hope  that  this,  and  others 
in  the  same  spirit,  enhance  the  practical  utility  of  the  basic  theory. 

The  bias-separation  technique,  originally  presented  in  1969,  and  subsequently  extended  in  various 
directions,  has  a  variety  of  practical  applications  some  of  which  were  described  here.  But  a  number  of 
issues  regarding  this  technique  remain  and  merit  further  Investigation.  We  have  already  remarked  that  the 
question  of  how  the  theory  may  be  extended  to  cover  the  case  in  which  the  bias  is  not  deterministic  is  not 
completely  settled;  the  negative  result  of  Agee  and  Turner  counters  the  formal  separation  property  used 
in  the  derivation  presented  in  Section  2  of  this  paper.  Another  issue  that  is  not  fully  resolved  is  the 
proper  extension  to  nonlinear  systems. 

Another  area  of  investigation  that  might  merit  further  attention  is  the  relationship  between  the 
bias  separation  method  discussed  in  this  paper  to  other  methods  of  simplifying  the  Kalman  filter  calculations. 
Some  work  in  this  direction  has  already  been  done.  In  1974,  Samant  and  Sorenson  [39]  compared  the  bias 
separation  method  of  this  paper  with  an  order-reduction  method  in  which  only  a  portion  of  the  state-vector 
is  optimally  estimated.  Both  the  Samant  and  Sorenson  algorithm  and  the  bias  separation  algorithm  are  optimum 
under  the  same  set  of  assumptions  and  hence  ought  to  give  the  same  results.  But  there  are  differences  in 
computational  efficiency  as  measured  in  storage  and  number  of  operations.  Samant  and  Sorenson  conclude  that 
their  algorithm  requires  a  larger  number  of  operations,  and  is  thus  less  efficient  in  terms  of  speed,  but 
that  it  may  be  more  efficient  in  terms  of  storage.  In  1977,  Chang  and  Dunn  [40]  studied  the  errors  caused 
by  omitting  some  state  variables  from  the  model  used  in  the  design  of_  the^estimator.  If  the  states  omitted 
are  biases  then,  by  virtue  of  Eq.(l)  the  error  is  given  by  e»x-X”Vb.  Since  V  and  the  statistics 
of  6  arc  known,  the  statistical  properties  of  the  error  due  to  omission  of  b  can  readily  be  determined. 
Chang  and  Dunn,  however,  consider  a  more  general  case  than  is  represented  by  our  model  as  given  by  Eq .  (3) . 
Additional  studies  of  these  approximation  methods  might  be  worthwhile. 

Another  investigation  that  might  be  pursued  is  to  determine  the  Implications  of  the  dual  of  the  bias- 
separation  algorithm  In  regard  to  deterministic  optimum  control.  The  mathematical  duality  between  determin¬ 
istic  (linear,  quadratic)  optimum  control  and  Kalman-Bucy  filtering  is  well  knevn.  Hence  the  bias-separation 
method  ought  to  have  a  dual  in  optimum  control  and  this  dual  might  have  interesting  properties  with 
practical  application. 

With  regard  to  applications,  we  see  no  reason  why  any  problem  in  which  biases  are  treated  by  augmenting 
the  state  vector  cannot  be  treated  by  the  method  of  this  paper.  The  algorithm  is  entirely  straightforward. 

In  preparing  this  paper,  we  have  endeavored  to  review  all  the  applications  that  have  been  described  in 
archival  journals,  but  there  was  no  feasible  method  of  covering  applications  described  in  other  literature 
such  as  technical  reports  issued  under  government  contracts,  etc.,  except  those  that  were  brought  to  our 
attention,  such  as  the  report  of  Agee  and  Turner  [5],  or  a  recent  report  by  W.  E.  Hall,  et  al  [41]  describing 
an  application  to  rotorcraft  parameter  identification.  We  would  be  gr  teful  to  receive  descriptions  of 
additional  applications  that  may  be  known  to  readers  of  this  paper. 
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valid  when  the  bias  Is  not  constant  but  only  piecewise  constant,  Frledland  and  Grabousky  [  33]  developed  a 
recursive  algoritha  for  detection  of  failures  In  dynamic  systems.  This  algorithm  seems  computationally 
comparable  to  the  Chang-Dunn  recursive  Implementation  of  the  GLR  algorithm,  but  a  detailed  comparison  of 
the  two,  which  would  reveal  any  significant  differences,  remains  to  be  performed. 

5.  ADDITIONAL  APPLICATIONS 

Trajectory  Estimation — Our  work  on  separated  bias  estimation  was  motivated  by  our  observation  that  there  are 
many  problem  In  which  there  are  relatively  few  dynamic  state  variables  but  a  large  number  of  parameters  to 
be  estimated  and  that  separation  of  the  parameter  estimation  from  the  estimation  of  the  dynamic  state  variables 
would  be  computationally  advantageous.  This  was  borne  out  by  the  experience  of  Agee  and  Turner  [5]  who  used 
the  method  in  the  program  they  developed  for  their  BET  (Best  Estimate  of  Trajectory)  computer  program.  They 
found,  for  example,  that  although  there  might  be  only  nine  dynamic  state  variables,  there  might  be  as  many 
as  sixty-six  constant  parameters  In  an  accurate  model  of  the  sensors  used  on  the  White  Sands  Missile  Range 
where  the  program  would  be  employed.  They  examined  several  other  techniques  but  discarded  them  because  of 
numerical  difficulties  or  because  they  did  not  produce  satisfactory  estimates  of  the  bias.  They  reported 
using  the  algorithm,  with  the  modifications  discussed  earlier,  with  both  simulated  and  actual  trajectory 
data. 


We  used  the  bias  separation  technique  In  our  own  work  on  orbit  and  satellite  mass  determination  [34  ] 
which  was  reported  in  1970.  Although  there  were  not  a  large  number  of  parameters  to  be  estimated,  the 
method  was  beneficial  because  our  program  was  to  be  developed  by  modifying  an  existing  program  In  which  no 
provision  was  made  for  estimating  these  parameters,  and  which  was  written  in  a  manner  that  did  not  readily 
lend  itself  to  increasing  the  dimension  of  the  state  vector  to  accommodate  the  additional  state  variables. 
But  it  was  fairly  easy  to  add  the  bias  estimation  capability  to  the  existing  program. 

Aided- Inertial  Navigation— t-^en  the  Kalman  filtering  technique  is  applied  to  mixing  Inertial  data  with  other 
navigation  data,  it  is  found  that  many  of  the  variables  to  be  eef  '.mated  arc  actually  biases-  Those  include 
gyro  and  uccelerometer  bias  ("i.rift  rates")  scale  factor  uncertainties,  misalignment  angles,  and  uncertain 
parameters  in  the  navigation  aid  such  as  doppler  scsle  factor  and  bores ight  errors.  The  bias  separation 
technique  described  herein  would  be  appropriate  for  thia  application.  Its  use  In  this  context  is  suggested 
by  Farrell  (35]  and  alluded  to  by  Nash  et  cl  (34]  in  connection  with  testing  of  Inertial  navigation  systems 
and  components. 

Calibration — Also  in  connection  with  Inqrtial  navigation,  as  well  as  In  other  applications,  it  Is  necessary 
to  determine  the  bias  vector  b  In  a  system  defined  by  Eqs,(3)  and  (4).  An  optimum  estimate  of  the  state 
x  it  often  not  required  In  such  cents.  It  la  ejear  that  the  etructurj  of  Fig.  I  can  0*  uaed  to  obtain  an 
optimum  estimate  b  ,  and  the  correction  tarn  Vb  can  be  omi::*d  if  x  ts  not  required.  This  application 
was  described  by  Fnedlsnd  J  37 1  In  1977  in  which  the  method  we#  illustrated  by  an  example  of  the  calibration 
(i.e»,  determination  of  drift  rates  and  g-dependent  error  coefficients)  of  e  two-ati*  gyro. 

A  by-product  of  the  analysis  of  [37]  was  an  alternate  representation  of  the  bias  estfration  equation, 
Kq.(lO),  It  was  shown  that  the  blue  b  can  be  expressed  at  the  product  of  the  bier  covariance  matrix  P. 
as  obtained  In  tq.(25)  with  s  vector  q  which  la  obtained  by  integrating  the  “sighted  rot 1 duals.  In 
mathematical  terse,  it  was  shown  that 

b  •  P^q  (M) 

where  4  •  H'»_1r  09) 

This  form  of  the  bias  estimation  equation  la  readily  verified  by  substituting  the  expression  for  6  and 
its  derivative  into  Eq.flO),  taking  Into  account  that  Pfe  ts  given  hy  Kq.<25). 

Thia  form  of  tha  calibration  aquation  la  advantageous  when  an  satinets  of  b  is  needed  only  at  the 
end  of  a  fixed  calibration  Interval.  In  this  case  l|_ie  necessary  only  to  determine  the  vector  q  ,  hy 
numerical  Integration  of  the  weighted  residual  H'l  r.  Then  at  the  and  of  the  predetermined  calibration 
time  T  the  estimate  is  obtained  by  multiplying  q(T  )  by  the  covariance  matrix  P.(T  )  .  It  Is  thus 
not  nscSssary  to  store  P.  for  other  instants  of  time.  This  resultw  in  a  considerable  reduction  In 
computer  etoraga  requirements  over  what  might  be  required  to  compute  6  using  £q.(10). 

From  Eq, (25)  It  Is  readily  determined  that 

Pb(0  (Pbl(0)  +  L(t)fl 


whero 


In  tha  abaanca  of  a  priori  information  on  b 
lnversa  for  some  value  of  t  in  order  that  b 


L(t)  -  /VtOB’SiMdT 
-1  0  -1 

P  *(0)  -  0  and  P.(t)  •  L  (c)  .  Hence  L  suit  have  an 
can  be  determined.  In  moat  casts  it  is  necasssry  to 


introduca  motion  into  tha  dynamic  ays tarn,  i.a.,  to  maka  C(t)  and  D(t)  time-varying  ao  as  to  produce  a 
matrix  H(t)  ,  daflnad  by  Eq.(12)  which  rtaulta  in  a  nonalngular  1  matrix  for  soma  value  of  time.  It  la 
argued  in  [37]  that  an  optlaal  cholca  of  H  would  be  auch  that  L(t)  la  n  diagonal  matrix  which  is  a 
generalised  orthogonality  requirement  on  tha  elements  of  tha  matrix  H  . 


Closed-Loop  Control— It  is  often  necessary  to  deal  with  blaaaa  in  closed-loop  ay  a  tarns.  Portisutsly  th* 
bias-separation  technique  is  applicable  to  closed  loop  systema  ea  well  as  open-loop  eye tens.  This  was 
demonstrated  by  Taekar,  Lae,  et  el  [16,171  In  1972,  in  connection  with  their  studies  of  Interconnected 
electric  power  systems.  Another  use  of  the  aeparated-blaa  estimation  technique  in  connection  with 
decentralised  control  of  power  systana  waa  reported  by  Vankateswarlu  and  Nahalanabls  [38)  In  1977.  The 
use  of  thia  technique  by  Bellingham  and  Leas  [24,25]  in  connection  with  closed-loop  control  of  chemical 
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Process  and  Observation  Models 

x(n+l)  “  4(n)x(n)  +  B(n)b  +  u(n) 
y(n)  “  C(n)x(n)  +  D(n)b  +  v(n) 
where  b  «  constant  (to  be  determined) 


E(u(n)u' (k) ]  -  Q(“)6nk  •  E[v(n)v' (k) ]  -  RQ5nk  ,  E[u(n)v' (k)]  -  0 


Bias-Separated  Filter 

x(n)  “  x(n)  +  V(n)b(n) 

A 

where  x(n)  -  optimum  estimate  corrected  for  bias 

x(n)  »  bias-free  estimate 
b(n)  -  optimum  estimate  of  bias 


after  processing  observation  y(n) 


Bias-Free  Filter 


where 


x(n)  »  x(n)  +  K(n)r(n) 

x(n)  «  4(n-l)x(n-l)  «  predicted  state  of  bias-free  filter 
r(n)  «  y(n)  -  C(n)x(n)  -  bias-free  residual 
K(n)  ■  bias-free  filter  gain  matrix 


Bias  Estimator 

b(n)  -  b(n-l)  +  ^(nMifa)  -  H(n)b(n-1)] 
where  H(n)  -  C(n)U(n)  4  D(n) 

K^(n)  «  bias  estimator  gain  matrix 


Matrix  Propagation  Equations 


Bias  Free  Gain: 

K(n) 

Prior  Covariance: 

P(n+1) 

Posterior  Covariance: 

P(n) 

Bias  Gain: 

Bias  Covariance: 

M-1(n+l) 

U  (n+1) 

V(n) 

P(n)C(n)[C(n)P(n)C(n)  +  R(n)]-1 
4>(n)P(n)$' (n)  +  Q(n) 

(I  -  K(n)C(n) ]P(n) 

M(n+l)H(n)R_1(n) 

M-1(n)  +  H'(n)[C(n)P(n)C'(n)  +  R(n)]"1 
♦(n)V(n)  +  B(n) 

U(n)  -  K(n)H(n) 
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SUMMARY 


This  paper  examines  the  structural  differences  and  performance  characteristics  of  several  distinct 
estimation  algorithms,  as  applied  to  some  practical  continuous  and  discrete-time  state  estimation  problems. 
The  extensive  simulation  results  presented,  indicate  that  when  noise  inputs  are  not  "too  small"  and 
appropriate  a  pricri  estimates  are  available,  the  extended  Kalman  filter  can  be  expected  to  perform 
satisfactorily.  When  nonlinear  effects  are  significant,  the  realizable  minimum  variance  filter  is 
remarkably  superior  to  any  other  filter  investigated.  When  the  level  of  noise  inputs  are  large  enough  to 
effectively  cover  the  effects  of  nonlinearities,  no  particular  filter  can  be  said  to  be  consistently 
superior  to  any  other  filter. 
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SECTION  1 

INTRODUCTION 


One  practical  problem  of  great  Importance  in  control  theory  la  the  estimation  of  the  state  of  a 
physical  system,  on  the  basis  of  noisy  measurements.  For  linear  systems  with  additive  white  noise  the 
procedure  for  obtaining  optimal  unbiased  minimum  variance  estimates  was  first  formulated  by  Kalman  and 
Bucy  [1],  and  it  has  been  successfully  applied  to  numerous  engineering  and  scientific  problems. 

In  contrast  to  this,  truly  optimal  nonlinear  estimation  algorithms  have  not  bean  practically 
lmplementable,  since  it  requires  an  infinite  dimensional  system  to  reallsa.  Therefore,  considerable 
attention  has  been  devoted  to  methods  of  approximating  the  a  posteriori  density  functions  based  on 
perturbations  relative  to  a  prescribed  referenca.  The  majority  of  these  techniques  [2-4]  employ  the  Taylor'a 
aarlas  expansions  of  the  dynamic  and  measurement  nonlinaarltles,  neglecting  second  or  higher-order  terms. 

As  long  as  the  second-order  (and  higher-order)  terms  in  the  perturbation  equations  are  negligible, 
the  application  of  first-order  extended  Kalman  filters  (BKF)  has  been  found  to  yield  valid  and  satisfactory 
results.  If  nonlinearities  are  significant,  however,  first-order  approximations  of  the  aystea  equations  are 
inadequate,  and  the  EKFs  tend  to  be  unatable  and  exhibit  divergent  behaviour.  In  soma  of  these  cases, 

Kuahner  [5]  and  Athana  at  al  [6]  reported  that  filter  performance  can  be  substantially  improved  by  local 
iteratlona  or  the  lnclueion  of  second-order  effects.  On  the  other  hand,  Schvarta  and  Stear  [7]  on  the  basis 
of  their  simulation  study,  concluded  that  the  added  computational  complexity  of  several  second-order  filters 
may  not  provide  useful  Improvements  relative  to  the  EKF,  In  Kuehner's  simulation  (3]  of  the  Van  der  Pol's 
equation,  he  reported  that  the  implementation  of  a  modified  Gaussian  second-order  filter  cannot  prevent  the 
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filter  from  diverging.  Jazwinski  [8]  in  his  textbook,  stated  that  it  is  questionable  whether  higher-order 
approximations  would  improve  performance  in  esses  where  the  extended  Kalman  filter  does  not  work  at  all 
(diverges). 

'  ,  i 

Attempting  to  alleviate  some  of  the  difficulties  of  the  Taylor  series  expansion  approach, 

Sdnahara  [9]  proposed  to  replace  nonlinear  dynamic  functions  by  quasi-linear  functions  via  statistically 
optimized  approximation.  His  results  and  also  those  of  Austin  and  Leondes  [10]  Indicate  that  this  approach 
may  be  more  accurate  than  those  of  Taylor  series  expansions. 

Recently,  Liang  and  Christensen  ill]  derived  a  realizable  minimum  variance  estimation  algorithm 
for  nonlinear  continuous  systems  using  the  matrix  minimum  principle  together  with  the  Kolmogorov  and 
Kushner's  equations.  They  also  applied  the  matrix  minimum  principal  to  derive  nonlinear  estimation  algorithms 
[12]  for  discrete-time  nonlinear  time-delayed  systems  with  measurements  corrupted  by  white  noise  and  non-white 
noise  processes.  They  noted  that  for  systems  with  polynomial,  product-type  or  state-dependent  sinusoidal 
nonlinearities  [13],  their  proposed  minimum  variance  algorithm  can  be  practically  realized  without  the  need 
of  approximation  under  the  assumption  that  the  estimation  errors  are  Gaussian. 

Since  it  is  difficult  to  theoretically  assess  the  virtues  of  any  one  nonlinear  filter  vis-a-vis 
the  others,  it  is  necessary  to  conduct  extensive  numerical  simulations  and  tests  to  provide  meaningful 
comparisons  between  the  performance  characteristics  of  the  filters.  Simulations  of  various  nonlinear  filters 
not  only  could  provide  considerable  insight  into  the  stability  behaviour  of  some  of  these  filters,  but  also 
provide  ad  hoc  guidelines  to  establish  situations  in  which  specific  nonlinear  algorithms  would  have 
demonstrable  advantages. 

In  Section  2,  we  deal  with  state  estimation  problems  of  continuous-time  nonlinear  dynamic  systems. 
Various  structures  of  dynamic  continuous-time  finite  dimensional  nonlinear  filter  are  tabulated.  Two  types 
of  dynamic  systems  with  non-negligible  nonlinearities  were  selected  snd  simulated  on  a  digital  computer. 
Extensive  simulation  results  accompanied  with  discussion,  are  presented  to  compare  the  performance  behavior 
of  theae  filters. 

In  Section  3,  we  deal  with  state  estimation  problems  of  discrete-time  nonlinear  systems.  Section 
3.1  presents  s  brief  summery  of  Liang  and  Christensen's  nonlinear  dlacrete-tlae  filtering  algorithm  [12]. 
Section  3.2  applies  their  minimum  variance  filtering  algorithm  (MVP)  to  e  general  clssa  of  discrete-time 
state  estimation  problems,  where  the  measurement  modal  and/or  system  model  contains  second-order 
nonlinearities.  Simulation  results  accompanied  by  discussion  are  presented  in  Section  3.3  to  compare  the 
performance  behavior  of  the  HVF  end  EKF. 


SECT IQS  2 

COMPARISONS  OP_CONTimroUS-TXME 
NONLINEAR  FILTERS 


2.1  INTRODUCTION 

Consider  a  class  of  nonlinear  systems  described  by  the  stochastic  differential  equation  (ft] 

f(x(t),  t]  ♦  CU(t).  t]  w(t)  (2.1) 

with  measurement  given  by 

*(t)  -  h(x(0,  t)  ♦  v(t)  (2.2) 


Where  x(t)  end  t(t)  are  the  n -dimensional  atate  and  m-dimenalonal  meaeuremant  vectors,  f  end  h  are, 
respectively,  n-  end  m-d leans tonal  uonllnasr  vector  valued  functions,  end  G  is  e  vector  valued  matrix. 


The  random  vectors  w(t)  and  v(t)  ere  etetletlcally  independent  taro-meeo  white  Causa lan  noise 
processes  such  that  for  ell  t,  t  £  to 


Coviw(t) ,  w(t)}  “  ¥w(t)  S(t-t) 
CovMt),  v(r))  -  f^CO  ft(t-t) 


(2.3) 


and 


Cov(w(t),  v(t))  -  0 


Where  <5(.)  is  the  Dim  delta  function,  and  the  variances  ?v(t)  end  V^d)  ere  moo -negative  definite  end 
positive  definite,  respectively. 


The  Initial  atata  vector  x(t_)  —  a  la  a  aero-mean  Cauaaiaa  random  process,  ladepeadant  of  w(t) 

O  0 

and  v(t)  for  t  >  t  .  with  a  positive  definite  variance  matrix 
o 

Var{«(to),  *<to>)  «  V#(to) 


In  the  design  of  nonlinear  filters,  a  masher  of  differeat  exact  and  approximate  non  11  sear  atate 
and  error-variance  equations  have  been  proposed  la  the  literature  (3,7,ft,t,U).  For  comparative  purpoeoo, 
various  structures  of  chess  nonlinear  flltera  are  tabulated  in  Table  2.1. 
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TABLE  2.1  VARIOUS  EXACT  AMD  APPROXIMATE  NONLINEAR  FILTERS 

Message  Model:  x(t)  —  f(x(t),t]  +  G[x(t),t]v(t) 

Measurement  Model:  z(t)  —  h(x(t),tj  +  v(t) 


FILTER  NOMENCLATURE 


FILTER  DYNAMICS 


ERROR- VAR IANCE  EQUATIONS 


Extended  Kalman  Filter 


Modified  Minimum 
Variance  Filter 


S  -  f <8,t>  ♦  V.  3h  11^  V,  -  V-  +  Vj  - +  G[i,t 

Y  -1  j*  -  h(S,t)l-(A)  3hT(g.t)  -1  3h(8,t)  „  . 

1  '  ~  x  38  \  38  vx  _< 

2 

8  “  f(8,t)  ♦  3  :  V-  —  Right-hand-aide  of  equation  (B) 

1  3a2  * 

y  >  v  —  y 

Vj  vi  n  v 

| i  -  h(8,t)  -  j 

v 4 -(C) 


+  G[2,t]'fwGT[a,t] 


Truncated  Minimum  Same  as  equation  (C)  V.  -  Right-hand-side  of  equation  (B) 

Variance  filter  ,  ,  ,2K/a  .  w  ,,2, 


Quasi -Moment  Minimum  Same  as  aquation  (C) 

Variants  Filter 


.  i  V  2  1  l  M*.»U  .  y  -lL  .  u(8, t)-  5 

2  *  *a2  v  '  2  b*2  « 

*•  Right-hand-side  of  equation  (B) 

*  V  :  &&&  ,  y/  {,  .  h(8,t>  - 1  ii^t':va} 


Stochastic  Linearisation  f(8  ♦  8,t)  ♦  B(S  ,  ,  .  . 

niter  T  x  vi  “  81i  tT<*  *  *.t)J  *  E{f<*  *  S,t)xT>  ♦  ci*,t;v 

h  (8  ♦  5,t))lL  ' 


oleum  Variance 


I  -  ,  CT(8,t)  *  4*  HT(8  *  8. t)lf  “l  fi{h(x  *  8.  Ox1)  — ( 

{«  -  h(«  ♦  i.t)}  -(0)  1  *  v  '  ' 

4  -  Same  as  equation  (0)  Vg  «  Right-hand-side  of  equation  (E) 

♦  fi|i  ST  hT(8  ♦  fi,t)  -  Vg  HT(i  ♦  8,t)| 

.  rv‘l  j «  -  Mi  ♦  ft,t)} 


feasor  Computet  loo: 


a2,f#  92f .  (8,0 

(^,,^.5  ,llKV 

**  1  J,h*l  J  * 

,  9\ca,t)  •  »\(M) 

j  *  — )  -S  *ae  aff —  (vt-  #  vtr  vta> 

i  at2  ij  s  M,  Mt  >r  lr  * 
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filters,  the  only  difference  is  the  error-forcing  term  that  appears  in  the  error-variance  equations.  In 
the  quasi-moment  filter  it  enters  vich  a  plus  sign,  in  the  truncated  filter  that  term  enters  with  a  minus 
sign  and  a  factor  of  one-half,  and  the  modified  minimum  variance  filter  is  a  compromise  between  the 
truncated  and  the  quasi-moment  filters.  In  comparing  the  stochastic  linearization  filter  with  the  minimum 
variance  nonlinear  filter,  the  later  has  obviously  preserved  the  error  forcing  term.  On  the  other  hand, 
comparing  the  first  four  linearized  filters  with  the  last  two  filters,  the  essential  differences  are  due  to 
approximations  made  of  expectations  of  f(x),  xh(x)  and  h(x).  It  is  also  noteworthy  to  mention  that  when 
h(x)  is  linear,  the  modified,  the  truncated  and  the  quasi-moment  minimum  variance  filters  are  Identical, 
and  the  minimum  variance  filter  is  identical  to  the  stochastic  linearization  filter. 

In  order  to  compare  the  performance  characteristics  of  these  nonlinear  filters,  two  different 
types  of  dynamic  systems  vith  non-negligible  nonlinearities  were  selected,  the  stochastic  equations  were 
transformed  to  Straronovlch's  forms  [8]  and  then  simulated  on  a  digital  computer.  The  integration  scheme 
was  performed  by  the  minimum-error-bound  fourth-order  Runge-Kutta  method  [14]. 

2.2  NONLINEAR  FILTERS  FOR  PHASE-LOCK  LOOP 


In  this  section,  we  deal  with  the  design  of  nonlinear  filters  for  a  two  dimensional  phase- lock 
loop.  This  phase  detection  problem  is  of  msjor  technological  importance  and  widely  known  in  satellite 
cosaunicatlon.  The- dynamic  model  selected  consists  of  phase  and  phaae  rate  with  all  the  plant  noise 
additive  to  the  phaae  rate  only.  Namely: 


the  measurement  model  is  represented 


as: 


*,  ” 


COS  Xj 


1 

*  V2 


■»  sin  Xj 

variances  V  ,  t  i 
l 

linearisation  fill 


where  w,  v^  and  v^  are  aero  ornan  white  Gaussian  noise  processes,  with 

Filtering  equations  for  the  extended  Kalman  filter  (EXF) ,  stochastic 
and  minimum  variance  filter  (MVF)  are  presented  as  follows: 

Extended  Kalman  Filter 
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Setting 


Minimum  Variance  Filter 


The  equations  for  and  are  Identical  with  those  of  the  S.L.F.,  whereas  the  error  variance 
equations  have  the  following  terms  in  addition  to  those  of  the  S.L.F: 

For  Vn  add  -V^  E  F 

For  Vj^  add  -V^  V^2  E  F  where 

For  V22  add  -V*2  E  F 

A  careful  examination  of  these  equations  Indicates  that  the  major  difference  between  the  EKF  and 
SIF  Is  due  to  the  approximation  made  of  E  “  exp  (-  V^/2).  The  effect  of  this  term  Is  accentuated  when  the 

measurement  residual  Is  significant,  and  the  error  variances  are  large.  The  major  difference  between  the 
SLF  and  MVF  is  due  to  the  error  forcing  terms  which  appear  in  the  error  variance  equations.  The  effect  of 
this  term  in  proportion  to  all  Che  other  terms  is  accentuated,  when  Is  small  and  the  measurement 

residual  Is  significant.  The  measurement  residual  Is  significant  when  the  a  priori  state  estimates  are 
significantly  different  from  the  true  values.  It  Is  also  affected  by  the  level  of  noise  inputs. 

To  compare  the  performance  characteristics  of  these  three  filters,  their  filtering  equations  were 
simulated  on  a  digital  computer.  Each  /^(output  response  presented  in  Figures  2.1  to  2.8  represents  the 
average  results  of  5  simulation  runs,  that  are  representative  of  many  other  simulation  runs  not  presented 
In  this  paper. 

Figures  2.1  and  2.2  show  the  effects  of  Initial  state  estimates  for  the  following  two  sets  of 
prior  statistics: 

Figure  1:  XjlO)  -  XjlQ)  -  2.0,  ty  -  -  0.01,  Vu(0)  -  VJ2(0)  -  C.01  and  V^CO)  »  0 

Figure  2:  Rj(0)  -  x2<0)  -  2.0,  tv  -  ~  0.01,  and  V.jCO)  “  V,2(0)  ~  vu(°>  “ 

When  ty  and  are  increased  to  0.1,  simulation  results  for  similar  conditions  were  obtained  and  presented 
in  Figures  2.2  and  2.4. 

From  Figure*  2.1  and  2.4,  it  Is  apparent  that  the  performance  of  the  EKF  and  SIF  are  almost 
Identical  to  that  of  the  MVF  for  small  noise  variance*  with  large  error-variance  and  when  a  priori  state 
estimates  are  elose  to  the  true  values.  However,  when  . t  state  estimate*  are  significantly  d.i>feioni 

from  the  true  values,  and  initial  error-variances  are  overly-«pM*>l*tlc,  the  performance  charactertetics  of 
both  the  EKF  and  StF  are  significantly  inferior  to  these  of  the  MVF.  The**  esperimentai  result*  agree  well 
with  what  vaa  theoretically  espeeted.  It  Is  also  clear  that  the  MVF  la  significantly  mare  insensitive  to 
the  selection  of  •  ffidK  state  estimates  and  Initial  error-variances. 

For  noise  variances  of  one  onit.^^j output*  of  these  three  filter*  are  presented  tn  Figures  ?.2 
and  2.8.  It  is  Interesting  to  observe  that  when  the  noise  Input  levels  are  relatively  high  compared  to.  the 
cffecta  of  nonllnearities,  the  EKF  is  as  good  a*  any  other  nonlinear  filter  investigated,  and  no  particular 
filter  can  he  said  to  he  consistently  superior  to  any  other  nonlinear  filter,  this  appears  to  he  intuitively 
obvious,  because  large  noise  inputs  can  effectively  “cover"  neglected  nool inear  it  lea. 

On  the  other  hand,  when  the  noise  variances  are  reduced  to  0.001,  both  the  KKF  and  StF  diverged, 
while  the  MVF  can  track  the  true  value*  of  the  phase  and  phase  rate  amaslngly  well,  typical  simulation 
results  for  some  of  these  run*  are  presented  In  Figures  2.2  and  2.g.  however.  It  sheuid  be  noted  that  If 
the  initial  error-variances  are  1.0,  the  noise  variances  are  further  reduced  to  0.00*1.  and  when  the  initial 
state  estimates  are  far  from  the  true  values  of  the  system  states,  even  the  MVF  would  diverge  as  others  had 
dome  so  much  sooner.  gut  If  one  were  to  drastically  reduce  the  initial  variances,  occasionally  the  Mvr 
and  also  the  EKF  and  StF  would  alt  be  stable  again,  with  the  MVF  still  performing  better  than  the  others, 
this  indicates  that  In  seme  application*  of  neoltmear  filters,  the  system  designer  must  be  cautious  mot 
to  select  the  initial  variance*  to  be  loo  Urge. 

2.1  EonUesat  Filter*  for  Van  Per  Fol'a  Oscillator 

Comstdur  a  state  estimation  problem  Of  the  Vaa  der  Fol'a  osclllatoi  described  by 


>  * 

«2  -  -Sj  »  J*j(l  -  *|) 

with  measurement  given  by 

«  “  *  0. ‘"J  *  » 

This  axMgle  was  selected  not  only  because  of  It*  third-order  nonllnearities,  but  also  that 
Cuthoer  had  prevtoualy  shown  that  for  this  state  estimation  problem  even  with  a  linear  measurement  model, 
the  linearised  filter  was  extremely  unstable  and  was  completely  utalea*  within  a  fraction  of  a  time  unit. 
Even  the  implementation  of  a  Cauaalen  second -order  filter  proved  to  bt  unstable  therefore.  In  this  study, 
we  are  only  concerned  with  the  comparison  of  a  minimum  variance  filter  with  a  stochastic  linearisation 
flltar. 


,.-l 


t  *  cos  a,  (z,  -  E  cos  £  )  +  1  sin  8,  (z,  -  E  sin  8) 
vr  11  1  v2  12  1 


*  wn 


The  structure!  u>  .erence  between  these  two  filters  is  mainly  due  to  the  preservation  of  error 
forcing  terms  in  the  M1’".  Namely: 

For  the  additional  term  is  -0.6  2^  [r  -  2] 

For  the  additional  term  is  -0.6  2j  l*  -  2] 

For  V  j  the  additional  term  is  -0.6  V  ^  2^  [z  -  2 1 

where  2  il  -  0.1(2^  ♦  32J  Vu> 

Therefore,  it  can  be  expected  that  the  difference  between  the  SLF  and  MVF  will  be  accentuated  whenever  the 
values  of  initial  error-varianees  and  2j  are  Increased  and  also  when  the  measurement  residual  is  significant. 

To  verify  this,  some  typical  results  of  extensive  simulations  are  presented  in  Figures  2.9  to 
2.13.  Here,  we  have 

2^0)  -  x1(0)  -  2.0,  22<0>  -  x2(0)  -  0.0  and  V'u(0)  -  0.0 

Figure  2.9  illustrates  the  effects  of  noise  corruption  on  filter  measurement  input.  Figures  2.10  to 

2.12  Illustrate  the  effects  of  noise  variances  and  initial  error-variances  on  the  performance  of  the  SLF 
and  MVF.  From  these,  it  is  evident  that  when  initial  error-variances  are  very  small,  and  the 
a  pi»£. o»*i  estimates  are  reasonably  accurate,  the  performance  characteristics  of  the  SIF  and  MVF  are  almost 
Identical,  with  the  SLF  having  a  slightly  smaller  phase-shift.  However,  as  expected  from  theoretical 
considerations,  for  Increasing  values  of  initial  errer-varience*  and  noise  variances,  the  difference 
between  the  performance  of  the  SLF  end  MVF  becomes  more  significant.  Here,  quite  consistently,  the  MVF  is 
far  more  capable  than  the  SLF  in  tracking  the  true  values  of  the  states  and  the  former  is  also  much  less 
sensitive  to  these  selections  of  initial  error-variances. 

To  further  illustrate  the  difference  in  filter  responses.  Figure  2.13  presents  the  filter  state 
salinates,  the  errors  of  state  estimates  and  their  one  sigma  error  bounds  for  the  following  statistics: 

fv  ~  0.1.  Vui0)  -  VJ2  -  o.s  and  vl2(0>  ~  o 

The  poorer  performance  of  the  SLF  is  evidently  due  to  Ita  optimistic  estimation  of  error  bounds. 

Figure  2.14  ia  presented  to  demonstrate  the  severe  effects  of  initial  state  estimates  and 
error- variance*  on  the  performance  of  the  SLF  and  MVF.  It  la  quite  evident  that  the  system  designer  most 
be  ear»ful  in  his  selections  of  appropriate  initial  error-variances.  The  extensive  simulation  experience 
of  the  author  suggests  that  in  the  design  of  the  SLF  snd  MVF  for  dynamic  avstsms  with  non' negligible 
nonlinear! tie*,  it  ts  important  not  to  use  owrty-pesa  ini  stir  initial  error-variances,  since  error-variances 
that  were  too  large  could  excessively  damp  the  system  dynamics  and  Kalman  gain  matrix  to  reject  some  of 
the  valuable  measurement  data  inputs. 
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).l  Introduction 


Consider  a  general  das*  of  diacrete-tlme  nonlinear  systems  described  by 

*(h  ♦  i>  ~  f(mik),  hi  ♦  C(sU>»  *Mk>  0.1) 

with  msssursmenu  represented  by 

*(k)  -  hix(k).  k)  *  v<h)  O.l) 

where  the  state  a  is  an  o- vet  tor;  the  essswt  ament  t  an  at- vector;  the  state  noise  sequence  w  ae  r -vector; 
the  measurement  noise  *  as  e-vector;  0  in  a  oon-ltmeer  state-dependent  bar  met r lx;  f  and  b  era. 
respectively,  n-  and  m-dtnemsioael. 


The  random  vectors  v(k)  and  v(k)  are  independent  rero-ewao  white  Cause l«n  sequence#,  for  which 


Jv(w(k3vTiJ)>  -  tjt)  gkj 

O.J) 

^ivtk^ij))  -  tv(h)  dkj 

O.ti 

f^Mkh^O))  -  0 

i).J) 

for  all  integers  k  end  J,  where  L(.)  denotes  the  expectation  operation  conditioned  bald)  —  isifl).  t(l), 
....  atk)>.  sad  T^  ate  r  v  r  and  n  x  i  positive  definite  metric*#,  respectively. 


The  dlecrete-ttme  reellceble  mlelmum  variance  unbiased  estimate  fi(k  *  1/k  »  1)  of  the  etete 
vector  a(fc  ♦  1)  has  base  derived  by  Liang  end  Christensen  (12).  Slue#  their  results  will  be  repeatedly 
weed  throughout  t!d»  paper,  they  are  stated  bare  for  quick  reference. 
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Lama  1:  The  minimum  variance  estimate  of  x(k  ♦  1)  Is  given  by  the  following  set  of  equations: 


8(k  ♦  1/k  +  1)  -  8(k  +  1/k)  ♦  Kfc+1  (z(k  ♦  1)  -  h[x(k  ♦  l),k  ♦  1/k])  (3.6) 

where  we  have: 

8 (k  ♦  1/k)  -  f[x(k),k/k]  -  E^f [x(k) ,kl/k)  (3.7) 

Kk+1  -  E^xfk  ♦  l/x)hT[x{k  ♦  l)/k)}(>v<k  +  1)  ♦  E^hfxtk  ♦  l)/k]hT[x(k  *  l)/k])}-1  (3.8) 

Vj(k  ♦  1/k)  «  [x(k.).k./k]fT[x^k).k/k] }  +  G[x(k).k]Vv(k)CT[x(k),k]  (3.9) 

anil 

V-(k  ♦  1/k  ♦  1)  -Vj(k  ♦  1/k)  -  E)t(h[x(k  ♦  l)/kliT(k  +1/3))  (3.10) 


In  order  to  provide  an  Insight  Into  the  structure  of  Liang  and  Christensen's  realizable  nonlinear 
filtering  algorithm.  In  the  next  section  their  filtering  algorithms  will  be  applied  to  estimate  the  states 
of  a  specific  class  of  ;.-nllnear  discrete-time  systems. 


3.2  Nonlinear  Systems  with  Quadratic  Measurement  Nonlinear! t tes 


In  the  special  case  of  the  state  estimation  problem  for  discrete-time  dynamic  systems  described 
by  equation  (3.1)  with  measurements  represented  by 

°-n) 

where  Er  represents  a  set  of  m  symmetric  n  x  n  matrices.  4,,  ....  $  denote  the  natural  basis  vectors: 

K  1  «  ® 
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f°! 


id 
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-i  .....  -  U, 
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Then  equations  (3.6)  to  (3.3)  can  easily  he  shown  to  be: 

8Ck  +  1/k  ♦  1)  •“  8(k  +  1/k)  ♦  *  D  -  *  l  ^  8T(k  »  l/k)r^tj)8(k  ♦  1/k) 


-  I  *  V  - (k  +  l/k>) 

M  1  *  * 

fi(k  •  1/k)  -  ft*(k),  k/k! 
n 


J  ST 


Vi  -  vi*  *  * 3  ^  *,  a‘‘k  *  "'Ki 

fvvCk  +  1)  *  (H^  +  1  I  4}  STtk  •  l/k)^Ml 

.Va(k  »  l/VX^j  •  1"^  ^  fiT(k  •  i/k)!^}* 


+  £  J  9j  IteceSS^  Va(k  •  l/k)^M  V8<k  •  l/k)M>ji 
3  ♦  ' 


J-i 
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A  numerically  stable  form  of  the  error  variance  equation  is  represented  by t 
VjCk  +  I)  -  u  -  VlsVl  *  2  ^  *T(b  .  l/hH^i 


V* *  i/k)[t .  Vt^S+1  • 


i ;  *4  fiu .  w>olf 

i 


+  Vi'V*  ’  13  *  3  J  ^  «~*{cv+i  Vh  4 


1/k) 


s‘.i  Vk  4 

Hu  equstltiii  for  Vjd  *  1/k)  remained  the  *m»  ss  that  e!  equation  (1.9). 


t«  the  special  case  that  »he  meserge  scltlsf  0.1)  is  represented  by: 


vs  s. 


where  refteeeot*  a  *et  of  a  symmetric  m#  matrices. 

Them  equations  (3.16)  sod  (3.9)  tit  respectively,  further  reduced  to: 

«(k  ♦  i/h>  -«k  t(n/i)  ♦  £  *,(iT(k/h)cJ  Sik/k)  ♦  tree#  Vj(Wk)S) 


(3.12) 


0.0) 

0.14) 


o.m 


0.16) 
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*nd 


®  T  I 

Vj(k  ♦  l/k)  -  lck  ♦  2  l  x  (k/k)G^j\’-(k/k) 


[«fc  ♦  2  £  f  xT(k/k)<^]‘  ♦  ?v(k) 

n  .  _ 

+  £  2  4,  trace  Vj(kA)G^  V-(k/k)j$ 

i.ir1  *  x  £  q 


It  should  be  noted  that  for  state  estimation  problems  with  measurements  containing  quadratic 
functions  of  the  states,  the  realisable  nonlinear  estimation  algorithms  presented  here  are  numerically 
stable.  In  the  special  case  chat  both  Che  message  and  measurement  models  consist  of  some  quadratic 
functions  of  Che  states,  the  results  presented  here  are  identi  si  to  those  derived  by  H.W.  Sorenson  [15]. 


3.3  Simulation  Results  and  Discussions 

3.3.1  State  and  Parameter  Estiaatlon  with  Quadratic  Measurement 


In  order  to  test  and  compare  the  performance  characteristics  of  the  minimum  variance  estimator 
with  those  of  the  extended  Kalman  filter,  two  simple  state  and  parameter  estimation  problems  were  selected. 
The  first  one  considered  here  is  represented  by  the  following  equations: 

x^Ck  +  1)  -  x^k)  -  x^k)  Xj(k)  +  w^ 
x2(k  +  1)  ”  x2(k)  ♦  w2 

z(k)  -  x^(k)  x2(k)  +  v 

where  noise  variances  of  v  ,  »j  and  w2  are  respectively,  4^,  and  4^  . 


The  dynamic  structures  of  the  minimum  variance  filter  and  the  extended  Kalman  filter  are 
represented  in  the  following  equations: 


Extended  Kalman  Filter 

1.  S^k  +  l/k)  -  *1(k)  -  x^fk)  x2(k) 


2.  *2(k  +  l/k) 


S2(k) 


where 


<Kk  +  l/k) 


and  Q 


|-x2(k  +  l/k)  -£j(k  +  l/k) 


Additional  Terms  for  MVF 

‘  V12(k) 

None 


3.  V(k  +  l/k)  -  *(k  +  l/k)  V(k/k)  $  (k  +  l/k)  +  Q  +  JC, 


M 

where 


1  0 


0  0 


and  J 


V12(k)  +  Vll(k)V22(k) 


4.  K  -  V(k  +  l/k)HT  W'1 

where  H  -  [$2<k  +  l/k)  ^(k  +  l/k) ] 

5.  W  -  H  V(k  +  l/k)HT  +  fv 

6.  V(k  +  1)  -  (I  -  KH)V (k  +  l/k) (I  -  KH)T  +  K  y  KT 

7.  i  -  ^(k  +  r/k)S2(k  +  l/k) 

8.  i(k  +  1)  -  8(k  +  l/k)  +  K(*  -  a) 


None 

♦  vj2(k  +  l/k)  +  V.1(k  +  l/k)V22(k  +  l/k) 

+  K[V22(k  +  l/k)  +  Vu(k  +  l/k)V22(k  +  l/k)]KT 

+  v12(k  +  l/k) 

None 


2 

It  is  apparent  that  the  major  difference  between  the  ERF  and  MVF  is  due  to  the  terms  v‘2  +  V22 

*nd  V^.  Therefore  it  can  te  expected  that  the  performance  characteristics  of  these  two  filters  will  be 
greatly  effected  by  the  selections  of  initlel  variances,  in  partlculsr,  the  choice  of  V22(0).  In  order 

to  compare  the  performance  of.  theae  two  filters,  some  typical  results  of  extensive  simulations  are  presented 
in  Figures  3,1  to  3.6.  Here,  each  of  the  graphs  presented  represents  the  average  results  of  150  simulation 
runs. 


Figurs  1  shows  the#kV#f liter  output  responses  for  perfect  a  priori  initiel  state  estimates,  with 
jiasll  nolss  vsriunce*  and  initial  error-variances.  For  this  particular  esse  it  is  obvious  that  the  MVF  is 
only  marginally  better  than  the  SKF. 

Figure  3.2  ahows  the  fleets  of  initial  state  estimates  on  filter  performance  for 

x,(0>  -  x2(0)  -  ?.0,  VU(Q)  -  Vl2(0)  -  V22(0)  -  0.01,  and  Vy  -  Yy  -  0.01 

Vhen  the  Initial  error-variances  are  increased  to  the  value  of  1.0,  simulation  results  for  similar  conditions 
era  shown  in  Figure  J.S,  while  Figu.u  3.4  shows  their  estimator  errors  and  one  sigma  error  bounds.  When 
simulation  results  of  Figures  3.2  and  3.3  if  compared  with  the  outputs  of  the  true  states  presented  in 
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Figure  2.1(a),  It  19  evident  that  the  MVF  Is  much  more  insensitive  to  "he  choice  f  initial  state 
estimate*.  Moreover,  it  is  observed  that  when  the  appropriate  lue?  oi  State  es.-iaate  ■  are  cade 
available,  the  EKF  is  slightly  better  than  the  KVF.  But  when  the  a  ;  state  estimates  are  far 

different  from  the  true  values  of  system  states,  the  HVF  is  -such  superior  than  rve  Su"?  for  foth  large 
and  small  initial  variances.  However,  vhon  the  noise  variances  art  Increased,  the  gap  betueen  the  MVF  and 
IKF  gradually  diminishes.  For  these  cases,  the  EXF  performs  almost  as  veil  as  the  KVF. 

Figures  3.5  and  3.6  further  demonstrates  the  effects  of  initial  variances  V  ,  V  and  V  ,  on 

22  12 

the  performance  of  the  two  filters.  It  is  apparent  that  the  selection  of  initial  V,,  Is  as  important  as 
the  selection  of  V^  and  V^.  it  seems  that  the  selection  of  an  inappropriate  V.^fOi'aay  have  Bore  signifi¬ 
cant  effects  on  the  stability  and  performance  of  both  filters.  From  the  author's  simulation  experience  it 
is  noted  that  in  the  design  of  the  EKF  for  systems  with  non-negligible  nonlinc lritiss,  the  designer  3hould 
be  careful  in  not  using  overly-optimistic  initial  error-variances.  For  the  MVF,  the  designer  should  be 
careful  not  to  use  overly -pessimistic  initial  error-variances,  since  error-variances  that  were  too  large 
could  excessively  enlarge  the  added  measurement  variances.  As  a  result,  valuable  measurement  data  could 
end  up  being  rejected. 


3.3.2  State  and  Parameter  Estimation  With  Linear  Measurement 


Another  simple  numerical  example  that  was  investigated  assumed  the  system  model  of  Section  3.3.1 
with  measurement  given  by 

z(k  ♦  1)  ~  xx(k)  +  v(t) 

Numerical  results  presented  la  Figures  3.7  to  3.10  were  all  obtained  from  150-run  Monte  Carlo 
simulations. 

Figures  3.7  and  3.S  show  the  effects  of  initial  state  estimates  on  the  filter  performance  for  the 
following  set  of  prior  statistics 

”  \.  -  0.01,  Vu(0)  =  V22(0)  -  V12(0)  -  1.0  and  x^O)  -  x2(0)  “  2.0 

The  results  presented  here  are  quite  similar  to  those  pn  rented  in  Figures  3.3  and  3.4.  Therefore,  the 
author's  comments  on  Figures  3.3  and  3.4  are  also  equally  applicable  here. 

To  further  demonr 1 .ate  the  effects  of  initiai  variances  V^,  V22  and  V^2  on  the  performance  of 
the  MVF  and  EKF,  Figures  ■  t..  •  1.10  are  presented  assuming 

xx(0)  -  82(0)  -  -2.0  and  x^O)  -  x.,(0)  =  2.0 

Where  Figures  3.9  and  3.10  assumed  noise  variances  of  0.01  and  1.0,  respectively.  It  is  apparent  that  the 
performance  characteristics  of  the  MVF  and  EKF  are  quite  similar  to  those  of  Figures  3.5  and  3.6,  where  the 
MVF  is  significantly  better  and  more  insensitive  to  the  selections  of  error-variances  than  the  EKF.  Various 
comments  that  were  made  in  Section  3.3.1,  are  equally  applicable  for  this  subsection. 


SECTION  4 CONCLUSIONS 


This  paper  has  presented  a  brief  summary  on  the  comparisons  of  dynamic  structures  for  various 
finite  dimensional  filters.  Extensive  simulation  results  accompanied  with  discussions,  were  presented  to 
compare  the  performance  behaviour  of  some  of  these  filters. 

From  the  extensive  numerical  results  obtained  one  can  derive  several  conclusions,  some  of  the 
most  important  are  stated  here. 

1.  When  the  level  of  noise  inputs  is  large  enough  to  effectively  cover  the  effects  of 
nonlinearities,  no  particular  filter  can  be  said  to  be  consistently  superior  to 
any  other  filter. 

2,  When  the  noise  inputs  are  not  "too  small"  (relative  to  the  effects  of  nonlinearities), 
and  as  long  as  the  a  priori  estimates  are  available,  the  extended  Kalman  filter  can  be 
expected  to  perform  as  well  as  any  other  nonlinear  filter. 

3,  When  nonlinear  effects  are  non-negligible,  the  performance  of  the  realizable 
minimum  variance  filter  is  far  superior  to  any  other  filter  investigated,  it  is 
also  much  more  insensitive  to  the  choice  of  a  priori  estimates. 

4.  In  general,  in  the  design  of  the  EKF  for  dynamic  systems  with  non-negligible 
nonlinearities,  the  designer  should  be  esutious  not  to  select  overly  optimistic 
initial  error-variances.  But.  in  the  design  of  the  MVF,  the  designer  should  be 
cautious  not  to  select  overly  pessimistic  initial  error-variances. 

It  should  also  he  noted  that  for  nonlinear  systems  with  polynomial,  product-type  and  sinusoidal 
nonlinearities,  the  derivation  and  implementation  of  the  MVF  would  only  be  slightly  more  difficult  than  the 
EKF  or  the  SLF,  etc.  But  the  MVF  could  sometimes  be  much  more  accurate  and  stable  than  the  other  estimators 
investigated. 
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FIGURE  2.1  PHASE-LOCK  LOOP  SIMULATION J  AVtf  OUTPUTS  OF  X1  AND  Xj  (5-RUN  AVERAGES) 
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FIGURE  2. 2: PHASE-LOCK  LOOP;  EFFECTS  0?  1(0).  WITH  SittU  Y  AND  LARGER  V 
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FIGURE  2.4  PHASE-LOCK  LOOP  SIMULATION; 61 V4  OUTPUTS  OF  ^  AND  X2  (5-RUN  AVERAGES) 
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FIGURE  2.10:  VAN  PER  POL'S  OSCILLATOR;  EFFECTS  OF  V  WITH  VERY  SMALL  V 
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FIGURE  2. lit  VAN  PER  POL'S  OSCILLATOR;  EFFECTS  OF  f  WITH  SHALL  V 
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FIGURE  2.12:  VAN  PER  POL’S  OSCILLATOR;  EFFECTS  OF  V  WITH  LARGER  V 

X^(0)  —  Xj_(0)  —  2.0  - TRUE  STATE 

X2(0)  -  X^O)  -  0.0  - MINIMUM  VAR.  FILTER 

Vll(0)  =  V?2(0)  “  5,0  -  -  -fOCHASTIC  LINEARIZATION  FILTER 

V12(0)  -  0.0 
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FIGURE  2.14:  VAN  PER  POL'S  OSCILLATOR;  EFFECTS  OF  INITIAL  STATE  AND  ERROR-VARIANCES 
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FIGURE  3.2;  DISCRETE  CASE  WITH  NONLINEAR  MEASUREMENT;  EFFECTS  OF  X(O)  WITH  SMALL  V  AND  V 
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FIGURE  3.3  COMPARISON  OFAvm  OUTPUTS;  EFFECTS  OF  X(0)  WITH  SMALL  T  AMD  LARGER  V 
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FIGURE  3. Si  COMPARISON  OF Av,  OUTPUTS ;  EFFECTS  OF  INITIAL  V  WITH  SMALL  V 
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FIGURE  3.6;  COMPARISON  0F^»  OUTPUTS;  EFFECTS  OF  INITIAL  V  WITH  LARGER  V 
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FIGURE  3.7;  DISCRETE  CASE  WITH  LINEAR  MEASUREMENT ;  EFFECTS  OF  X(0)  WITH  SHALL  ¥  AND  LARGE  V 
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KALMAN  FILTER  SATELLITE  ORBIT  IMPROVEMENT  USING 
LASER  RANGING  MEASUREMENTS  FROM  A  SINGLE  TRACKINO  STATION 


K.F.  Wakker  and  B.A.C.  Ambrosias 
Delft  University  of  Technology 
Department  of  Aerospace  Engineering 
Kluyverweg  1,  Delft,  The  Netherlands 


SUMMARY 


Modern  satellite  ranging  lasers  emit  short  pulses  at  a  low  oeam  divergence  and 
therefore  require  accurate  satellite  posit  Ion  predictions.  This  paper  deals  with  a  study 
to  investigate  the  possibilities  to  use  the  laser  range  observations  acquired  at  only  one 
tracking  station  to  provide  real-time  position  prediction  updates  during  a  pass,  and  also 
better  predictions  for  subsequent  passes  over  that  station.  A  computer  program,  called 
SORKA,  has  been  developed,  which  is  based  on  an  extended  Kalman  filter  scheme.  The 
computational  approach  adopted  in  SORKA  is  described  in  some  detail.  In  particular,  the 
methods  to  compute  the  state-transition  matrix  and  the  state-noise  covariance  matrix  are 
emphasised.  Typical  divergence  phenomena  arising  from  processing  accurate  range-only 
measurements  are  discussed  and  the  effectiveness  of  techniques  to  reduce  these 
instabilities  is  demonstrated.  Laser  range  measurements  acquired  during  8  successive 
passes  of  GEOS-1  over  the  Kootwijk  groundstatlon  have  been  processed  and  some  results  are 
presented. 


1.  INTRODUCTION 


Since  August  1976  the  Department  of  Geodesy  of  Delft  University  or  Technology  operates 
a  satellite  laser  ranging  system  at  Kootwijk  in  the  Netherlands.  Prom  there,  the  Working 
Group  for  Satellite  Geodesy  (WSO)  acquires  on  a  routine  basis  day  and  night  ranging  data 
of  geodetic  satellites.  Within  the  period  1976  to  1979  more  than  99,500  observations  have 
been  obtained  at  Kootwijk  during  1108  pasaea  of  BRACON-C,  QEOS-i ,  QEOS-2,  QEOS-3,  SKASAT, 
STARL8TTR  and  LAOEQS  (Ref.  1).  Ail  these  satellites  encircle  the  earth  i«  near- circular 
orbits  between  800  km  and  2300  km  altitude,  exeept  for  LAQEOS  whose  altitude  is  about 
5900  km.  In  1980,  about  13,500  observations  were  acquired  during  370  passes  of  0E0S-3, 
STARLETTB  and  LAG EOS.  The  ruby  pulse-laser  system  consists  of  a  multi-mode  0-switched 
oscillator,  a  spark-gap  activated  pulse  chopper  and  two  amplifier  stages  (Refs.  1,  2). 
The  output  energy  in  routine  operation  la  l  to  2  J,  with  a  maximum  of  3  J.  The 
tranamlttcd  User  beam  has  a  diameter  of  19  ee  and  the  divergence  is  adjustable  Trom  1  to 
20  arcmlnutea.  Untill  autumn  1$80,  the  ayatem  generated  9  ns  wide  pulsea  at  a  maximum 
rate  of  15  pulses  per  minute,  producing  measurements  with  an  accuracy  or  about  25  cm 
root-aean-aquare.  Recently,  the  accuracy  level  has  been  improved  to  about  15  cm,  by 
reducing  the  puiae  width  to  2  ns.  In  addition,  a  new  range-gate  generator  has  been 
installed,  having  a  manually  adjustable  time  window  with  a  minimum  half-width  of  0.1  us. 
This  will  poaaibly  give  the  capability  of  ranging  also  to  LAQEOS  in  daytime. 

For  many  years  there  exists  a  close  cooperation  between  WSG  and  the  Section  Orbital 
Mechanics  (S0K)  of  the  Department  of  Aerospace  Engineering  of  Delft  University  of 
Technology.  Thia  Section  supports  W$Q  in  the  field  of  orbit  computations  for  the  geodetic 
satellites  used  in  the  laser  ranging  activities.  The  support  ranges  from  satellite 
position  predictions,  needed  for  the  automatic  pointing  of  the  laser,  to  orbit 

determination  and  geophysical  parameter  estimation  from  laser  observations  acquired  at 
Kootwijk  and  other  laser  ranging  stations.  To  increase  the  accuracy  of  the  laser 
pointings,  it  was  decided  In  1978  to  Investigate  the  possibilities  to  use  laser 
observations  from  Kootwijk  in  a  (semi-}  real-time  mod*.  This  paper  describes  the 
preliminary  results  of  that  study. 

2.  LAGER  FPtHTlKG  FUED10T1QM8 

For  the  routine  operations  of  pointing  the  laser  at  a  satellite,  at  present  use  Is 
made  of  the  AIHLA3ER  computer  program,  developed  at  the  Smithsonian  Astrophystcal 

Observatory  (SAO),  Cambridge,  Massachusetts.  This  program  has  been  modified  by  SOM  to 
satisfy  the  specific  needs  of  WSG  and  is  regularly  improved  and  updated.  This  Delft- 
version  of  A1NLASER  (Ref,  3)  is  also  in  use  at  a  number  of  other  European  laser  stations. 
The  Input  for  the  orbit  prediction  program  consists  of  •  set  of  mean  orbital  elements, 
distributed  weekly  by  SAO.  These  paraueters  are  determined  by  SAO  from  the  so-called 
quick-look  laser  ranging  data  as  returned  to  SAO  by  many  groundstations  distributed  all 
over  the  world.  Experience  has  shown  that  quite  often  the  satellite  position  predictions 
on  btsis  of  ‘.Hcee  SAO  elements  and  us!»W5  the  AINLASBR  progress  reach  a  level  of  inaccuracy 

which  is  Incompatible  with  lasers  emitting  a  low-divergence  beam.  To  give  an  Indication 

of  the  accuracy  needed,  consider  a  satellite  which  passes  over  Kootwijk  at  a  distance  of 
1000  km.  Mien  the  beam  divergence  la  2  arcmlnutea,  the  diameter  of  the  beam  at  the 
altitude  of  the  satellite  Is  580  ».  So,  in  thin  case  a  position  prediction  accuracy  of 
about  300  a  la  needed  in  order  to  guarantee  that  the  satellite  is  hit  by  the  laser  pulse. 

To  correct  for  position  prediction  errors,  the  Kootwijk  laser  system  has  been  equipped 
with  a  manually  controlable  firing-time  adjustment  switch.  During  nighttime  passes,  If 
the  satellite  Is  Sunlit,  the  operator  may  look  through  the  laaer  telescope  In  order  to 
estimate  by  what  amount  the  firing  time  has  to  be  delayed  or  advanced.  Thia  la  possible 
for  satellites  up  to  a  visual  magnitude  of  *13.  Usually,  however,  if  at  the  beginning  of 


a  pass  no  return  signal  is  received,  a  systematic  variation  of  the  time  adjustment  is 
applied  until  returns  are  being  registered.  Sometimes,  the  adjustment  can  remain  constant 
during  a  pass,  but  on  other  occasions  re-adjustments  are  necessary.  By  this  technique, 
the  operator  in  fact  corrects  for  position  errors  in  the  direction  of  the  satellite's 
path  across  the  sky.  It  has  been  found  (Ref.  H)  that  a  considerable  improvement  of  the 
score  of  laser  returns  is  obtained  by  this  manual  control  of  the  laser  firing  time. 

One  of  the  main  disadvantages  of  this  technique  is  that  it  corrects  for  ln-track 
position  errors  only,  while  errors  perpendicular  to  the  satellite's  path  cannot  be 
corrected  for.  It  actually  has  occurred  that  the  observer  saw  through  the  telescope  that 
the  satellite  was  too  far  from  the  predicted  track  without  being  able  to  apply 
corrections.  Therefore,  a  need  exists  for  more-precise  position  predictions.  This  need 
becomes  even  more  stringent  with  the  development  of  more-advanced  laser  systems  operating 
with  a  smaller  beam  divergence.  But  there  13  still  another  need  for  more-accurate 
predictions.  To  minimize  the  chances  of  false  triggerings  by  noise  pulses,  a  laser  system 
uses  a  range-gate  generator,  which  determines  the  time  window  in  which  the  return  signal 
is  expected.  For  daytime  ranging  to  distant  satellites,  like  7.AGE0S,  very  short  windows 
of  up  to  0.1  pa  are  required.  But  this  Implies  that  the  radial  distance  to  the  satellite 
has  to  be  predicted  with  an  accuracy  of  better  than  30  m. 

To  Increase  the  accuracy  of  the  predicted  satellite  positions  a  number  of 
possibilities  exists.  For  instance,  it  would  be  possible  to  replace  the  SAO  elements  by 
more-accurate  orbital  parameters  and  to  use  a  satellite  position  prediction  program  that 
is  more  accurate  than  AIMLASER.  The  primary  reason  for  the  SAO  elements  being  sometimes 
inaccurate  is  that  they  are  based  on  the  quick-look  ranging  data,  which  may  be  rather 
sparse  during  some  periods  and  which  may  contain  systematic  or  gross  errors.  Therefore, 
an  approach  may  be  adopted  as  applied  by  the  University  of  Texas  at  Austin  for  the 
prediction  of  LAGEOS.  In  that  case,  several  weeks  of  laser  observations  from  a  number  of 
groundstatlons  are  used  to  determine  the  orbit  of  the  satellite  very  accurately.  From 
this  orbit  determination  an  extremely  accurate  orbit  is  extrapolated  for  a  long  time  in 
advance  to  yield  one  or  more  state  vectors  for  each  day.  From  these  state  vectors,  the 
station  generates  its  own  laser  pointing  angles  and  computes  the  satellite's  distance  by 
numerical  integration  of  the  equations  of  motion.  At  Kootwijk,  research  is  going  on  along 
this  line  to  improve  the  predictions  for  LAGEOS.  It  is  doubtful,  however,  if  such  a 
technique  will  yield  prediction  accuracies  for  satellites  below  1500  km  altitude  that 
satisfy  the  needs  of  the  narrow-beam  laser  stations. 

Therefore,  it  is  felt  that  it  would  be  attractive  to  process  in  real  time  the  laser 
range  measurements  regi  tered  during  a  pass  over  a  station  in  order  to  increase  the 
accuracy  of  subsequent  laser  pointings  and  window  settings  during  that  pass.  This  means 
that  all  computations  have  to  be  performed  on  an  on-line  computer  within  the  period 
between  two  successive  laser  firings.  To  start  up  the  process  it  1b  required  that  at  the 
beginning  of  a  pass  observations  are  actually  acquired.  Fortunately,  at  low  elevation 
angles  the  slant  range  to  the  satellite  is  large  so  that  there  is  a  good  chance  that  the 
satellite  is  within  the  laser  beam,  even  if  the  position  predictions  are  relatively  bad. 
After  the  first  observations  have  been  processed  and  the  accuracy  of  the  poeltion 
prediction  has  Increased,  the  beam  divergence  and  the  time  window  setting  may  be 
narrowed.  In  case  the  predictions  are  so  bad  that  no  returns  are  received,  a  systematic 
search  procedure  may  be  executed  at  the  beginning  of  a  pass.  This  type  of  closed-loop 
tracking  is  primarily  applicable  to  stations  with  medium-  to  high-power  lasers,  since 
only  then  sufficiently  wide  beams  can  be  used  to  guarantee  laser  returns. 

Recent  observations  from  a  station  can  also  be  used  in  an  off-line  modo  to  improve  the 
orbit  and  by  that  the  laser  pointing  angles  for  subsequent  passes.  This  also  makes  the 
station  less  dependent  on  a  distant  large  computing  center.  This  aspeot  could  be  of  great 
importance  for  the  mobile  laser  ranging  systems.  These  systems,  however,  are  designed 
around  low-power  lasers  (about  10  mJ  per  shot)  operating  with  a  berm  divergence  of  about 
0.5  arcminute  and  a  high  repetition  rate  of  about  10  pulses  per  second.  Signal  levels  are 
on  the  average  below  one  photoelectron  per  shot  for  LAGEOS  and  in  the  order  of  ten  photo¬ 
electrons  per  shot  for  satellites  in  lower  orbits.  As  a  result,  large  numbers  of  false 
returns  are  recorded  which  need  to  be  identified  in  order  not  to  upset  the  orbit 
Improvement  scheme.  Whether  this  identification  will  be  possible  during  a  satellite  pass 
over  the  station  still  has  to  bo  investigated. 

The  main  objective  against  the  real-time  laser  pointing  update  approaoh  is  the  faot 
that  from  the  orbital  mechanics  point  of  view,  it  is  very  precarious  to  use  range 
measurements  from  only  one  groundstation  to  improve  the  satellite's  orbit.  It  is  evident 
that  from  such  observations  during  one  pass  very  little  information  is  gained  on  the 
orientation  of  the  orbital  plane.  Therefore,  the  basic  scheme  presently  envisaged  is  that 
previous  range  measurements  obtained  during  several  passes  over  the  tracking  station 
yield  the  orientation  of  the  orbit  to  such  a  level)  of  accuracy  that  during  the  next  pass 
the  range  measurements  can  be  used  in  real  time  mainly  to  improve  the  prediction  of  the 
satellite's  position  in  tne  orbital  plane.  During  the  very  first  part  of  a  pass  the  laser 
mount's  angular  position  read-outs  can  possibly  also  be  used  as  observation  quantities. 
Though  these  angles  will,  in  general,  contain  systematic  errors,  the  inclusion  of  these 
quantities  for  the  whole  tracking  period  as  quasi-observations  with  a  relatively  low 
weight  might  also  be  attractive  to  stabilize  the  orbit  improvement  process.  Nevertheless, 
it  is  anticipated  that,  periodically,  after  long  observation  gaps,  additional  orbital 
information  will  be  required  which  is  derived  from  tracking  data  aoquired  at  other 
groundstatlons. 
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3.  THE  EXTENDED  KALMAN  FILTER 

It  Is  well  known  that,  basically,  there  exist  two  methods  for  processing  satellite 
tracking  data  to  estimate  the  orbit  of  a  satellite.  The  batch  processor,  which  yields  the 
estimate  at  some  reference  epoch,  requires  that  the  entire  sequence  of  observations  be 
processed  before  the  estimate  can  be  made.  On  the  other  hand,  the  sequential  orbit 
determination  procedure  processes  one  observation  at  a  time  and  produces  an  estimate  of 
the  state  vector  at  the  observation  time.  So,  this  sequential  procedure  suits  best  the 
requirements  for  real-time  orbit  improvement,  although  it  can  also  be  used  to  predict  the 
next  satellite  pass  over  the  station.  While  the  formal  mathematical  equivalence  between 
the  sequential  estimation  algorithm  and  the  batch  estimation  algorithm  can  be  shown,  it 
is  known  from  practical  experience  that  the  sequential  process  is  much  more  sensitive  to 
errors  introduced  by  linearizations,  which  may  result  in  estimate  divergence  problems. 

The  technique  selected  in  this  study  is  the  extended  Kalman  filter,  which  is  described 
in  many  textbooks  (e.g.  Ref.  5).  An  interesting  geometric  derivation  of  the  Kalman  filter 
equations  is  given  in  Ref.  6.  Applications  of  the  extended  Kalman  filter  to  orbit 
dynamics  problems  are  discussed  extensively  in  Refs.  7-1*1  •  For  this  study  a  computer 
program,  called  SORKA  (Satellite  .Orbit  .Refinement  using  a  KAlman  filter)  has  been 
developed,  which  at  the  moment  functions  primarily  as  a  test  program.  Therefore,  it 
contains  at  present  different  options  and  alternatives  to  investigate  in  detail  the 
Kalman  filter  characteristics  for  this  application.  It  is  emphasized  here  that  the  aim  of 
SORKA  is  not  to  reach  the  high  accuracy  level  and  the  extensive  capabilities  of  computer 
programs  like  GEODYN  (NaSA)  or  UTOPIA  (Univ.  of  Texas),  but  merely  to  satisfy  the 
accuracy  level  needed  for  laser  pointings  and  to  be  compatible  with  small  local 
computers.  Until  now,  SORKA  runs  on  an  IBM  370/158  computer,  but  in  the  design  of  the 
program  precautions  have  been  taken  to  make  the  implementation  of  a  stripped-down  version 
on  a  small  local  computer  possible.  To  elucidate  the  description  of  the  computational 
approach  adopted  in  SORKA,  a  brief  outline  of  the  filter  scheme  will  be  given. 

The  satellite  motion  can  be  described  by  the  set  of  equations 

X  -  F(X,  t);  X(tQ)  -  ^  (1) 

where  X.  denotes  the  state  vector.  When  is  specified  and  F  is  known,  the  orbit  of  the 

satellite,  and  thus  its  state  at  a  later  time,  can  he  obtained  from  Integration  of  Eq. 
(1).  In  orbit  determinations,  X*  is  not  known  perfectly  and  therefore  observations  of  the 
motion  must  be  processed  to  obtain  a  more-accurate  estimate  of  the  state  vector.  Assume 
that  a  best  estimate  of  the  state  at  Is  known  with  an  accuracy  represented  by  the 
covariance  matrix  PQ.Tn  the  time-update  step,  Eq.  (1)  is  Integrated  to  yield  an  initial 
state  estimate  at  tne  time  of  the  first  observation  ti  :  X^.  Its  covariance  matrix,  Pj, 
is  computed  from 

P1  "  *1,0  P0  ♦lfo  +  Ql,0 

where  ♦,  Is  the  state-transition  matrix,  used  to  map  the  state  from  t0  to  t.  and 
defined 1by 


and  Qj  0  is  the  state-noise  covariance  matrix,  representing  errors  occurring  during  the 
state  Sector  integration.  Those  errors,  which  are  assumed  in  this  study  to  be  Gaussian 
distributed  with  a  zero  mean,  include  such  effects  as  deterministic  model  errors, 
numerical  integration  errors  and  random  process  noise. 

In  the  subsequent  observation-update  step  the  Kalman  filter  adds  the  information  of 
the  observations  at  ti  to  the  initial  eatimatea,  yielding  more-accurate  estimates  Xi 
and  Pi.  For  this,  in  SORKA  the  general  non-linear  relations  between  the  measured 
quantities  and  the  state  veotor  are  used: 

Z  ■  G(X,  t)  ♦  V  (2) 

where  the  stoohastlo  veotor  V  represents  Qausslan  distributed  measurement  errors  with  a 
zero  mean  and  a  covariance  aaTrlx  R.  The  observation  residuals  at  t1  :  jt^,  are  defined  by 

*i  “  h  ‘  h 

where  j5i*  denotea  the  predicted  observations  as  computed  from  Eq.  (2)  and  the  initial 
estimate  *. 

«!*  -  Otix.  V 

The  new  estimate  X^  la  given  by 

ix  -  ♦  Kj  &x  (3) 

where  is  the  Kalman  gain  matrix,  satisfying 

Ki  *  pi  8iT<hi  pi  hiT  *  fti >‘l 
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and  Is  the  observation  matrix,  defined  by 


So,  the  matrix  H  Is  evaluated  for  the  most  accurate  state  estimate  available  at  this 
stage:  the  reference  state  X^.  The  state  covariance  matrix  at  corresponding  to  the 
optimal  state  estimate  can  be  found  from 

Pj  -  (I  -  Kj  Hj)  ?x 

where  I  is  a  unit  matrix  of  appropriate  dimensions.  Once  and  are  known,  the  same 
process  is  repeated  to  compute  the  state  and  the  state  covariance  matrix  at  the  time  of 
the  next  observation.  In  this  way,  the  best  estimate  at  any  time  contains  the  information 
of  the  last  and  all  previous  observations. 

The  scheme  given  above  holds  for  all  types  of  observations.  In  this  study,  the 
observations  are  ranges  from  the  laser  to  the  satellite,  but  SORKA  has  been  designed  such 
that  in  the  future  also  azimuth  and  elevation  observations  can  be  dealt  with.  A3  a 
reasonable  assumption,  rar.^e,  azimuth  and  elevation  can  be  considered  independent 
observations.  Then,  a  computational  simplification  is  possible,  permitting  the  individual 
observations  at  each  observation  time  to  enter  the  algorithm  one  after  another  as 
scalars.  In  that  case,  the  term  in  brackets  in  Eq.  (4)  reduces  to  a  scalar.  So,,  the 
inversion  of  this  term  reduces  to  a  division  by  a  scalar,  which  avoids  numerical  problems 
which  may  occur  in  matrix  inversions. 

For  each  time-update  step,  the  function  F,  the  state-transition  matrix,  ♦,  and  the 
state-noise  covariance  matrix,  <3,  have  to  be  evaluated.  For  the  integration  of  the  state 
equations,  Eq.  (1),  a  relatively  simple  force  field  has  been  adopted.  During  a  pass  only 
the  first  five  zonal  harmonies  (J2  to  and  the  first  tesseral  harmonic  (J2  2^  of  the 
Legendre  series  expansion  for  the  earth's  gravity  field  (geopotential)  are  accounted  for. 
For  the  state  integration  between  subsequent  passes  a  more-extended  gravity  model  is 
used,  including  terms  up  to  Jc  g.  The  dynamical  equations  are  integrated  with  a  fourth- 
order  Runge-Kutta  method,  Tfte  s.tepsize  depends  on  the  time  between  subsequent 
measurements  and  has  an  upper  limit  of  to  a.  The  computation  of  the  state-transition 
matrix  and  the  atate-nolse  covarianoe  matrix  is  descrioed  in  the  following  Sections.  In 
each  observation-update  step,  the  function  0  and  the  matrices  R  and  H  are  required.  The 
measurement-noise  covarianoe  metrix  is  assigned  the  values  of  the  known  measurement  noise 
variances.  The  observation  matrix  ia  evaluated  at  a  reference  state,  usually  taken  as 
In  Section  6  an  alternative  for  thia  reference  state  will  be  introduced.  """* 

<S.  THE  STATE-TRANSITION  MATRIX 

From  Eq.  (1)  a  differential  equation  can  be  derived  for  the  state-transition  metrix: 

♦t,o  *  (if)xt  *t,0  i  *0,0  '  1  (5) 

These  so-called  variational  equations  can  be  Integrated  numerically,  together  with  the 
state  equations.  An  alternative  approach  la  to  avoid  the  use  of  ♦  and  to  dlreotly 
integrate  a  matrix  Riecati  differential  equation  in  P  (Refa.  9,  10),  Such  a  direct 
numerical  Integration  provides  a  mathematically  rigorous  solution  and  Is  readily 
applicable  in  the  presence  of  all  types  of  disturbing  forces.  The  method,  however, 
necessitates  the  simultaneous  integration  of  a  large  aet  of  differential  equations,  thus 
making  the  procedure  time  consuming.  As  SORKA  is  designed  for  real-time  operations,  it 
was  therefore  decided  to  compute  the  state-transition  matrix  In  some  approximating 
analytical  way.  It  is  realised  that  this  approach  may  entail  subtle  but  Important  effeota 
which  Impair  the  aaeqraoy  of  the  results,  in  particular  when  the  analytical  expression* 
for  the  computation  of  a  require  a  amplification  of  the  force  field,  whiLs  for  the  state 
integration  a  more-extensive  force  field  is  used.  However,  studies  described  in  Refa.  15, 
16  hint  that,  in  general,  a  truncation  of  the  force  field  Is  permissible,  provided  at 
least  J2  18  explicitly  included  In  the  variational  equations.  For  SORKA,  two  alternative 
analytical  technique#  have  been  explored. 

The  flrat  method  can  only  be  applied  if  the  period  between  successive  measurements  ia 
relatively  short.  It  Is  based  on  the  assumption  that  the  expressions  for  the  gradient  of 
the  geopotential  can  be  llnearited  over  the  time  interval  between  successive 
measurements.  Then,  rrom  Eq.  (5)  the  approximative  relation 

*1,0  *  *0  *1,0 

can  be  derived  (Ref.  17),  where  the  matrix  Kq  can  be  partitioned  into  four  sub-matrices: 
two  being  a  3x3  null  matrix,  one  a  3*3  unit  matrix  and  one  a  3x3  matrix  containing  the 
second-order  partial  derivatives  of  the  geopotentlal  with  respect  to  the  X,  Y  and  Z 
components  or  the  state  vector.  Integration  or  Eq.  (6)  leads  to  an  exponential  function 
which  can  bo  approximated  by  the  aeries  expansion 

♦l,0  *  1  *  *0  *  7  <*o  At)V  +  £  <"o  4t)3  *"••• 

whera  At  «  t.  -  t0.  Analytical  expression*  have  been  derived  {Ref.  V?)  for  the  sooond- 
order  derivatives  of  the  geopotential  where  the  effeots  of  the  tonal  harmonics  J2  upto  Jt 


and  the  first  tesseral  harmonic  J2  2  are  accounted  for.  These  expressions,  as  well  as  the 
expressions  for  the  first-order  partial  derivatives,  which  are  required  for  the  state 
integration,  were  composed  by  applying  the  REDUCE  formula-manipulation  computer  program, 
which  is  available  on  the  DEC-10  computer  of  Twente  University  of  Technology.  Though  this 
analytical  technique  of  computing  $  has  been  implemented  in  SORKA  and  has  been  tested 
extensively,  for  the  computations  described  in  this  paper  a  second  analytical  technique 
has  been  used  exclusively,  since  it  consumes  less  computer  time. 

This  second  technique  was  developed  primarily  for  the  computation  of  the  state- 
transition  matrix  for  longer  periods  between  two  successive  measurements,  such  as  between 
two  different  passes.  The  analytical  expressions  for  the  elements  of  the  state-transition 
matrix  are  obtained  by  applying  the  chain-rule  to  the  relations  that  exist  between  the 
state  vector  and  the  orbital  elements  at  any  time,  and  to  the  relations  between  the 
orbital  elements  at  a  time  t^  and  those  at  tQ: 
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where  _£°8C  denotes  the  vector  of  osculating  orbital  elements.  The  matrices  in  brackets 
can  be  derived  easily  from  the  geometrical  expressions  describing  a  Xeplerian  orbit 
relative  to  the  non-rotating  geocentric  reference  frame  (e.g.  Ref.  18).  To  avoid  the 
classical  problems  for  orbits  with  very  low  eccentricity  or  inclination,  in  principle  the 
use  of  non-singular  orbital  elements  is  preferable.  It  was  demonstrated  in  Ref.  19, 
however,  that  in  practice,  if  the  computations  are  performed  on  a  computer  with  a 
reasonable  word-length,  the  problems  occur  only  at  extremely  low  values  of  eccentricity 
or  inclination.  For  a  word-length  of  64  bits.,,  for  example,  the  classical  elements  can  be 
used  if  e  and  sin(i)  are  larger  than  10“  .  For  simplicity,  therefore  the  classical 
elements  were  adopted  in  SORKA  to  compute  the  state-transition  matrix.  To  guarantee  that 
singularity  problems  will  never  occur,  a  simple  engineering  measure  has  been  introduced 
which  precludes  that  e  and  sin(i)  can  take  values  smaller  than  10“10. 

It  has  been  shown  in  Ref.  19  that,  for  the  application  described  in  this  paper,  the 
matrix  ajo,  c/9Pq  can  be  computed  with  sufficient  accuracy  by  neglecting  short-periodic 
and  long-periodic  perturbations  in  the  osculating  elements,  and  by  taking  into  account 
only  the  secular  perturbations  due  to  the  oblateness  of  the  earth  (Jp).  If  the  orbital 
altitude  is  less  than  400  km,  which  is  very  unlikely  for  geodetic  satellites,  also  the 
secular  perturbations  due  to  atmospheric  drag  should  be  accounted  for.  Analytical 
expressions  were  derived  to  compute  the  matrix  when  only  these  two  types  of  perturbations 
are  considered.  In  SORKA  only  the  J2  secular  perturbations  are  taken  into  account  when 
computing  the  middle  matrix  in  Eq.  (7).  A  further  simplification  was  introduced  by 
substituting  mean  Instead  of  osculating  elements  in  the  expressions  for  the  matrices. 
These  mean  elements  are  obtained  by  subtracting  the  short-period  J2  contribution  from  the 
osculating  elements,  according  to  the  non-singular  elements  conversion  technique 
described  in  Ref.  20.  So,  the  computation  sequence  is  as  follows.  From  the  state  estimate 
at  the  start  of  the  filter  process,  t n,  the  osculating  elements  are  computed.  These  are 
converted  into  mean  elements.  The  values  of  the  orbital  elements  at  the  start  of  the 
first  pass,  t^,  are  computed  by  adding  only  the  secular  perturbations  (e.g.  Ref.  21)  to 
the  mean  elements  at  tp.  Next  the  matrix  tPp/SXn  is  computed,  applying  the  usual 
transformation  relations  between  (osculating)  oroitar  elements  and  position  and  velocity 
(e.g.  Ref.  18).  Subsequently,  the  matrioes  8£i/6£o  and  8X./d£,  are  computed.  Matrix 
multiplication  finally  yields  8X,/9Xp.  For  subsequent  gaps  between  passes,  the  same 
prooese  is  repeated,  but  then  tp  "is  tne  time  of  the  last  observation  of  the  previous  pass 
and  t^  becomes  the  time  of  the  first  observation  of  the  next  pass.  If  this  technique  is 
applied  for  the  computation  of  the  state-transition  matrix  during  a  pass,  ti  simply 
becomes  the  time  of  the  next  observation.  Although  there  are  numerous  approximations  in 
this  approach,  it  was  found  (Ref.  22)  that  they  hardly  affect  the  results. 

5.  THE  STATE-NOISE  COVARIANCE  MATRIX 

It  is  well  known  that  Kalman  filter  applications  often  suffer  from  state  estimate 
divergence  problems.  In  principal,  these  are  a  result  of  non-linearities,  errors  due  to 
an  Incomplete  mathematical  model  8hd  to  a  lesser  extent  also  of  computational  truncation 
and  round-off  errors.  Physically,  the  state  divergence  can  be  explained  as  follows.  When 
during  the  observations  processing  the  state  estimates  beoome  more  aocurate,  and  henoe 
the  oovarianoe  matrix  becomes  smaller,  new  observations,  which  reflect  the  true  state, 
will  yield  progressively  smaller  corrections.  If  there  are  too  many  gross  measurement 
errors  or  if  there  is  any  error  in  the  dynamical  model  that  is  not  accounted  for  properly 
by  the  assumed  model  errors,  represented  by  the  state-noise  covarianoe  matrix  Q,  then 
successive  estimates  of  the  state  may  tend  to  follow  an  erroneous  course  and  to  diverge 
from  the  true  state.  Consequently,  the  estimated  state  oovarianoe  matrix  fails  to 
represent  the  true  estimation  error. 


In  the  time-update  step  the  state  vector  and  its  oovarianoe  matrix  are  integrated. 
During  this  Integration,  errors  will  be  Introduced  due  to  dynamio  modeling  errors  and 
integration  errors.  Generally,  the  errors  will  be  non-random.  Methods  have  been  developed 
(e.g.  Refs.  8,  9t  H*  12,  23)  to  aooount  for  the  model  errors  in  some  way.  For 
computational  simplicity,  in  SORKA  the  crude  assumption  has  been  made  that  the  errors  are 
random  and  can  be  handled  by  a  proper  ohoioe  of  the  oovarianoe  matrix  Q.  A  suitable 
choice  for  Q  that  prevents  filter  divergence  has  to  come  from  experience  gathered  during 
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tests  on  the  filter*  performance.  In  SORKA,  two  methods  are  used  to  compute  the  state- 
noise  covariance  matrix. 

For  the  short  time  Intervals  between  successive  laser  observations  during  a  pass,  the 
computation  Is  based  on  the  assumption  that  the  unmodeled  forces  acting  on  the  satellite 
yield  accelerations  that  have  the  same  root-mean-square  value  In  all  three  coordinate 
directions.  At  present,  a  value  of  2.5  m/s/day  is  used.  From  these  accelerations,  the 
standard  deviations  of  the  velocity  errors  after  a  time-update  step  are  found  by 
multiplying  the  acceleration  by  the  length  of  the  time  Interval.  These  three  standard 
deviations  of  the  velocity  are  the  only  components  of  Q  that  are  used.  The  standard 
deviations  of  the  position  error  after  the  next  time-update  step  evolve  from  these 
components  through  the  state-transition  matrix.  It  can  be  argued  physically,  and  it  was 
demonstrated  by  numerical  experiments  (Ref.  22),  that  model  errors  are  of  minor 
importance  during  passes.  The  filter  process  was  shown  to  be  rather  insensitive  to 
relatively  large  variations  of  the  values  selected  for  the  unmodeled  accelerations.  This' 
is  due  to  the  fact  that  force  model  errors  cannot  build  up  large  effects  during  a  pass 
and  that  non-linearity  errors  mostly  dominate  the  state  estimation  (Section  6). 

For  the  integration  interval  between  successive  passes,  a  different  approach  for 
computing  Q  has  been  selected.  Since  the  state  errors  are  mainly  due  to  the  truncation  of 
the  gravity  field  model  used  for  the  integration  of  the  state  vector,  an  upper  bound  for 
the  magnitude  of  these  errors  can  be  estimated.  In  the  current  study,  where  observations 
of  only  one  tracking  station  are  processed,  these  errors  will  be  considerably  smaller 
than  indicated  by  that  upper  bound.  This  is  because  in  that  case  the  satellite  traverses 
during  the  passes  always  (nearly)  the  same  spatial  region^  of  the  gravity  field,  and  only 
a  fraction  of  the  total,  mainly  periodic,  variation  of  the  orbital  parameters  due  to 
unmodeled  gravitational  forces  has  to  be  accounted  for  in  Q.  After  a  number  of  tests  for 
satellites  at  altitudes  of  1000  km  to  2000  km,  fixed  values  were  selected  for  the  along- 
track,  cross-track  and  radial  position  and  velocity  errors  equal  to  20  m  and  2  cm/s, 
respectively.  In  addition,  a  few  dominant  correlations  have  been  introduced  in  the  state- 
noise  covariance  matrix  in  cross- track,  radial  and  along-track  components.  Finally,  an 
orthogonal  transformation  is  applied  to  obtain  the  corresponding  matrix  in  X,  Y  and  Z 
components.  The  Q-matrix  Is  then  added  to  the  state  covariance  matrix,  P,  which  has  been 
Integrated  in  one  step  over  the  complete  time  Interval  between  the  passes,  using  the 
analytically  computed  state-transition  matrix. 

6.  EFFECTS  OF  NON-LINEARITY  ERRORS 


The  Kalman  filter  technique  has  been  developed  for  linear  systems.  When  the  filter  is 
applied  to  the  highly  non-linear  equations  encountered  in  orbit  dynamics,  approximations 
linearized  about  a  reference  state  are  used.  Because  in  SORKA  the  full  equations  of 
motion  are  integrated  numerically  and  the  general  non-linear  relations  are  used  for  the 
computation  of  the  measured  quantities  from  the  state  vector,  linearizations  only  occur 
in  the  integration  of  the  state  covariance  matrix,  P,  and  in  the  Kalman  algorithm  for  the 
observation-update  step.  In  Section  4,  it  has  been  pointed  out  that  the  filter  process  is 
not  very  sensitive  to  the  method  used  for  the  integration  of  P.  So,  linearization  errors 
will  mainly  be  introduced  in  the  observation-update  step. 

During  testruns  with  simulated  laser  range  observations  of  QEOS-3  (Section  8),  it  was 
found  (Ref.  24)  that  after  the  first  few  range  observations  had  been  processed, 
excessively  large  state  corrections  occurred,  although  the  initial  residuals  were  found 
to  be  relatively  small,  while  the  data  were  known  to  contain  no  gross  measurement  errors. 
Normally,  one  would  expect  the  state  corrections  to  decrease  gradually  as  more 
observations  are  processed,  because  the  state  estimates  become  more  aoourate.  The 
testruns  were  started  with  an  input  state  vector  corrupted  with  noise  of  500  m  standard 
deviation  for  the  position  components  and  0.5  m/s  for  the  velocity  components. 

To  investigate  why  the  state  correotlons  did  not  decrease  gradually,  the  behavior  of 
the  Kalman  gain  matrix,  K,  was  studied  in  more  detail.  Since  the  Initial  residuals  were 
relatively  small,  only  this  matrix  could  cause  the  large  state  corrections.  As  the 
linearizations  introduced  through  the  observation  matrix,  H,  were  suspected  to  be 
responsible  for  this  filter  behavior,  a  test  was  performed  in  which  the  effeots  of 
changes  in  the  reference  state  on  the  gain  matrix  were  studied.  As  only  range 
measurements  are  processed,  the  observation  matrix  and  the  gain  matrix  reduce  to  a  row 
matrix  and  a  oolumn  matrix,  respectively.  This  simplifies  the  Interpretation  of  the 
results.  The  predicted  state  vector  and  its  oovarianoe  matrix  were  extracted  out  of  a 
SORKA  simulation  run  at  the  time  of  the  fourth  observation,  when  problems  first  ooourred. 
Then,  the  position  elements  of  the  state  veotor  were  varied  systematically  by  applying 
changes  which  were  proportional  to  the  state  correction  veotor  Kz_,  as  computed  by  SORKA 
at  that  observation  time.  This  was  done  to  insure  that  the  applied  changes  of  the 
reference  position  veotor  were  in  the  same  direction  as  the  nominal  position  correction 
oomputed  by  SORKA.  The  velooity  components  were  not  considered  in  the  evaluation  because 
they  do  not  appear  in  the  matrix  H.  Each  new  state  veotor  thus  obtained  was  used  as  an 
alternative  reference  state  for  whioh  a  new  gain  matrix  was  oomputed.  Plotting  the 
individual  elements  of  these  column  matrices  against  the  total  change  of  the  reference 
state,  finally  yielded  an  indication  of  the  dependence  of  the  Kalman  gain  matrix  on  the 
reference  state.  In  Fig.  1  the  results  are  presented  for  the  two  elements  of  the  gain 
matrix  that  affeot  the  X  and  Y  components  of  the  state  veotor.  It  is  clear  that  K  is 
strongly  dependent  on  the  reference  state  and  even  shows  a  near-singular  behavior.  These 
high  gain  values  lead  to  large  state  correotlons,  which  may  result  in  considerable 
linearisation  errors  and  may  oause  divergence.  In  this  test  the  noise  level  of  the 
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Pig.  1:  The  dependence  of  the  gain  matrix  on  the  reference  state. 


simulated  observations  was  1  m.  Additional  tests  proved  that  smaller  measurement  standard 
deviations  further  amplify  the  near-singular  behavior  and  that  a  larger  measurement  noise 
lessens  the  dependence  of  the  gain  on  the  reference  state. 

Physically,  this  peculiar  behavior  of  the  gain  can  be  explained  as  follows.  At  the 
start  of  a  pass,  the  accuracy  of  the  measurement  is  much  higher  than  the  accuracy  of  the 
state  vector.  In  Pig.  2a  this  is  visualized  by  a  position  variance  ellipsoid  and  a  thin 
slice  representing  the  first  range  measurement  and  its  variances.  After  processing  the 
first  few  hlghly-accurate  range  measurements,  which  are  all  taken  in  nearly  the  same 
direction,  the  position  variance  ellipsoid  will  be  flattened  considerably  in  about  the 
direction  of  the  observations  (Fig.  2b).  The  information  content  of  only  a  few 
observations  is  generally  not  sufficient  to  yield  a  good  approximation  of  the  real 
satellite  orbit.  So,  the  next  observation  may  lie  relatively  far  outside  the  error 
ellipsoid.  In  combination  with  the  extreme  flattening  of  the  error  ellipsoid  this  causes 
the  filter  to  try  to  cure  the  situation  by  producing  a  large  position  shift  vector.  This 
vector  points  in  a  direction  that  is  governed  mainly  by  the  orientation  of  the  major  axis 
of  the  ellipsoid;  its  magnitude  can  be  much  larger  than  the  state  standard  deviation  in 
that  direction. 

For  visualizing  the  filter  process  for  the  case  illustrated  In  Pig.  2b,  further 
simplifications  can  be  Introduced.  The  strongly  flattened  position  variance  ellipsoid  is 
approximated  by  a  plane,  which  is  about  perpendicular  to  the  observation  vector  (Pig. 
3a).  The  range  measurement  variances  can  be  approximated  by  a  plane  perpendicular  to  the 
range  vector  as  computed  from  the  reference  state.  Using  this  representation  it  will  be 
clear  that  the  observation-update  step  will  yield  an  optimal  state  which  lies  on  the 
intersection  of  the  two  planes.  The  state  correction  is  dependent  on  the  range  residual. 


(a)  (b) 


Pig.  2s  Sketch  of  the  state  correction  at  the  start  of  a  pass  (a)  or  after  the  firat  few 
hlghly-aoeurate  range  observations  have  been  processed  (b). 
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Fig.  3:  Simplified  diagram  of  the  state  correction  after  the  first  few  highly-accurate 
range  observations  have  been  processed,  as  provided  by  the  pure  Kalman  filter  (a) 
or  by  the  filter  with  the  reference  state  iteration  modification  (b). 


Zj  and  the  angle  a  between  the  two  planes.  Comparison  with  Eq.  (3)  3hows  that  in  this 
simplfied  diagram  the  magnitude  of  the  Kalman  gain  corresponds  in  a  first  approximation 
to  1/sin  a .  This  explains  the  behavior  of  K  as  shown  in  Fig.  1.  When  both  planes  are 
nearly  parallel,  small  changes  in  the  reference  state  along  the  position  variance  plane 
will  cause  large  variations  of  a  and  thus  of  the  gain  matrix.  However,  if  a  becomes  zero, 
the  approximation  of  the  state  and  measurement  variances  by  two  (parallel)  planes  fails 
and  the  gain  matrix  becomes  very  small. 

Large  state  corrections  will  certainly  introduce  appreciable  non-linearity  errors  in 
the  state  estimates.  This  is  illustrated  in  Fig.  3b,  from  which  it  may  be  concluded  that 
the  initial  (linear  optimal)  state  estimate  (3)  does  not  coincide  with  the  observation 
sphere  described  by  the  range  observation.  In  order  to  force  the  state  estimate  towards 
the  intersection  of  the  sphere  with  the  position  variance  plane,  representing  a  more- 
realistic  approximation  of  the  true  optimal  estimate,  an  iteration  scheme  has  been 
derived  (Ref.  24)  that  also  reduces  the  magnitude  of  the  state  corrections,  and  that 
yields  improved  filter  performance.  During  the  iterations  the  values  of  R  and  are 
held  fixed.  In  the  first  iteration  step  the  initial  state  estimate,  X^,  is  used  as  the 
reference  state.  In  the  next  iteration  step,  the  mean  of  the  initial  reference  state  and 
the  first  updated  state  estimate,  X,,  is  used  as  the  reference  state.  For  this  new 
reference  state,  new  values  of  H  ana  K  are  computed.  Using  this  new  K  value.  Pi  is 
recomputed  and  a  new  state  correction  Kzj.  is  computed,  which  is  added  to  the  initial 
reference  state,  _X^,  The  iterative  process  stops  if  the  computed  values  for  the  reference 
state  have  converged  (Fig.  3b).  Sometimes,  however,  the  process  does  not  converge  at  all. 
This  may  happen,  for  instance,  if  the  state  varla'noe  plane  lies  outside  the  observation 
sphere.  To  oope  with  this  problem  the  measurement  standard  deviation  may  be  increased, 
but  from  a  theoretical  point  of  view  this  is  a  precarious  measure,  sinoe  it  will  also 
allow  erroneous  observations  to  enter  thn  solution. 

Although  the  iteration  soheme  improves  the  stability  of  the  filter,  errors  will 
inevitably  build  up  during  the  processing  of  the  first  few  observations  when  the 
iterative  process  is  not  yet  effective  and  may  still  cause  filter  divergence.  Eventually, 
a  simple  engineering  solution  hes  been  adopted  (Ref.  24)  that  proved  to  be  highly 

effective.  Before  each  observation-update  step  the  diagonal  elements  of  the  state 

oovarianoe  matrix,  P,  are  multiplied  by  a  number  slightly  greater  than  one.  By  this,  the 
relative  magnitude  of  the  correlation  coefficients,  which,  in  general,  describe  the 
flattening  of  the  error  ellipsoid,  is  implioitly  reduced  and  a  less  flattened  ellipsoid 
will  result.  Therefore,  the  multiplicative  factor  has  been  oalled  the  correlation 
correction  faotor,  n  .  Only  in  the  rare  cases  that  the  minor  axis  of  the  error 

ellipsoid  happens  to°l£e  along  one  of  the  coordinate  axes,  no  appreciable  reduction  of 
the  flattening  results  and  the  method  breaks  down.  Figure  4  dearly  shows  the  reduction 
of  the  sensitivity  of  the  Kalman  gain  matrix  to  the  reference  state  for  increasing  values 
of  the  correlation  correction  faotor.  In  the  near-singularity  region,  a  0.01  peroent 
increase  of  the  variances  has  a  tremendous  effect  on  the  gain  matrix;  outside  this 

region,  the  gain  remains  almost  unoh&nged.  It  proved  that  the  application  of  a 
correlation  correction  faotor  is  a  most  efficient  way  to  stabilise  the  filter,  and  is 
more  effective  than  the  Iteration  prooess. 
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4:  The  dependence  of  the  gain  matrix  on  the  reference  qtate  for 
different  values  of  the  correlation  correction  factor. 


In  SORKA,  both  the  reference  state  Iteration  scheme  and  the  correlation  correction 
method  are  applied,  while  the  possibility  to  increase  the  measurement  standard  deviation 
has  also  been  implemented.  Before  each  observation-update  step,  the  diagonal  elements  of 
the  state  covariance  matrix  are  multiplied  by  the  correlation  correction  factor  (default 
1.00001).  The  numerical  tests  have  shown  that  the  iterative  process  then  usually 
converges  immediately.  As  a  matter  of  fact  one  should  be  suspicious  if  this  does  not 
occur,  because  it  may  indicate  that  an  erroneous  observation  is  being  processed,  which 
can  upset  the  filtering  process. 

The  presence  of  these  bad  observations  is  an  additional  problem,  which  may  complicate 
the  above  mentioned  divergence  suppression  methods.  Oross  measurement  errors  will  corrupt 
the  state  estimates,  especially  when  they  occur  at  the  beginning  of  a  pass  where  the 
state  variances  are  large.  To  reduce  the  effects  of  bad  data,  a  statistical  technique  has 
beeil  developed  in  Ref.  25*  It  is  based  on  the  introduction  of  a  reliability  criterion  'for 
bounding  the  propagation  of  non-detected  measurement  gross  errors  into  the  state  vector. 
The  state  covariance  matrix  after  the  observation-update  step  is  computed  such  that  it  is 
not  only  a  measure  for  the  effects  of  random  errors  but  also  for  the  effects  of  a  non- 
detected  gross  measurement  error.  So,  the  filter  is  forced  to  make  a  balance  between  the 
precision  and  the  reliability  of  the  updated  state  vector.  This  goal  is  reached  by 
computing  the  updated  state  and  state  covariance  matrix  according  to 
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The  factor  a  is  a  function  of  some  statistical  parameters  and  the  ratio  R/HPHT.  At  the 
start  of  a  pass,  the  state  variances  are  large  and  a  takes  a  relatively  large  value. 
Consequently,  the  addition  of  the  a-terms  on  the  right-hand  sides  of  Eqs.  (8)  will  yield 
a  smaller  state  correction  and  will  blow  up  the  position  error  ellipsoid  in  the  direction 
of  Its  minor  axis.  This  is  also  favorable  from  the  viewpoint  of  limiting  the 
linearization  errors  and  preventing  divergence.  As  more  observations  are  processed  and  P 
becomes  smaller,  a  decreases  and  the  prooess  more  closely  approximates  the  optimal  Kalman 
filter  ooheme.  This  alternative  technique  has  also  been  included  in  SORKA  as  an  option. 
Tests  where  only  soreened  data  were  processed  yielded  similar  results  as  obtained  with 
the  application  of  the  correlation  correction  faotor. 

7.  DIVERGENCE  MONITORING 

A  proper  use  of  the  techniques  described  in  the  previous  Sections  will,  in  general, 
lead  to  a  stable  Kalman  filter  behavior  when  processing  laser  range  observations. 
Nevertheless,  sometimes  divergence  of  the  filter  may  still  ooour,  whioh  results  in 
lnoorreot  and  useless  state  vector  estimates.  In  simulations,  divergence  can  be  detected 
very  easily.  In  those  oases,  divergence  is  reoognized  if  the  estimated  state  vectors 
deviate  considerably  more  from  the  simulated  state  vectors,  from  which  the  simulated 
observations  are  computed,  than  the  estimated  state  vector  standard  deviations  given  by 
the  state  oovarianoe  matrix.  If  real  observations  are  processed,  such  a  comparison,  of 
course,  is  not  possible.  The  only  way  to  deteot  divergence  in  real  observations 
processing  is  to  study  the  history  of  the  observation  residuals,  z_.  The  observation 
residuals  relative  to  the  predicted  state  vector  at  the  time  of  an  observation  are  used 
in  the  observation-update  step.  The  oovarianoe  matrix  of  the  residual  vector  is  used  in 
the  computation  of  the  Kalman  gain  matrix.  Thus,  for  divergence  detection  there  is 
available  a  sample  of  a  stochastic  variable  (the  residual  vector),  whioh,  under  the 
assumptions  made,  has  a  Gaussian  distribution  with  zero  mean  and  known  oovarianoe  matrix, 
and  whioh  is  not  correlated  in  time. 

The  two  divergence  monitoring  methods  implemented  in  SORKA  are  based  on  testing  the 
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validity  of  the  residual  covariance  matrix.  Therefore,  the  squared  residual  is  weighted 
with  the  covariance  matrix.  This  squared  weighted  residual  is  a  stochastic  variable  which 
should  have  a  chi-squared  distribution,  with,  in  case  of  range-only  measurements,  one 
degree  of  freedom  per  observation.  A  method  can  be  developed  to  test,  with  a  given  degree 
of  confidence,  if  the  sum  of  n  squared  weighted  residuals  corresponds  indeed  to  a  chi- 
squared  distribution  with  n  degrees  of  freedom.  However,  as  after  a  number  of  processed 
observations  this  method  will  become  very  slow,  in  SORKA  two  faster  divergence  detection 
techniques  have  been  included.  One  method  is  based  on  a  fading-memory  filter,  in  which 
the  most  recent  residuals  have  a  greater  weight  in  the  sum.  The  other  is  based  on  the 
digital  low-pass  filter  technique  described  in  Ref.  26,  which  also  results  in  a  test 
which  is  more  sensitive  to  the  last  measurements.  Both  methods  are  handicapped  if  only  a 
few  observations  during  a  pass  are  available,  but  were  found  to  work  satisfactory  if  many 
observations  are  processed.  Presently,  the  methods  are  only  used  to  monitor  if  divergence 
occurs.  In  the  final  version  of  SORKA  one  of  the  methods  will  be  linked  in  a  closed-loop 
mode  to  correct  for  divergence  as  soon  as  such  a  tendency  is  detected. 

8.  RESULTS 

To  investigate  the  capabilities  of  SORKA  in  processing  range  observations  from  only 
one  tracking  station,  many  simulations  have  been  performed.  In  these  tests  simulated 
observations  of  GEOS-3  (Table  1)  were  used.  The  results  of  the  simulations  are  treated  in 
detail  in  Refs.  24,  27»  28.  After  it  had  been  demonstrated  that  SORKA  is  capable  of 
yielding  accurate  state  estimates,  numerical  experiments  were  done  (Refs.  22,  24,  25,  29) 
with  actual  observations  of  GEOS-1  (Table  1).  Theae  laser  range  observations  had  been 

acquired  at  the  Kootwijk  tracking  station.  The  data  arc,  with  a  length  of  54  hour, 

comprises  a  total  of  611  measurements,  distributed  over  8  passes  during  the  period  July 
11  to  July  13,  1978,  in  which  GEOS-1  completed  27  revolutions  about  the  earth.  In  Pig.  5 
the  sub-satellite  points  at  the  times  of  the  observations  are  plotted.  The  general 
direction  of  the  satellite's  groundtracks  is  from  west  to  east.  The  same  observations 
have  also  been  used  in  studies  described  in  Refs.  30,  31  to  estimate  from  laser  range 
data  acquired  at  Kootwijk  and  Wettzell  (Fed.  Rep.  Germany)  the  orbit  of  the  satellite  and 
the  coordinates  of  the  Wettzell  tracking  station.  In  those  studies  the  least-squares 
batch-processing  orbit  determination  and  parameter  estimation  GEODYN  computer  program 
(Ref.  32)  was  used.  To  be  able  to  judge  the  quality  of  the  Kalman  filter  estimates,  an 
orbit  solution  was  generated  with  GEODYN  from  the  611  Kootwijk  observations.  The  computed 
orbital  ephemeris,  containing  state  vectors  at  the  observation  times  and  at  regular  300  s 
intervals  throughout  the  complete  data  span,  was  stored.  Because  of  the  very  high 

accuracies  obtainable  with  GEODYN,  that  orbit  solution  could  be  considered  the  real-world 
orbit  of  the  satellite  to  which  the  SORKA  results  are  compared.  The  state  vector 
differences,  which  are  interpreted  as  the  SORKA  estimate  errors,  were  subsequently 
transformed  into  errors  in  the  classical  orbital  elements  as  well  as  into  errors  in 

radial,  cross-track  and  along-track  direction.  Orbital  differences  expressed  in  these 
parameters  suit  better  the  physical  interpretation  of  the  results  rather  than  the  fast 
varying  rectangular  components  of  the  state  vector  differences. 


Table  1:  General  satellite  data. 


GEOS-1 

GE0S-3 

Satellite  number 

6508901 

7502701 

Launch  date 

November  6 

April  9 

Shape 

octagonal  print 
with  heaiapharical 
cep  on  down  end 
and  octagonal 
pyramid  on  top 

octagonal  prism 
with  radar  altimater 
dith  on  down  and  and 
octagonal  pyramid  on 
top 

Diaanaiona  (cm) 

132  wide 

81  high 

122  wide 

131  high 

Mat a  (kg) 

172.5 

345.9 

Stabillaation 

gravity-gradient 

gravi ty-gradient 

Transmitters 

TRANSIT  end  MINI- 
TRACK  beecon, 

8EC0R  end  0RARR 
treneponders. 

dopplar  baacon. 
C-band  and  S-band 
transponder* . 
radar  altimatar. 

Later  reflectors 

322  reflector!  on 

0,18  m*  bottom- 
mounted  flet 
array 

284  raflactora  in 
conical  ring  around 
tha  ptriphary  of 
bottom  side 

Orbit  (mid  1978) 

semi-major  axis  (km) 

8073 

7221 

eccentricity 

0.0717 

0.0014 

inclination  (deg) 

59.4 

115.0 

period  (min) 

120 

102 

Pig.  5:  The  0E03-1  sub-satellite  points  at  the  observation 
times.  The  general  direction  of  the  satellite's 
groundtracks  is  from  west  to  east. 
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Pig.  6:  The  semi-major  axis  errors  and  standard  deviations  during  the  first  pass.  The 
state  and  its  oovarlanoe  matrix  are  computed  by  the  pure  Kalman  filter  (top 
left),  with  the  reference  state  iteration  soheme  (top  right)  or  with  both  the 
iteration  soheme  and  the  correlation  oorreotion  factor  (bottom). 
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Pig.  7:  The  radial  and  along-track  3tate  errors  and  stan¬ 
dard  deviations  during  the  first  pass. 


The  results  presented  in  this  paper  refer  only  to  the  first  three  passes,  which  are 
separated  by  two  and  one  revolutions,  respectively.  In  this  initial  phase  of  the  data  arc 
all  problems  are  encountered  that  are  characteristic  for  processing  range  observations 
from  a  single  station,  in  particular  if  the  initial  3tate  is  not  known  accurately.  To 
simulate  this  last  condition,  the  initial  state  as  oomputed  by  OEODYN  in  reotangular 
coordinates  was  corrupted  with  noise  of  1  km  standard  deviation  in  the  position 
components  and  1  m/s  in  the  veloaity  components.  For  the  filtering  prooess  this 
corresponds  more  or  less  to  a  worst  case  analysis  since,  in  general,  some,  state 
components  will  be  known  to  a  higher  accuracy,  in  particular  those  which  are  related  to 

the  along-track  velocity.  The  formal  standard  deviation  of  the  observations,  whioh  were 

known  to  oontaln  no  gross  errors,  was  taken  to  be  25  cm,  corresponding  to  tho  accuracy 
level  of  the  Kootwijk  laser  system  in  1978.  The  results  oan  bo  divided  into  two 
categories,  whioh  are  governed  by  different  parameters.  The  first  category  pertains  only 
to  the  first  pass.  These  results  are  primarily  affected  by  the  accuracy  of  the  initial 
state  estimate  and  by  errors  due  to  non-linsarltles.  The  two-revolutions  data  gap  between 
the  first  and  the  second  pass  is  a  transient  region.  The  second  category  starts  at  the 
beginning  of  the  second  pass}  these  results  are  primarily  influenced  by  errors  in  the 
orbit  perturbations  modeling. 

Figure  6  shows  the  effects  of  the  reference  state  iterations  and  the  application  of 
the  correlation  correction  faotor,  n  ,  on  the  semi-major  axis  errors  during  the  first 

pass.  The  solid  lines  lndioate  the  standard  deviations  of  the  semi-major  axis  estimates, 

as  provided  by  the  Kalman  filter.  The  value  of  Tq,  indicated  in  the  plots,  refers  to  the 
origin  of  the  time  scale}  i.e.  the  time  expressed  in  Modified  Julian  Days  (MJD)  of  the 
first  measurement  of  that  pass.  The  plots  show  that  the  basic  filter  without 
modifications  leads  to  divergence  where  semi-major  axis  errors  of  9  km  occur,  which  are 
muoh  larger  than  the  filter  estimates  for  the  standard  deviation.  The  introduction  of  the 
reference  state  iteration  soherae  yields  a  muoh  better  filter  behavior.  After  reaching  a 
peak  of  about  1.5  km,  the  errors  decrease  rapidly  to  a  level  of  about  100  m.  Applying 
also  a  correction  faotor  n  ■  1.Q0001  leads  to  a  more  stable  process  where  the  errors 
decrease  smoother  and  no  sign  of  divergence  is  present.  Of  course,  this  is  achieved  at 
the  expenae  of  a  slower  converging  process  and  larger  estimated  standard  deviations.  The 
evolution  of  the  radial  and  along-track  position  and  velocity  component  ertora  la  shown 
in  Pig.  7.  These  reaulta  were  obtained  from  a  computation  where  both  the  reference  state 
iterations  and  the  correction  faotor  were  applied.  The  radial  position  error  is  always 
less  than  1H0  m}  the  along-traok  velocity  error  decreases  to  an  end-of-pass  value  of  15 
om/a.  The  radial  velooity  error  increases  to  an  end-of-paas  value  of  about  65  cm/s,  whioh 
can  be  understood  when  it  is  realised  that  observations  from  a  single  pass  do  not  yield 
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Pig.  8:  The  radial  and  along-track  position  errors  and  standard  deviations  during  the 
two-revolutions  data  gap  between  the  first  and  the  second  pass. 


Pig.  9:  The  effeots  of  the  assumed  state  noise  at  the  end  of  the  two-revolutions  data  gap 
on  the  semi-major  axis  errors  and  standard  deviations  during  the  aeoond  pass.  The 
plot  on  the  left  refers  to  assumed  radial,  cross- track  and  along-traok  position 
and  veloolty  errors  of  20  m  and  2  cm/a,  respectively;  the  plot  on  the  right  to 
errors  twenty  times  as  large. 


sufficient  information  on  the  preoise  ohape  of  the  orbit.  A  pooltive  radial  veloolty 
error  leads  to  Increasing  radial  position  errors  and,  according  to  the  equations  for  the 
satellite  motion,  to  increasing  negative  along-track  position  errors. 

The  radial  and  along-track  position  errors  during  the  two  revolutions  between  the 
first  and  the  second  paes  are  plotted  In  Pig.  8.  The  oscillating  along-track  position 
error  remains  leoe  than  3.5  km  during  this  period  and  is  about  0.9  km  at  the  start  of  the 
second  pass.  The  radial  position  error  la  about  0.1  km  at  the  start  of  the  second  pass 
and  naver  exoeeda  0.8  km  during  the  two  revolutions.  To  compute  the  standard  deviations, 
the  state  covariance  matrix  has  been  propagated  by  applying  the  aeries  expansion 
technique  for  a  and  the  constant  value  or  2.5  m/e/day  for  the  computation  of  Q.  Prom  Fig. 
8  the  along-track  position  accuracy  estimate  shows  up  to  be  rather  conservative.  This  : 
partly  due  to  a  too  pessimistic  modeling  of  the  effect#  of  unmodeled  perturbations,  bi.r 
for  e  larger  pert  due  to  the  rether  large  state  variances  at  the  end  of  the  flret  psiaa, 
which  in  turn  are  a  result  of  the  use  of  the  correlation  correction  factor  during  that 
pass. 

Figure  9  deplots  for  the  aeoond  pass  the  effect!  of  the  assumptions  made  for  the 
computation  of  the  state-noise  covariance  matrix  aftor  the  two-revolutlona  data  gap. 
While  for  ell  results  presented  In  this  paper  the  atete  noise  after  an  interval  between 
two  pastes  is  computed  from  the  assumption  of  radial,  cross-track  and  along-track 
poaltlon  and  velocity  errors  of  20  m  and  2  cm/e,  respectively,  in  Pig.  9  also  the  results 
ere  plotted  for  state  noie#  values  which  are  twenty  time#  larger.  It  le  clear  that  the 
•emi-aajor  axle  errors  during  the  second  pees  do  not  differ  considerably  for  the  two 
assumptions;  m  both  cases  the  errors  are  lees  than  30  a.  The  estimated  state  variances, 
however,  are  mora  realistic  for  the  smaller  state  noise  values.  That  the  state  component 
estimates  do  not  differ  much  in  the  two  caeea  end  that  the  accuracy  estimates  hardly  vary 
during  the  pass  la  ohsraotariatlo  for  the  semi-major  axis.  Per  the  other  orbital  element* 
It  was  found  that,  generally,  the  smaller  state  noiee  values  yield  more-rncour&ta 
estimates  and  decreasing  eatlaated  standard  deviation*. 
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The  inclination  errors  during  the  first  three  passes  are  shown  in  Pig.  10.  During  the 
first  pass  the  Inclination  errors  take  a  nearly  constant  value  of  about  0.007°  and  show 
no  decrease  with  time.  This  could  be  expected  because  range  measurements  from  a  single 
pass  contain  almost  no  information  on  the  orientation  of  the  orbital  plane.  Consequently, 
inclination  errors  are  not  filtered  out  easily.  At  the  beginning  of  the  second  pass  the 
inclination  error  still  has  the  same  value  but  after  the  first  observations  of  the  second 
pass  have  been  processed  the  error  decreases  and  converges  •‘o  an  end-of-pass  value  of 
about  0.0007°.  At  the  end  of  the  third  pass  the  inclination  error  is  even  less  than 
0.00006°. 

The  total  position  errors  Just  before  the  processing  of  the  first  observation  of  the 
second  and  third  pass,  which  are  primarily  errors  in  cross-track  and  along-track 
direction,  are  1.4  km  and  0.4  km,  respectively.  Assuming  that  the  distance  of  QE0S-1  at 
the  start  of  a  pass  is  2500  km,  a  position  error  of  1.5  km  corresponds,  depending  on  the 
pass  geometry,  to  a  topocentric  angular  error  of  1  to  2  arcminufces.  For  ranging  to 
satellites  at  altitudes  below  2500  tan  the  Kootwijk  laser  station  usually  applies  a  beam 
divergence  of  3  to  10  arcrainutes.  So,  in  this  example,  the  Kalman  filter  position 
prediction  is  so  accurate  that  at  the  first  laser  firings  during  the  second  and  third 
pass  the  satellite  is  actually  within  the  laser  beam  and  no  satellite  search  process  is 
required. 

The  radial,  cross-track  and  along-track  position  errors  during  the  third  pass  are 
shown  in  Pig.  11.  The  errors  in  radial  direction  are  generally  less  than  15  m,  in  cross- 
track  direction  generally  less  than  60  ra  and  in  along-track  direction  about  60  m.  These 
accuracies  fully  satisfy  the  Kootwijk  laser  system  requirements  mentioned  in  Section  2 
for  the  application  of  a  small  beam  divergence  and  a  short  time  window.  Interesting  is 
the  behavior  of  the  along-track  position  error.  The  filter  computes  standard  deviations 
of  about  17  a,  while  the  estimated  along-track  position  contains  in  reality  a  nearly 
constant  error  of  about  -60  m.  This  Indicates  that.  Just  as  shown  in  Pig.  7  for  the  first 
pass,  only  the  first  few  observations  contribute  tc  the  improvement  of  the  along-track 
position  component.  The  computation  of  the  along-track  state  variances  is  obviously  too 
optimistic,  indicating  that  a  further  tuning  of  filter  parameters  still  has  to  be 
performed. 

9.  CONCLUSIONS 


For  investigating  the  possibilities  to  use  laser  range  observations  acquired  at  the 
Kootwijk  satellite  observatory  for  (semi-)  real-time  improvement  of  the  predicted 
satellite  positions,  a  computer  program  called  SORKA  has  been  developed.  The  main 
requirements  were  that  SORKA  could  satisfy  both  the  sccur«oy  level  needed  for  operational 
laser  ranging  and  the  capabilities  of  a  small  local  computer.  Until  now,  SORKA  runs  on  an 
I8H  370/158  computer  and  the  implementation  on  a  local  computer  still  has  to  be  studied. 
In  the  design  of  SORKA  precautions  have  been  taken  to  make  such  an  implementation 
possible. 

All  teats  In  applying  SORKA  to  simulated  and  real  observations  of  0K0S-1  and  QKO.S-3 
look  promising  and  Justify  the  continuation  of  the  efforts  to  improve  the  computations 
scheme  such  that  it  beat  suits  the  laser  ranging  system  characteristics.  It  has  been 
demonstrated  that  a  stable  Kalman  filter  process  can  be  obtained  when  using  laser  range 
measurements  from  one  tracking  station.  It  was  round  that  divergence,  which  very  easily 
may  occur  when  proeesalng  only  laser  range  measurements,  can  effectively  be  suppressed  by 
applying  a  correlation  correction  factor  and  an  iteration  acheme  in  the  observation- 
update  atep. 
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SUMMARY 

This  report  presents  two  methods  for  determining  the  optimum  state  estimate  for  the  angle 
only  ballistic  trajectory  problem. 

The  first  method  utilizes  the  techniques  of  Marquardt  matrix  conditioning  and  explicit 
Jacobian  in  determining  the  weighted  least  squares  state  estimate  of  a  ballistic  trajectory. 
This  Marquardt  least  squares  (MLS)  algorithm  is  a  batch  or  nonrecursive  process.  A  descrip¬ 
tion  of  a  similar  least  squares  batch  technique  is  included  in  this  report  for  a  better 
understanding  and  a  means  of  evaluation  by  comparison. 

The  second  method  applies  the  explicit  Jacobian  technique  to  the  recursive  Kalman  filter 
equations.  This  improved  Jacobian-Kalraan  filter  formulation  when  combined  with  the  MLS 
batch  process  for  initialization  forms  the  desired  total  angle  oniy  tracking  algorithm. 
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NOTATION 

Measurement  matrix 
Time  between  measurements 
Deviations  of  state  from  nominal  set 
Estimate  of  Ax 

Deviations  of  measurements  from  nominal  ox  calculated  set  (Y) 
Total  energy 

Earth  centered  inertial  coordinate  system 

Measurement  errors 

The  f  and  g  series  values 

State  vector  velocity  components 

Time  derivative  of  state  vector  velocity  components 

Gravitational  constant 

Azimuth,  elevation  measurements 

Kalman  gain  matrix 

Identity  matrix 

Jacobian  matrix 

Kinetic  energy 

Marquardt  matrix  conditioning  factor 
Maas  of  earth 
G  *  M 

Potent  in  1  enemy 

State  transition  matrix 

Range 

Range  rate 
Range  vector 
Radius  of  earth 
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Center  of  earth  to  sensor  radius 
Center  of  earth  to  target  radius 
The  x,y,z  components  of  R 
Sensor  position  vector 
State  covariance  matrix 
The  x,y,z  components  of  S 

Sensor  centered  topographic  coordinate  system 

One  sigma  uncertainty  in  E 

One  sigma  measurement  uncertainty  in  6,$ 

Target  position  vector 

Initial  target  position  vector 

The  x,y,z  components  of  T 

Time  for  each  measurement 

Time  step  for  f  and  g  series 

Azimuth,  elevation  measurements 

First  time  derivative  of 

Second  time  derivative  of  6,$ 

The  f  and  g  series  terms 

Initial  target  velocity  vector 

Relative  velocity  vector  (target-sensor) 

Sensor  velocity  magnitude 

Sensor  velocity  vector 

Target  velocity  magnitude 

Target  velocity  vector 

Measurement  covariance  matrix 

Time  derivative  of  X 

Estimate  of  X 

Sensor  state  vector 


X, X  State  vector 

x,y,i  State  vector  position  components 

i,y,i  State  vector  velocity  components 

x»,x»,Xi  State  vector  position  Components 

Xt,Xf,x«  Statu  veotor  velocity  components 

#*»#*»  Time  derivative  of  state  vector  position  components 

Time  derivative  of  state  veotor  velocity  components 
*R'^R'*R  The  x,y,s  components  of  Hr 

Xq, ys.«s  The  X,y,i  components  of  Vg 

X*  Chi-squared 

Y  Estimate  of  Y 

Y. Y  Measurement  veotor 

•  normalised 


1 .  INTRODUCTION 


This  report  deals  with  the  problem  of  state  estimation  of  ballistic  trajectories  with 
angle  only  measurements.  This  type  of  problem  becomes  difficult  when  the  observer  is  free- 
falling  and  more  difficult  if  the  observer  is  then  located  in  the  plane  of  the  observed 
trajectory.  The  methods  described  in  this  report  are  very  effective  against  this  most  dif¬ 
ficult  case,  and  superior  to  existing  angle  only  tracking  filters  in  terms  of  stability, 
computational  requirements,  and  tracking  performance. 

The  first  method  described  herein  utilizes  the  methods  of  Marquardt  matrix  conditioning1 
and  explicit  Jacobian  in  determining  the  weighted  least  squares  state  estimate  for  the  non¬ 
linear,  time  varying,  dynamic  system  of  a  ballistic  trajectory  with  nonlinear  noisy  measure¬ 
ments.  In  this  case  both  the  equations  of  motion  and  the  angle  observations  are  nonlinear 
functions  of  the  state.  This  Marquardt  least  squares  (MLS)  technique  is  a  nonrecursive  or 
batch  process  in  that  all  the  observations  must  be  processed  each  time  a  state  estimate  is 
made.  The  method  of  Incorporating  a  priori  knowledge  of  the  total  energy  into  the  MLS  algo¬ 
rithm  to  assist  in  poor  observability  problems  is  also  discussed. 

The  second  method  takes  the  explicit  Jacobian  technique  developed  for  the  MLS  algorithm 
and  applies  it  to  the  recursive  Kalman  filter  equations.  This  improved  Jacobian-Kalman 
filter  formulation  together  with  the  MLS  for  initialization  form  the  complete  angle  only 
tracking  algorithm. 

This  report  is  organized  as  follows.  Section  2  provides  the  background  material  and  the 
step  by  step  development  of  the  equations  for  the  standard  weighted  least  squares  batch 
filter  and  MLS  algorithms.  A  discussion  of  the  energy  constraint  concept  and  the  applica¬ 
tion  of  the  explicit  Jacobian  method  to  the  Kalman  filter  is  included  in  this  section.  Sub¬ 
section  2.1  presents  the  weighted  least  squares  solution  for  the  general  nonlinear  system 
problem.  This  serves  as  a  common  basis  for  both  the  standard  weighted  least  squares  batch 
filter  and  the  MLS  algorithms  which  are  developed  in  subsections  2.2  and  2.3,  respectively. 
No  attempt  is  made  to  make  these  derivations  mathematically  rigorous.  The  energy  constraint 
concept  is  discussed  in  subsection  2.4.  The  application  of  the  explicit  Jacobian  technique 
to  the  Kalman  filter  equations  is  presented  in  subsection  2.5.  Section  3  gives  the  perform¬ 
ance  results  for  several  ballistic  trajectory  problem  test  cases.  Section  4  contains  the 
conclusions. 


2 .  ALGORITHM  DESCRIPTION 


2.1  Weighted  Least  Squares  Concept 

In  general,  the  fundamental  problem  of  concern  can  be  stated  as  follows.  A  nonlinear 
system  can  be  represented  by  the  linearized  model  matrix  equations 

AY  =  AAX  +  e  (2-1) 

where  AY  is  an  n„x  1  matrix  of  the  deviations  of  the  observations  (Y)  from  the  nominal  or 
calculated  set  (Y) ,  AX  is  a  k  x  1  matrix  of  deviations  of  the  unknown  parameters  from  a 
nominal  (known)  set,  A  is  an  n  x  k  (n  >  k)  known  matrix,  and  e  is  an  n  x  1  matrix  of  obser¬ 
vation  errors  (unknown).  The  problem  iss  gi^n  AY  and  A  and  the  linearize^ model  of  Eq. 
(2-1) ,  find  the  "best"  estimate  of  AX  called  AX.  Once  theA"best"  estimate  AX  has  been 
determined,  the  "best"  estimate  of  the  unknown  parameters  X  may  be  determined  by 

X  =  X  +  (2-2) 

where  X  is  the  known  or  nominal  set  of  parameters.  In  this  case  the  "best"  estimate  is 
acheived  when  the  weighted  least  squares  criterion  is  satisfied;  that  is,  when  the  sum  of 
the  squares  of  the  components  of  the  weighted  residual  vector  is  minimized.  This  quantity 
can  be  represented  by: 

AYTW-1AY  (2-3) 

where  W  is  the  known  n  x  n  measurement  covariance  matrix.  This  weighting  matrix  accounts 
for  the  difference  in  confidence  between  various  observations  and  the  possible  correlation 
between  them. 


The  wall  known  weighted  least  squares  solution  (formulated  by  Gauss  in  1794*)  to  this 
problem  is 

-  (ATW"lA)"lATW“lAY  (2-4) 

which  combined  with  Eq,  (2-2)  yields  the  total  expression 

X  -  X  +  (Arw“lA)"lATW-1AY  .  (2-5) 

Given  some  initial  guess  of  X,  this  expression  is  normally  iterated  as  follows  until  the 
current  parameter  estimates  do  not  vary  appreciably  from  the  previous  iteration. 


*Formulation  included  only  the  diagonal  terms  of  W. 


— «-  Evaluate  A^with  X 
Calculate,/?  for  X 

X  =  X  +  (AaW  xA)  xa  w  xay 
- X  =  X 


Eq.  (2-5)  establishes  the  basis  for  both  the  standard  weighted  least  squares  batch  filter 
and  the  Marquardt  least  squares  algorithms.  The  following  sections  will  show  how  these  two 
different  formulations  are  derived  from  this  common  expression  thus  demonstrating  both  their 
similarities  and  differences. 


2.2  Standard  Weighted  Least  Squares  Batch  Filter 


For  a  given  set  of  observation  times  t  =  t  ,t  , ...t  T  the  linearized  model  represented 
by  Eq.  (2-1)  can  be  expanded  to  give 
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For  a  time-varying  "linear"  differential  equation  model,  that  is. 


^  AX (t)  =  F(x(t))  AX (t) 

the  "approximate"  solution  is  given  by 

(2-7) 

AX (t  +  At)  =  AX(t)  +  At  f(x (t)) 

Factoring  out  AX(t) 

AX  (t) 

(2-8) 

AX (t  +  At)  =*  jl  +  At  f(x (t))j 
which  yields  the  following  transition  relation 

AX(t) 

(2-9) 

AX  4  *  )  .  AX  . 

n-l  n-i,n  n 

(2-10) 

Note  that  two  approximations  were  required  in  arriving  at  this  last  expressions  (a)  the 
linear  differential  equation  model  and  (b)  its  approximate  solution.  The  significance  of 
this  will  become  apparent  in  the  development  of  the  Marquardt  weighted  least  squares  algo¬ 
rithm  in  subsection  2.3.  Using  Eq.  (2-10),  Eq.  (2-6)  can  be  written  as 


Now  factor  out  AXfi  to  yield 
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From  Eq.  (2-12)  the  following  matrix  is  defined 


/An 


VlVl.n 


Eq.  (2-11)  can  now  be  written  in  the  compact  form 


Ax(n)  *  Jn  **n  +  e(n) 


(2-11) 


(2-12) 


(2-13) 


(2-14) 
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The  weighted  least  squares  solution  for  this  linearized  model  can  be  written  as  an  extension 
of  Eq,  (2—4)  to  give 


(2-15) 


(2-16) 


^n  “  (Jn  Wln)  Jn)-1  W(n)  A*(n)  * 

Similarly,  the  final  expression  becomes 

K  ■  Xl>  *  (Jn  "(i)  Jn)”  “(i|  “(»| 

which  can  be  iterated  by  the  procedure  described  in  subsection  2.1.  While  the  iterative 
solution  for  this  expression  essentially  represents  the  standard  weighted  least  squares 
batch  filter,  a  few  more  definitions  are  required  before  the  final  algorithm  can  be  pre¬ 
sented. 

For  the  ballistic  trajectory  problem,  the  state  (parameter)  vector  may  be  expressed  in 
terms  of  an  earth-centered  inertial  Cartesian  coordinate  system 
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For  a  spherical  earth  the  exoatmospheric  trajectory  equations  of  motion  are 
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(2-18) 


where  G  is  the  gravitational  constant,  N  is  the  earth's  mass,  and  R  is  the  magnitude  of  the 
position  vector. 

The  first  order  Taylor  series  approximation  for  the  transition  matrix  $  defined  in  Eqs. 
(2-8),  (2-9),  and  (2-10)  is 
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whore  the  p(x(tR))  matrix  is  defined  by 
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(2-20) 


where  ere  the  derivative  functions  of  the  state  vector  (£  «  f(X)  defined  in  Eq.  (2-18). 
The  partial  derivatives  for  Eq.  (2-20)  ere  given  in  Appendix  A.“  A  more  accurate  method 
for  determining  the  transition  matrix  is  to  use  the  transition  matrix  determined  for  the 
previous  observation  time  in  determining  the  transition  matrix  for  the  current  observation 
time.  This  reduces  ths  time  interval  over  which  the  transition  matrix  must  b#  valid  and 
allows  for  evaluation  of  the  f  matrix  with  ths  updated  nominal  stats.  This  msthod  produces 
ths  following  set 
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(2-21) 


n+l-L,n+2-L 

By  this  method  the  magnitude  of  the  time  interval  is  restricted  to  the  time  between  observa¬ 
tions.  If  this  time  interval  is  still  too  large,  this  propagation  technique  can  be  further 
applied  by  subdividing  the  time  step  between  observations.  For  example,  if  there  are  m 
subdivided  time  steps  of  h  then  the  first  expression  in  Eq.  (2-21)  would  be 

*n_i,n  “  {*  +  hF(X(tn)j|  |l  +  hF(x(tn  -  h))|..^I  +  hF (x ( tn  -  (m-l)h)  }  .  (2-22) 

The  angle-only  observation  set  for  the  ballistic  trajectory  problem  can  be  defined  as 
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where  Rx,  Ry,  Rz  is  the  relative  position  vector  (target-sensor)  in  a  topographic  coordinate 
system  which  is  defined  in  Appendix  B.  The  first  order  Taylor  series  approximation  for  the 
measurement  matrix  An  defined  in  Eq.  (2-6)  is 
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X=X(tn) 


where  g^  are  the  functions  relating  observations  to  states  V  =>  g(X)  defined  in  Eq.  (2-23). 
The  remaining  set  An.. i < • • An_L  are  obtained  in  similar  fashion  using  X(tn_1) . . .X(tn_L) .  The 

partial  derivatives  in  Eq.  (2-24)  are  given  in  Appendix  B. 

The  weighting  or  measurement  covariance  matrix  W,  assumed  in  this  study,  is 
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where  on  and  o,  represent  the  one  sigma  uncertainties  in  the  uncorrelated  measurement  set 
(0.4).  0  ♦ 

One  final  steals  required  before  the  final  algorithm  can  be  presented.  Instead  of 


building  the  ent 
following  form 
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matrix  Jn  defined  in  Bq.  (2-13) ,  Eq.  (2-16)  may  be  rewritten  in  the 


iU 

1 

*n*  l 

2,  J»,tl 

2  Cl-i 

T-i 

I  i 

(2-26) 


where  Jn<l)  -  An»  Jn(2)  -  ViVl.n'-'  JntL+l)  "  An-L#n-L,«- 

All  definitions  have  now  been  given  for  the  standard  weighted  least  squares  batch  filter 
algorithm  which  is  presented  in  Figure  1  using  the  notation  defined  in  this  section.  This 
is  essentially  the  algorithm  described  by  Lincoln  Laboratory*  and  is  basically  the  Gauss 
weighted  least  squares  solution  defined  for  the  ballistic  trajectory  problem.  The  formula¬ 
tion  presented  in  Figure  1  requires  at  least  three  pairs  of  angle  observations  or  three 
cycles  of  the  messurement  loop.  For  this  formulation  the  nominal  state  vector  is  dtfined 
at  the  final  observation  point. 

In  this  algorithm  some  initial  state  guess  is  successivtly  correctad  in  an  attempt  to 
satisfy  ths  weighted  least  squares  criterion.  For  observations  at  times  t  ,t  , , . . . t  . 
this  quantity  is  n  n  1  aL 
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DEFINITIONS: 

I  -  IDENTITY  MATRIX 

A  ■  MEASUREMENT  MATRIX 

W  -  MEASUREMENT  COVARIANCE  MATRIX 

*  ■  TRANSITION  MATRIX 

S^1  -  STATE  COVARIANCE  MATRIX 

Y  -  MEASUREMENT  VECTOR 

V  •  ESTIMATE  OF  MEASUREMENT  VECTOR 
X  -  NOMINAL  STATE  VECTOR 

At  -  TIME  BETWEEN  MEASUREMENTS 


Fig.  1  Standard  weighted  least  squares  batch  filter  algorithm 

If  the  process  is  converging,  this  quantity  becomes  increasingly  smaller  at  a  decreasing 
rate  as  it  asymptotically  approaches  its  minimum  '-alue .  The  process  could  thus  be  terminated 
when  this  quantity  does  not  decrease  significantly  from  the  previous  iteration-  Another 
criterion  for  terminating  the  iterative  process  is  the  one  shown  in  Figure  1 


Note  that  in  this  quantity  the  measurement  residual  has  been  replaced  by  the  difference  in 
the  nominal  state  vectors  for  successive  iterations  and  the  measurement  covariance  by  the 
state  covariance.  If  the  process  is  converging,  this  quantity  approaches  zero.  Therefore, 
termination  of  the  iterative  process  would  be  based  on  the  magnitude  of  this  quantity  rather 
than  a  change  in  magnitude  as  in  the  previous  method.  In  either  case  the  cutoff  point  is 
established  at  the  level  at  which  tracking  performance  is  unaffected. 

Because  the  weighted  least  squares  batch  filter  is  a  batch  or  nonrecursive  process,  all 
of  the  observations  must  be  processed  each  time  a  new  measurement  pair  is  added  to  the 
observation  set.  In  addition  to  this  requirement,  all  the  measurements  of  the  observation 
set  must  be  processed  for  each  iteration  of  the  least  squares  process  as  indicated  in  Figure 
1.  Therefore,  termination  of  the  iterative  process  at  the  earliest  possible  point  is  essen¬ 
tial.  The  nonrecursive  characteristic  of  the  batch  filter  and  resulting  processing  require¬ 
ments  tend  to  restrict  this  algorithm  to  the  initialization  of  track  function.  For  the  ini¬ 
tialization  of  track  function,  the  resulting  weighted  least  squares  state  estimate  and  state 
covariance  are  projected  forward  to  the  next  observation  point  where  they  serve  as  the  ini¬ 
tial  state  and  covariance  for  a  Kalman  filter  algorithm.  The  weighted  least  squares  algo¬ 
rithm  can  also  be  applied  again  at  this  point  using  these  same  projected  quantities  as  ini¬ 

tial  conditions. 

In  Section  3  it  will  be  shown  that  this  algorithm  does  not  always  converge.  Therefore, 
a  series  of  tests  are  made  after  each  iteration  to  determine  if  the  process  is  converging. 
The  algorithm  is  determined  to  be  nonconvergent  if  any  of  the  following  tests  are  true: 

1)  Too  many  iterations  (15) 

2)  Position  magnitude  too  large  (1.5  *  107  m) 

3)  Velocity  magnitude  too  large  (1.0  *  101*  m/sec) 

4)  Altitude  too  low  (100  m) . 

The  values  listed  are  possible  limits  for  these  tests.  If  the  process  is  nonconvergent, 
then  the  pair  of  angle  measurements  for  the  next  observation  point  are  added  to  the  obser¬ 
vation  set  and  the  weighted  least  squares  algorithm  is  performed  again. 

Thus  far,  the  problem  of  obtaining  an  initial  state  guess  to  begin  the  iterative  least 
squares  proaess  has  not  been  addressed.  One  possible  method  for  obtaining  this  initial 
state  guess  is  the  one  described  by  Lincoln  Laboratory1  and  presented  in  Figure  2.  For 
briefness,  the  derivation  of  this  method  is  given  in  Appendix  C.  In  this  method  the  azimuth 
and  elevation  are  fit  with  a  second  order  polynomial  for  a  batch  of  angle  measurement  data 
(at  least  three  pairs) .  The  azimuth  and  elevation  along  with  their  first  and  second  deriva¬ 
tives  are  solved  for  with  the  polynomial  fit  at  the  time  of  the  last  observation.  These 
quantities  are  then  used  in  an  iterative  set  of  energy  and  geometry  equations  to  solve  for 
the  range  and  range  rate,  which  are  in  turn  used  to  estimate  the  initial  guess  of  the  state 
vector  for  the  weighted  least  squares  process. 

The  iterative  energy/geometry  equations  used  to  establish  the  range  and  range  rate 
require  an  initial  guess  of  the  range  and  an  a  priori  estimate  of  the  target  energy.  This 
set  of  equations  is  cycled  until  the  new  estimate  of  the  range  does  not  vary  from  the  pre¬ 
vious  estimate  by  more  than  sous  given  tolerance.  The  selection  of  energy  as  a  constraint 
for  the  initial  state  guess  is  made  because  of  its  relative  constancy  over  the  whole  trajec¬ 
tory  for  a  given  set  of  I CBN  threats  with  the  same  ground  range.  While  the  initial  state 
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Fig.  2  Initial  state  guess  algorithm 


guess  from  this  process  is  constrained  in  an  energy  sense,  the  final  state  estimate  from 
the  batch  filter  algorithm  (Figure  1)  satisfies  the  weighted  least  squares  criterion  and 
is  therfore  independent  from  the  initial  state  guess  and  the  a  priori  energy  estimate. 

If  this  initial  state  guess  algorithm  fails  to  converge,  then  the  pair  of  angle  measure¬ 
ments  for  the  next  observation  point  are  added  to  the  observation  set  and  the  process  is 
repeated. 


2.3  Marquardt  Weighted  Least  Squares 


The  Marquardt  weighted  least  squares  (MLS)  technique  is  derived  from  the  same  basic  least 
squares  solution  as  the  batch  filter  algorithm  described  in  subsection  2.2.  For  clarity 
this  expression  is  repeated  here 
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At  this  point  one  may  deviate  from  the  derivation  of  the  previous  section  by  solving  for  the 
Jacobian  matrix  [Eq.  (2-30)]  explicitly,  thus  eliminating  the  requirement  for  a  transition 
matrix  i  which  is  based  on  an  approximate  solution  [Eq.  (2-8)]  to  an  approximate  model  [Eq. 
(2-7)].  This  is  accomplished  by  a  different  interpretation  of  the  measurement  matrix  A  as 
defined  in  Eq.  (2-24)  and  Appendix  B.  Instead  of  taking  the  partial  derivatives  of  the 
functions  relating  observations  to  states  with  respect  to  each  state  for  some  instant  in 

tim®(1fxSl|x«X(tn),X(tn_i)  .  .  .X(tn_L))  '  Proceed  t0  tak«  the  Phials  of  the  measurement 

equations  at  times  t  ftn->j...tn-li  with  respect  to  each  state  at  time  tfl 


X-X(tn),  x(tn-1)  ...x(tn_L) 

This  requires  a  closed  form  solution  of  the  equations  of  motion  which  is  supplied  through 
f  and  g  series  equations  for  a  free  falling  body.  While  this  approach  requires  a  somewhat 
cumbersome  development  of  equations,  the  final  working  algorithm  will  be  shown  to  be  a  very 
precise  and  rapid  method  for  calculating  the  Jacobian  matrix  explicitly. 


For  simplicity  in  this  seotion  the  observation  set  is  dsfined  somewhat  differently  from 
the  previous  section 
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where 
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where  T  and  S  are  the  ECI  position  vectors  of  the  target  and  sensor  respectively.  Now  pro¬ 
ceed  to  take  the  partial  derivatives  of  the  measurement  equations  with  respect  to  each  state 
(xi,x2...xg)  as  specified  in  Eq.  (2-24)  keeping  in  mind  the  new  interpretation  of  the  mea¬ 
surement  matrix  A. 
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From  Eq.  (2-32)  continue  with 
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For  some  instant  in  time  the  solution  at  this  point  would  be  trivial!  =  lj 
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■  If  all  remaining  terms  are  zero.  This  would  be  the  desired  result  for  the  least 

squares  batch  filter  described  in  subsection  2.2.  For  the  general  case  of  interest  continue 
with  the  f  and  g  series  equations  for  a  free  falling  body 
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Continuing  with  Eq.  (2-36) 
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Xn  order  to  continue,  the  eighth  order  terse  for  f  end  g  ere  introduced.  But  first,  define 
the  terse  u«,  p«,  end  qt  for  simplicity. 
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where  u  is  the  product  of  the  gravitational  constant  and  mass  of  the  earth,  with  these 
terms  f  and  g  can  be  defined 
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Using  Eqs.  (2-38)  and  (2-39)  continue  with 
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Solving  first  for  the  partials  of  u»,  p« ,  qB  with  respect  to  the  states 
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And  finally  solving  for  tbs  partials  of  f  and  g  with  r aspect  to  utt  p» .  q. 
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This  complete*  th#  concept  of  explicit  Jacobian  which  la  a  fundaMntaliy  different  approach 
fro*  the  algorithm  inscribed  in  eubaection  2.2  where  the  eranaitlon  matrix  ia  required.  Xn 
the  next  section  the  extension  of  this  explicit  Jacobian  concept  to  th*  Kalman  filter  formu¬ 
lation  is  proposed. 

The  iterative  solution  sat hod  of  the  weighted  least  squares  formulation  (Eq.  (2-293)  can 
be  enhanced  by  the  method  of  Marquardt  matrix  conditioning* .  basically  this  method  involves 

the  addition  of  a  variable  factor  1  to  the  diagonal  terms  of  tha  stats  covariance  I J^h JQ) 

ia  order  to  improve  th*  Convergence  properties  of  the  iterative  solution  method 
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This  method  results  in  the  following  modification  of  Eq.  (2-29) 
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Note  that  in  order  to  include  the  factor  X,  a  normalizing/denormalizing  procedure  is 
required.  Using  the  symbol  *  to  indicate  normalized,  Eq.  (2-43)  can  be  represented  by  the 
following  set  of  equations  which  define  this  normalizing  procedure. 
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The  Marquardt  method  approaches  the  Gauss-Newton  method  (used  in  the  algorithm  described 
in  subsection  2.2)  as  X-0  and  the  method  of  steepest  descent  as  X-*®.  In  addition,  the  step 
size  increases  as  X-*-0  and  decreases  as  X-*®.  The  strategy  is  to  decrease  X  if  the  solution 
is  converging  and  to  increase  X  if  it  is  diverging.  This  method  thus  has  the  ability  to 
converge  from  a  distant  initial  guess,  and  also  the  ability  to  converge  rapidly  once  the 
vicinity  of  the  solution  is  reached. 


As  discussed  in  subsection  2.2,  one  can  avoid  building  the  entire  matrix  si  by  formula¬ 
ting  Eq.  (2-43)  with  its  algebraic  equivalent 


x„  -  x„  + 

n  n 


U1 


J»UI  “nil-1  V°) 


♦  XI 


(2-45) 


i-l 


where  Jn(l)  «■  Afl t  Jn(2)  *  A^jj  ...JnU.4-l)  -  An_^. 


With  the  techniques  of  explicit  Jacobian  and  Marquardt  matrix  conditioning  established, 
the  final  working  algorithm  is  now  presented  in  Figure  3.  The  algorithm  is  presented  in  the 
same  format  aa  the  standard  weighted  leaat  squares  batch  filter  of  subaection  2.2  (Figure  1) 
ao  that  the  two  m#thr4s  can  bn  compared  directly.  The  absence  of  a  transition  matrix  and 
the  use  of  Marquardt  matrix  conditioning  are  the  distinguishing  features  for  this  formula¬ 
tion.  The  use  of  the  closed  form  solution  for  the  equations  of  motion  (f  and  g  series) 
required  by  the  explicit  Jacobian  technique  is  alto  a  key  element.  Xiao  note  that  the  par¬ 
tial#  for  the  mease -ament  matrix  are  the  partial#  of  the  measurement  equations  at  times  t  , 
Svl’*“S*-L  respect  to  each  state  at  time  t^.  This  means  that  the  R^,  R^,  J»8  terms” 

in  Eqs.  (2-33)  »nd  (?-34)  are  evaluated  with  the  etate*  at  time  t„,  t_  ,  while  Eqa. 

n.>  n-i  n-L 

(2-37),  (2-18>  -nd  (2-41)  are  evaluated  with  the  state  at  Use  t  (X  1. 

II  11 


The  m.»thr;d  for  varying  the  Marquardt  factor  X  is  included  it?  Figure  2.  The  chi-squared 
(x  )  quantity  is  the  criterion  used  to  determine  if  the  procees  As  converging.  If  this 
quantity  increases  from  the  previous  iteration,  the  process  is  determined  to  be  diverging 
and  the  factor  X  la  increased  by  a  fact.v.  of  SO,  If  this  quantity  decreases  from  the  pre¬ 
vious  iteration,  the  process  is  determined  to  be  converging  end  the  factor  X  is  decreased 
by  a  factor  of  10.  An  initial  value  for  X  of  0.0001  la  used  in  this  algorithm.  Note  that 
in  ths  divergent  cate  the  process  is  looped  back  only  to  the  point  where  factor  i  is 
added  thus  using  the  tame  quantities  for  Sf  and  SJ. 


bike  the  batch  filter  of  the  previous  taction  the  Marquardt  leaat  squares  algorithm 
requires  at  least  three  pairs  of  angle  observations  (three  cycles  of  measurement  loop)  and 
defines  the  nominal  state  vector  at  the  final  observation  point.  The  Herqoardt  least 
squares  is  sfleo  a  batch  or  oontecuralve  process  so  that  ell  of  vhj  ohAvrvatiirne  cost  be  pro¬ 
cessed  eech  vims  a  new  measurement  pair  is  added  to  thr  ibservatsOh  act.  both  algorithm? 
produce  the  SAm*  weighted  least  Squares  state  estirq-'.-  t  <udc«sni>vely  correct inq  r,oae  ini¬ 
tial  eteto  guns*.  A  comparison  of  the  two  methods  euAld  ‘hssafoce  b<  aoncerneo  with  tin- 
stability  end  the  computation  requirements  of  eecr. 

The  Marquardt  least  squares  could  use  the  ena.*try  constrained  initial  state  gi.es*  algo¬ 
rithm  described  in  Figure  2  to  supply  the  initai  a»at*  guess  t*-  becin  the  Iterative  process. 
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DEFINITIONS: 

*  MONMADZIO  *• 

X2  -  CHI  SOllAJtt 
J  -  JACOMAM  MATRIX 
W  -  MCAUmikClMt  COVARIANCE  MATRIX 
•  STATE  COVARIANCE  MATRIX 
V  *  MEASUREMENT  VECTOR 

y  •  ESTIMATE  OF  MCACUAfUf  NT  VECTOR 

X  •  NOMINAL  STATS  VICTOR 

<4  •  lAXOfSERM*  VALUE* 

I  *  IDENTITY  MATRIX 

X  *  MARQUAROT  FACTOR 
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Fig.  3  Marquardt  weighted  least  squares  with  explicit  Jacobian 

A  less  expensive  and  simplified  method  can  also  be  used  since  the  Harquardt  least  squares 
process  is  very  stable  and  converges  for  a  very  wide  range  of  state  estimates.  Thia  simpli¬ 
fied  initial  state  guess  technique  is  described  as  follows. 


For  observation  times  of  tR,  tn_^,'. the  following  observation  set  may  be  defined 


(3-46) 


Using  the  first  (n-M  and  last  (n)  observation  points  the  approximate  angle  rates  arc 


•  *“.*  w 

•  •  **n  '  W  /(tn  *  Vl>  ' 

the  velocity  magnitude  for  the  target  can  be  approximated  by  the  square  root  of  the  sum  of 
Its  components  squared 

V  -  (4»  ♦  «ad» a  ♦  (*i)l)4/*  .  (3-46) 

Solving  for  the  range  rata  yields 

R  -  (V*  -  (hi)*  -  «$)  *)*’**  .  (2-49) 


Given  some  initial  quess  for  the  velocity  maenitude  V  and  range  (acquisition  range  Rf), 
the  range  rat*  R  may  be  determined  from  Eo.  (2-49).  The  ranee  at  the  final  observation 
point  may  now  be  determined  by 


Vl'  •  '*-*»• 

At  this  point  the  working  coordinate  system  of  Figure  4  (*',  f,  t')  is  introduced  ec- 
that  the  state  vector  in  this  new  system  can  be  computed  directly  from  the  quantities  »n, 


Fig.  4  Working  coordinate  system  (X',  Y",  Z') 

The  X,  Y,  Z  coordinate  system  in  Figure  4  is  the  observer  centered  Cartesian  coordinate  sys¬ 
tem  produced  from  the  d'  /ference  in  the  target  and  sensor  ECI  position  vectors  (X  -  Xs) . 

The  state  vector  for  the  tarqet  at  the  final  observation  point  in  the  working  coordinate 
system  can  now  be  computed  directly  as 
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(2-51) 


The  transformation  matrix  to  convert  this  state  vector  from  the  X',  Y',  Z“  system  to  the 
X,  Y,  Z  system  is 


cos  <t>  cos  0  -sin  6  -sin  $  cos  0 

cos  $  sin  0  cos  0  -sin  $  sin  0 

sin  $  0  cos  d> 


(2-52) 


The  desired  state  vector  at  the  final  observation  point  in  an  ECI  coordinate  system  can  be 
written  now 


X  =■  CX'  +  XS  .  (2-53) 

This  simplified  technique  could  be  used  as  the  primary  method  for  establishing  the  ini¬ 
tial  state  guess  for  the  Marquardt  least  squares  algorithm,  or  it  could  be  used  as  a  backup 
for  the  energy  constrained  iterative  algorithm  (Figure  2)  when  it  fails  to  converge. 


2.4  Energy  Constraint 

The  problem  of  state  estimation  of  ballistic  trajectories  with  angle  only  measurements 
becomes  difficult  when  the  observer  is  free-falling  and  more  difficult  if  the  observer  is 
then  located  in  the  plane  of  the  observed  trajectory.  This  section  presents  the  idea  of 
incorporating  an  energy  constraint  into  the  angle-only  tracking  algorithms  to  assist  in 
these  poor  observability  problems  or  to  enhance  the  solution  of  any  type  of  tracking  problem 
in  general.  The  selection  of  energy  as  a  constraint  is  mads  because  of  its  relative  con¬ 
stancy  over  the  whole  trajeotory  for  a  given  set  of  ICBN  threats  with  the  same  ground  range. 
The  following  energy  constraint  method  can  be  incorporated  into  either  of  the  weighted  least 
squares  algorithms  discussed  in  subsection  2.2  or  subsection  2.3, 

The  total  energy  (per  unit  mass)  of  a  free-falling  body  can  be  expressed  as  the  sum  of 
the  kinetic  and  potential  energy 

E-KE+PE-^V*-^  (2-54) 

where  V  is  the  velocity  magnitude,  R  is  the  magnitude  of  the  position  vector  (earth  centered 
system),  and  u  is  the  product  of  the  gravitational  constant  and  the  earth's  mass.  The  total 
energy  defined  in  Eq.  (2-54)  is  incorporated  into  the  angle  only  tracking  algorithm  by 
assuming  some  a  priori  knowledge  of  the  expected  magnitude  of  this  quantity  and  its  asso¬ 
ciated  uncertainty.  This  expeoted  or  mean  energy  magnitude  can  be  considered  as  a  pseudo 
measurement  thus  expanding  the  measurement  set  of  the  Narquardt  least  squares  for  exaap’s  to 
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In  other  words  the  a  priori  mean  energy  magnitude  serves  as  the  measured  energy  while  the 
estimated  or  calculated  energy  is  determined  from  Eq.  (2-54)  using  the  estimated  state  vec¬ 
tor.  Because  the  total  energy  is  constant  over  the  whole  trajectory  its  contribution  as  a 
pseudo  measurement  is  utilized  only  once  at  a  single  measurement  point. 


Reformulating  the  total  energy  in  terms  of  the  target  ECI  state  vector 


E  = 


xi  +  x!  +  x| 


(xi  +  x!  +  xi ) 1/4 


(2-56) 


the  required  partials  for  the  Jacobian  matrix  (subsection  2.3)  or  measurement  matrix  (sub¬ 
section  2.2)  are  obtained 


3E 
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'i/J 


i  (xi  +  xi  +  xi) 

i  »  4,  5,  6 


i  =  1,  2,  3 


(2-57) 
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The  measurement  covariance  matrix  is  alao  expanded  to  accommodate  the  expanded  measurement 
set 


M  i.: 


u6  “ 
0  a] 


(2-58) 


The  uncertainty  in  the  energy  could  be  input  directly  or  computed  from  a  uniform  distribu¬ 
tion  given  some  maximum  and  minimum  energy  values 


,  (Emax  -  Smln>  * 
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(2-59) 


Because  the  energy  constraint  concept  is  based  on  some  a  priori  energy  estimate,  the 
resulting  least  squares  state  estimate  will  be  biased  if  this  assvmed  a  priori  energy  is 
different  from  the  actual  energy.  Furthermore,  the  bias  will  increase  as  this  difference 
increases  and  as  the  uncertainty  in  this  pseudo  measurement  is  decreased.  An  analysis  and 
possible  cure  for  the  energy  constraint  bias  problem  is  given  in  subsection  3.2. 


2.5  Kalman  Filter  Application  of  Explicit  Jacobian 

Unlike  the  batch  least  squares  process  of  subsections  2.2  and  2.3  the  Kalman  filter  is 
a  recursive  minimum  variance  filter.  Using  two  angle  measurements  at  each  observation 
pent,  this  algorithm  determines  the  current  target  state  estimate  such  that  the  state 
covariance  is  minimized.  Thus,  the  state  estimate  is  conditioned  on  all  measurements  made 
up  to  that  time.  The  Kalman  filter  is  recursive  in  the  sense  that  only  the  current  measure¬ 
ment  need  be  processed. 


The  minimum  variance  estimate  equations  can  be  arrived  at  througH  the  weighted  least 
squares  concept,  so  the  minimum  variance  estimate  is  also  the  weighced  least  squares  esti¬ 
mate,  and  the  Kalman  filter  can  be  said  to  be  a  recursive  form  of  the  batch  filter.  The 
Kalman  filter  uses  the  aane  linearization  approximations  as  the  batch  filter  of  subsection 
2.2  for  the  measurement  and  transition  matrices.  However,  due  to  its  recursive  nature, 
errors  introduced  through  these  approximations  can  build  up  resulting  in  a  tracking  perform¬ 
ance  which  tends  to  deviate  from  tne  least  squaros  batch  filter  results.  The  improved 
Kalcan  filter  formulation  presented  in  this  section  will  be  shown  to  match  the  batch  least 
•quires  results. 

using  the  notation  devaloped  in  the  previous  sections  the  linearized  model  matrix  equa¬ 
tion  for  the  recursive  problem  is 


&Xn-l  + 


*n  *n,n-l 


(2-60) 


where 


AY  «  deviations  of  the  observations  (y)  from  the  nominal  or  calculated 
n  set  (#)  at  time  t 

A  =  matrix  which  transforms  the  state  at  time  t  to  equivalent  obser¬ 

vation  parameters 

4  _  ,  =  transition  matrix  which  transforms  the  state  at  time  t  .  to  time  t 
n,n-l  n-l  n 


AX, 


n-l 


deviations  from  the  estimate  of  the  state  at  time  t  ,  which  cor¬ 
respond  to  AYn 

observation  errors. 


The  well  known  Kalman  filter  solution  to  this  problem  is  presented  in  Figure  5.  As  dis¬ 
cussed  in  subsection  2.2  and  repeated  here  for  clarity  the  transition  matrix  4  in  the  usual 
approach  is  determined  by  solving  the  "linear"  differential  equation  model 


A  AX (t)  =  F  (X(t))  AX (t) 


dt 


(2-61) 


An  “approximate"'  solution  is  given  by 

AX(t  +  At)  =  AX (t)  +  At  F  (X(t))  AX (t) 


so  that 
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As  discussed  in  subsection  2.2  the  accuracy  of  $  can  be  improved  by  breaking  At  up  into 
At 

equal  fractions  h  »  —  and  using  the  equation 


4(t  +  kh,t)  »  I  +  hF  +  (k  -  l)h^  $ ^(t +  (k  -  l)h,t^  ,  k 


1,2,3. ..m  .  (2-67) 
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Pig.  5  Kalman  filter  algorithm 


For  the  Kalman  filter  formulation  Eqs.  (2-37),  (2-38),  and  (2-41) Aare  evaluated  with  X 

while  R  ,  R  ,  R  terms  of  Eqs.  (2-33)  and  (2-34)  are  evaluated  with  X„.  Note  that  for  n_J 
A  j  *•  ^  n 

this  formulation  the  state  estimate  Xn_  ^  must  be  projected  forward  to  t  each  time  an 
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Fig.  7  Jacobian-Kalman  algorithm  (final) 

iteration  is  performed.  This  is  accomplished  with  the  f  and  g  series  [Eqs.  (2-36)  and 
(2-39)1. 

3 .  PERFORMANCE  EVALUATION 

3.1  Test  Case  Description 

Five  test  cases  were  used  to  demonstrate  the  performance  of  the  tracking  algorithms  pre¬ 
sented  in  this  report.  These  cases  represent  different  degrees  of  difficulty  in  terms  of 
relative  engagement  geometry.  In  each  case  both  the  sensor  and  target  are  on  ballistic  tra¬ 
jectories.  For  cases  3,  4,  and  5  the  sensor  states  are  identical.  The  initial  ECT  state 
vectors  for  these  test  cases  are  given  in  Table  I.  Some  of  the  initial  characteristic  tra¬ 
jectory  parameters  which  further  describe  these  five  test  cases  are  given  in  Table  II, 

3 . 2  Case  1 

In  terms  of  tracking  performance  this  case  provides  the  best  results  of  the  five  test 
cases.  The  aspect  angle  in  Table  II  confirms  the  favorable  tracking  geometry  provided  with 
this  case.  The  Marquardt  least  squares  (MLS)  results  for  this  case  are  shown  in  Figure  8 
for  both  the  constrained  and  unconstrained  mode.  The  results  are  for  50  Monte  Carlo  trials 
and  assume  an  angle  measurement  accuracy  of  20  urad  (lc) .  A  frame  time  of  10  sec  was  used 
so  the  initial  results  shown  at  20  sec  track  time  are  for  three  measurements.  For  the 
energy  constrained  mode,  three  sets  of  results  are  shown  for  assumed  one  sigma  energy  uncer¬ 
tainties  of  10,  20,  and  40  percent  ^  energy'1'  x  10°)  *  Note  that  a11  constrained  cases 

approach  the  unconstrained  resultB. 

The  standard  least  squares  batch  filter  results  for  this  same  case  are  shown  in  Figure  9. 
As  expected,  these  agree  with  the  MLS  results  of  Figure  8  since  both  algorithms  provide  the 
same  least  squares  solution  to  the  problem.  The  somewhat  erratic  behavior  of  the  energy 
constrained  performance  appears  to  be  case  dependent  since  almost  identical  results  were 
obtained  for  different  sets  of  measurements  simulated  with  different  random  numbor  sequences. 
Note  that  the  standard  least  squares  batch  filter  experienced  convergence  problems  for  the 
unconstrained  mode  when  less  than  five  measurements  were  utilised  while  the  MLS  algorithm 
did  not. 

In  Figure  10  an  attempt  is  made  to  give  some  indication  of  the  computational  speed  of  the 
two  least  square  algorithms.  The  algorithms  were  made  equal  in  terms  of  state  projection 
methods  and  criterion  used  for  termination  of  the  iterative  process.  The  performance  is 
given  in  terms  of  CDC  7600  CPU  run  time  per  Monte  Carlo  trial.  No  attempt  was  made  to  enu¬ 
merate  the  number  of  computational  operations  for  the  two  algorithms.  While  this  is  a  rough 
computational  comparison,  the  results  shown  in  Figure  10  indicate  promise  and  warrant  fur¬ 
ther  investigation  for  what  appears  to  be  a  faster  MLS  algorithm. 

The  bias  error  introduced  through  the  a  priori  estimate  of  the  target  energy  in  the  energy 
constrained  mode  was  also  investigated  for  this  case.  In  this  analysis  the  effect  of  the 
error  in  the  a  priori  estimated  energy  on  the  resulting  bias  was  examined.  The  results  of 
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Table  II 

Test  Cases  -  Initial  Characteristic  Parameters 
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Fig,  9  Standard  least  squares  batch  filter  -  Case  1 


HUMS*  Of  MtaSUMMtttTt 
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this  analysis  are  shown  In  Figure  11  for  a  track  time  of  ISO  sec  and  for  assuaed  one  si^aa 
energy  uncertainties  of  10  and  20  percent.  As  an  example,  if  the  a  priori  energy  aeasureaent 
was  20  percent  in  error  and  a  one  sigaa  energy  uncertainty  of  10  percent  was  used  than  the 
bias  on  the  range  estiaate  would  be  30  km.  One  must  therefore  balance  the  benefits  gained 
through  the  use  of  smaller  energy  uncertainties  against  the  resulting  increase  in  bias  fur 
incorrect  a  priori  energy  measurements. 

In  order  to  eliminate  the  bias  problem  and  the  high  processing  requirements  of  the  con¬ 
strained  batch  process,  the  ideal  approach  is  to  initialise  track  with  the  constrained  least 
squares  batch  algorithm  and  than  to  handover  to  the  unconstrained  Kalman  filter  for  the  con¬ 
tinuous  track  function.  The  least  squares  batch  algorithm  would  operate  in  the  energy  con¬ 
strained  mode  to  improve  performance  and  guarantee  convergence  of  the  unconstrained 
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Fig.  11  Bias  error  due  to  incorrect  assumed  energy 

filter.  Using  this  method,  any  bias  error  introduced  through  the  energy  constrained  least 
squares  batch  process  would  be  removed  by  the  unconstrained  Kalman  filter.  This  method  is 
especially  attractive  if  the  Kalman  filter  performance  matches  the  least  squares  batch 
results  which  is  the  case  for  Jacobian-Kalman  filter  formulation. 

The  Jacobian-Kalman  filter  results  for  Case  1  are  presented  in  Figure  12.  A  five  mea¬ 
surement  constrained  MLS  was  used  to  initialize  the  Jacobian-Kalman  filter.  Three  sets  of 
results  are  shown  for  a  5,  10,  and  20  percent  one  sigma  energy  uncertainty  for  initializa¬ 
tion.  Also  indicated  are  the  MLS  results  from  Figure  8  to  show  how  the  Jacobian-Kalman 
matches  the  least  squares  batch  performance  when  initialized  with  an  energy  constrained  MLS 
algorithm. 
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Fig.  12  Jacobian-Kalman  -  Caae  1 

In  order  to  damonatrata  bow  tba  unconstrained  Kalman  filter  removes  the  bias  error  intro 
due ad  through  an  incorrect  a  priori  energy  estimate  in  tba  constrained  least  squares  batch 
process  the  following  test  mi  mads  with  Csss  2.  A  bias  error  was  introduced  by  assuming 
an  a  priori  energy  estimate  for  tbs  five  measurement  least  squares  initialisation  algorithm 
which  was  10  percent  in  error,  tbs  resulting  bias  error  after  initialization  (time  »  40 


sec)  and  after  each  subsequent  measurement  processed  by  the  Jacobian-Kalman  filter  is  pre¬ 
sented  in  Figure  13. 


Fig.  13  Jacobian-Xalman  bias  -  Case  1 


3.3  Case  2 

This  is  a  more  difficult  case  than  the  previous  one  due  to  its  near  in-plane  geometry. 
The  HU  results  for  this  case  are  shown  in  Figure  14  for  both  the  constrained  and  uncon¬ 
strained  mode.  The  standard  least  squares  batch  filter  results  for  this  same  case  are 
given  in  Figure  15  for  comparison.  Notice  that  the  unconstrained  and  40  percent  energy 
uncertainty  constrained  oases  were  omitted  for  the  standard  least  squares  batch  filter  due 
to  convergence  problems.  Also  note  that  in  order  to  ensure  convergence  for  this  problem 
beyond  100  sec  of  track,  this  algorithm  must  begin  track  at  10  measurements  or  less.  These 
problems  were  not  experienced  with  the  MU  algorithm. 

The  Jacobian-Kalman  filter  results  for  Case  2  are  presents  in  Figure  16.  A  five  mea¬ 
surement  MLS  algorithm  with  energy  constraints  was  used  for  initialisation.  Also  indicated 
are  the  MU  results  from  Figure  14  for  comparison. 


3.4  Case  3,  Case  4,  and  Case  3 

These  three  cases  which  are  all  very  similar  in  geometry  proved  to  be  the  most  difficult. 
The  aspect  angles  in  Table  3-2  indicate  a  relative  geometry  very  near  to  in-plane.  The 
unconstrained  MU  results  for  these  three  cases  are  given  in  Figure  17.  Although  no  results 
are  presented  for  the  standard  least  squares  batch  filter,  convergence  problems  were  again 
experienced  for  both  the  constrained  and  unconstrained  mode  using  this  algorithm. 

The  Jacobian-Kalman  filter  results  for  these  three  cases  are  presented  in  Figure  18.  A 
five  measurement  MU  algorithm  with  energy  constraints  was  usad  for  initialisation.  Thasa 
results  match  the  MU  performance  shown  in  Figure  17, 


4.  CONCLUSIONS 

Based  on  the  performance  results  of  Ssctlon  3,  ths  HU  algorithm  is  s  very  stable  end 
cosiputationeily  fast  algorithm.  The  MLS  algorithm  has  been  shown  to  bs  s  superior  algorithm 
in  terms  of  stability  and  computational  requlramants  whan  cos4>ared  to  axlsting  least  squares 
batch  algorithms  which  use  both  a  msasursmsnt  end  transition  matrix.  This  algorithm  has 
never  experienced  convergence  difficulties  against  even  the  most  difficult  of  relative 
engagement  geometries.  In  addition  this  algorithm  has  been  shown  to  converge  for  a  very 
wide  rsnge  of  initial  stats  guesses  U50  percent  range  guess) .  Ths  use  of  the  explicit 
Jacobian  method  provides  for  higher  accuracy  and  speed  by  eliminating  the  need  for  tbs 
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Fig.  18  Jacobian-Kalman  -  Cases  3,  4,  and  S 

transition  matrix  used  in  the  usual  approach.  This  improved  accuracy  along  with  the  tech¬ 
nique  of  Marquardt  matrix  conditioning  contribute  to  the  enhanced  stability  of  the  algo¬ 
rithm.  When  used  in  an  energy  constrained  mode  this  algorithm  serves  as  an  ideal  initiali¬ 
sation  technique  for  the  Jacobian-Kalman  filter  algorithm. 

Dias  errors  are  introduced  in  the  energy  constrained  mode  of  the  KL3  algorithm  when  the 
a  priori  estimated  energy  measurement  is  in  error.  This  bias  becomes  greater  as  the  uncer¬ 
tainty  in  the  energy  measurement  is  decreased.  This  problem  can  be  eliminated  by  initial¬ 
ising  track  in  the  constrained  mode  and  then  handing  over  to  the  unconstrained  Jacobian- 
Kalman  filter  algorithm  for  the  continuous  track  function.  By  this  method,  any  bias  error 
introduced  in  the  energy  constrained  initialisation  process  is  removed  in  the  unconstrained 
Kalman  filter.  This  approach  is  especially  attractive  since  the  Jacobi an-Kalman  filter  per¬ 
formance  matches  the  least  squares  batch  results. 

The  application  of  the  explicit  Jacobian  technique  to  the  Kalman  filter  formulation  has 
produced  a  recursive  tracking  algorithm  whose  performance  has  been  shown  to  match  that  of 
the  least  squares  batch  process.  This  Is  an  improvement  over  existing  Hainan  filter  algo¬ 
rithms  which  require  both  a  measurement  and  transition  matrix  which  results  in  a  tracking 
performance  that  tenda  to  deviate  from  the  least  squares  batch  performance  bound.  For  the 
test  esses  described  In  Section  3,  the  tracking  performance  U«  range  error)  for  the 
Jacobian-Kalman  filter  algorithm  was  consistently  30  to  SO  percent  better  than  the  per¬ 
formance  of  a  tracking  algorithm  which  used  this  latter  type  of  Kalman  filter  formulation. 
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APPENDIX  A.  TRANSITION  EQUATION 


State  vector 


Equations  of  notion 


APPENDIX  B.  MEASUREMENT  ERUATTON 


EARTH  CENTERED  INERTIAL  (ECO  SENSOR  CENTERED  TOPOGRAPHIC  ttCT) 


APPENDIX  C.  DERIVATION  OP  INITIAL  STATE  GUESS  ALGORITHM 


Relative  State  Equations 

x  =  R  sin  0  cos  $ 
y  =  R  cos  0  cos  <t> 
z  =  R  sin  $ 

x  “  R  sin  0  cos  |  -  R  |  sin  0  sin  $  +  R  6  cos  0  cos  $ 
y  =  R  cos  0  cos  <f>  -  R  $  cos  0  sin  $  -  R  §  cos  0  sin  $ 
z  =  R  sin  $  +  K  |  cos  4> 

Dividing  x,  y,  z  by  R 
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Velocity  Vector  Definitions 

VT  =  Target  velocity  vector  in  SCT  system 
ys  =  Sensor  velocity  vector  in  SCT  system 
VR  =  Relative  velocity  vector  (TARGET -SENSOR) 
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Target  velocity  magnitude  squared 
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(xs  +  aiR) *  +  (yg  +  aaR) *  +  (tg  +  a»R)1 
lyj.12  »  R*  (a?  +  al  +  al)  +  R(2xgai  +  2ysaj  +  2zgaj)  +  (x|  +  yg  +  ig) 


Energy  equation  per  unit  mass 

®T  “  Ekinotic  +  Epotential  "  2 


'C-3> 


(C-4) 


where  RT  •>  distance  from  earth  center  to  target  and  u  ■  gravitational  constant  *  earth's 
mass. 


Combining  Eqs.  (C-3)  and  (C-4) 

R*(af  +  af  +  aj)  +  R(2xgai  +  2ysaa  +  2iga,)  +  (it*  ♦  y*  ♦  -  28^,  -  -  0 


(C-5) 
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Quadratic  solution  of  R 
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Prom  geometry 


R^,  -  (RJ  +  R|  +  2RRg  sin  4)  * 


(C-7) 


For  a  free-falling  Body 


|  =  $  tan  4»  -  -r 
R  20 


(C-9) 


1)  Calculate  ^  [Eq.  (C-8) ] 


2) 

Calculate 

aif 

a2,  a}  [Eq.  (C-2)  ] 

3) 

Calculate 

a, 

b  [Eq.  (C-5)  ] 

4) 

Calculate 

[F.q.  (C-7M 

5) 

Calculate 

c 

[Eq.  (C-5)) 

6) 

Calculate 

R 

[Eq.  (C-6)  ] 

7) 

Repeat  4, 

5, 

6  until  R  converges 

8) 

Calculate 

ft 

[Eq.  (C-8) ) 

9) 

Calculate 

x, 

y,  z,  x,  y,  z  [Eq. 

[Eq.  (C-l)] 
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1.  INTRODUCTION 


In  target  tracking:  First,  a  motion  model  which  describes  the  motion  of  the  target  tracked  as 
accurately  as  possible,  and  an  observation  model  are  obtained.  In  a  clear  environment  (i.e.,  there 
does  not  exist  any  interference  such  as  jamming  or  clutter);  these  models  are  in  general,  discrete  and 
linear  with  respect  to  the  disturbance  and  observation  noises.  Moreover  the  observation  noise  is  an 
additive  Gaussian  noise  where  the  Gaussian  assumption  is  due  to  the  Central  Limit  Theorem.  Next, 
using  one  of  the  estimation  algorithms  already  developed  in  the  literature®-13  (e.g.,  the  (extended) 
Kalman  filter  algorithm),  the  target  states  are  estimated. 


f 

i 


I 


If  either  the  motion  or  observation  model  is  nonlinear  (in  general,  this  is  the  case),  then  the 
optimum  solution  to  this  estimation  problem  can  not  be  given  due  to  nonlinear  functions  in  the 
models.  However,  a  suboptimal  solution  is  given  by  using  a  nonlinear  estimation  algorithm.  By  a 
nonlinear  estimation  algorithm  (e.g.,  the  extended  Kalman  filter  algorithm),  the  states  are  estimated 
as  follows:  First,  using  a  Taylor  series  expansion,  the  nonlinear  functions  are  linearized  around 
some  points  so  that  the  nonlinear  estimation  problem  is  reduced  to  a  linear  one.  Then  using  a  known 
(usually)  solution  to  this  linear  estimation  problem,  the  states  are  estimated.  Hence,  if  the  nonlinear 
functions  are  not  smooth  enough  for  a  Taylor  series  expansion,  then  the  nonlinear  estimation  algorithm 
may  produce  estimates  which  are  much  different  from  the  actual  values  of  the  states.  Therefore,  some 
estimation  algorithms  for  an  estimation  problem  with  nonlinear  functions  which  are  not  smooth  enough 
for  a  Taylor  series  expansion  are  needed. 


If  both  the  motion  and  observation  models  are  linear,  we  have  a  problem  of  state  estimation  in 
a  linear  discrete  time  system.  The  solution  to  this  problem  has  been  treated  extensively  in  the 
literature.®-13 


;  In  the  tracking  of  a  target  in  the  presence  of  interference  (such  as  jamming  or  clutter  which 

i  is,  in  general,  not  Gaussian  noise),  an  observation  model  with  only  additive  observation  noise  may 

|  not  be  used  since  it  does  not  account  for  the  possibility  of  utilizing  measurements  originating 

!  simultaneously  from  the  interference  source  and  the  target.  Still,  if  a  classical  estimation  algorithm 

!  were  used  to  track  the  target  by  using  an  observation  model  with  only  observation  noise,  the  estimates 

|  of  the  target  states  may  diverge  from  the  actual  values.  Hence,  some  estimation  algorithms  are  needed 

for  discrete  models  with  arbitrary  random  interference  as  well  as  an  observation  noise. 

j 

This  chapter  treats  the  problem  of  state  estimation  for  discrete  odels  with  or  without  interference. 
As  a  result,  three  new  smoothing  algorithms  are  presented  so  that  the  need  mentioned  above  is  fulfilled. 
The  main  idea  for  these  smoothing  algorithms  is  that  of  quantizing  the  states  of  the  models  to  a  finite 
set  of  states.  This  (approach)  reduces  the  smoothing  problem  to  a  multiple  (composite)  hypothesis 
testing  problem.  Further,  using  three  decoding  techniques  of  information  theory,  the  smoothing 
algorithms  are  developed.  The  first  smoothing  algorithm  is  referred  to  as  Optimum  Decoding  Based 
Smoothing  Algorithm,  which  uses  the  Viterbi  decoding  algorithm.  The  second  smoothing  algorithm  is 
referred  to  as  Stack  Sequential  Decoding  Based  Smoothing  Algorithm,  which  uses  a  stack  sequential 
decoding  algorithm.  The  third  one  is  referred  to  as  Suboptimua  Decoding  Based  Smoothing  Algorithm, 
which  uses  a  suboptimum  decoding  algorithm. 


2.  SMOOTHING  ALGORITHMS 


F 


2.1  Models  and  Assumptions 


Through  this  section,  we  deal  with  the  following  discrete  models 
x(k+l)  *  f (k,  x(k) ,  u(k) ,  w(k))  (Motion  Modal) 

(2.1.1) 

z(k)  »  g(k,  x(k),  v(k))  (Observation  Model) 

for  target  tracking  in  a  clear  environment.  In  the  presence  of  interference  we  deal  with  the  following 
models , 


where 


x(k+l)  »  f(k,  x(k),  u(k),  w(k>)  (Motion  Model) 

(2.1.2) 

z(k)  “  g (k.,  x(k),  I(k),  v(k))  (Observation  Model) 

x(0)  is  an  n  x  1  initial  (target)  state  random  vector 

x(k)  is  an  n  x  1  (target)  state  vector  at  time  k 

u(k)  is  a  q  x  1  known  pilot-command  vector  at  time  k 

w(k)  is  a  p  x  1  disturbance-noise  vector  at  time  k  with  zero  mean  and  known  statistics 
v(k)  is  an  i  x  1  observation-noise  vector  at  time  k  with  zero  mean  and  known  statistics 
z(k)  is  an  r  x  1  observation  vector  at  time  k 

tHA  J~  —  -  -  1  . .  “Oh  tnnun 


Time  k  is  time  t0  +  kTQ  where  t0  and  T0  are  the  initial  time  and  the  observation  interval, 
respectively. 

f  (k,x(k)  ,u(k)  ,v(k)) ,  g(k,x(k),I(k),v(k)),  and  g(k,x(k) ,v(k))  are  (linear  or  nonlinear)  vectors 
with  appropriate  dimensions. 

Furthermore,  the  random  vectors  x(0),  w(j),  w(k),  v(2),  v(m),  I(n),  and  I(p)  are  assumed  to  be  indepen¬ 
dent  for  all  j,  k,  2,  m,  n,  p. 

2.2  Quantization  of  States  and  Transition  Probabilities 

This  section  describes  a  kind  of  quantization  for  target  states  and  some  difficulties  in  calculating 
transition  probabilities  between  quantization  levels. 

Let  us  consider  the  state  x(k).  It  is  a  random  vector  whose  range  is  in  the  Space  Rn  (n-dimensional 
Euclidean  space).  Let  us  divide  Rn  into  nonoverlapping  subspaces,  Rj'a,  and  assign  a  unique  value 
to  each  subspace  R"  where  subscript  q  stands  for  quantization. 

Definition  2.2.1  A  function  x  (•)  A  Q{x(» ) }  is  a  quantizer  for  the  state  x(0  if  the  following 
hold  q 


(a) 

(b) 


x  (•)  A  Q{x(*)}  »  x  ,  whenever  x(0  e  k". 
q  -  qi  i 

is  unique  for  each  r”. 


Definition  2.2.2  x-(")  is  the  quantized  state  (vector)  at  time  •,  and  its  possible  values  are 

called  the  quantization  levels  of  the  state  x(0. 

Definition  2.2.3  Subspace  R°  is  sometimes  called  Gate  (or  Cell)  r". 


Definition  2.2.4  x  ^  is  the  quantization  level  for  Gate  (Cell)  R^. 


Quantization  means  that  whenever  a  random  state  vector  x(-)  falls  within  a  giver,  subspace,  say  R?,  the 
state  x( ■)  is  quantized  to  the  unique  value  xqi  (see  Figure  2.2.1),  Let  us  now  define  the  transition 
probabilities,  which  govern  the  target  motion  within  the  gates. 


Figure  2.2,1.  Quantization  and  Transition  Probabilities 


Definition  2.2.5  The  transition  probability  tjm(k)  is  the  probability  that  the  state  x(ktl) 
will  lie  in  the  gate  r£  when  the  state  x(k)  is  in  the  gate  Rj,  i.e., 

n,  (k)  A  Prob{x(k+l)  e  R"|x(k)  t  r")  .  (2.2.1) 

jw  m'  j 

By  definition,  the  conditional  probability  it,  (k)  can  be  rewritten  as 


Prob{x(k+l)  e  Rn,  x(k)  e  R*1} 

*1m(k>  - 5 - - - L 

J  Prob{x(k)  e  r”} 

*  7 - - -  [|  j  p(x(k+l),  x(k))  dx(k+l)  dx(k) ] 

.  p<*(k))  dx(k)  Jr"  Jr" 

JRn  J  a 


7 - ± -  {  jL  p(x(k+l)  |x(k))  dx(k+X) ]  p(x(k))  dx(k)}  (2.2.2) 

n  p(x(k))  dx(k)  jr"  jr" 
j  m 
Rj 


where 

p(x(k+l),  x(k))  is  the  joint  probability  density  function  of  x(k+l)  and  x(k) 
p(x(k))  is  the  probability  density  function  of  x(k) 

p(x(k+l) |x(k))  is  the  conditional  probability  density  function  of  x(k+l)  given  x(k). 


It  is  not  usually  easy  to  evaluate  the  transition  probability  iTjm(k)  analytically.  The  difficulties 
are  due  to  the  shapes  of  the  gates  (R?  and  R*}) ,  the  statistics  of  the  disturbance  noise  vectors 
(w(*)'s),  and  the  initial  state  vector  x(0) .  In  order  to  see  this,  consider  the  following  linear 
motion  example 


x(k+l)  »  Ax(k)  +  v(k) 


(2.2.3) 


where 


x(0)  is  an  n  x  1  Gaussian  initial  state  vector 

x(k)  is  an  n  x  1  state  vector  at  time  k 

w(k)  is  an  n  x  1  Gaussian  disturbance  vector  at  time  k 

A  is  a  constant  transition  matrix  with  appropriate  dimension. 


Moreover,  the  random  vectors  x(0),  w(k),  w (l)  are  assumed  to  be  statistically  independent  for  all  k, 
l.  Hence  x(k+l)  and  x(k)  are  linear  transformations  of  the  Gaussian  random  vectors  x(0),  w(0) ,  w(l), 
...,  and  w(k) .  Thus,  p(x(k))  and  p(x(k+l) |x(k) )  are  normal  density  functions.  Therefore  the  evaluation 
of  the  probability 


p{x(k+l)  £  R^|x(k)  e  Rj} 

is  not  analytically  possible.  The  problem  is  more  difficult  if  the  motion  model  is  not  linear.  If 
the  transition  probability  Tjm(k)  needs  to  be  calculated,  it  should  be  performed  numerically.  Even 
this  may  be  difficult.  In  other  words,  the  evaluation  of  the  exact  transition  probabilities  between 
gates  is  not  practical.  Therefore,  the  next  section  discusses  an  approximate  target  motion  model 
obtained  by  approximating  the  disturbance  noise  vector  w(k)  and  the  initial  state  vector  x(0)  by 
discrete  random  vectors  (see  Appendix  B) ,  and  by  quantizing  the  state  x(k),  as  described  above,  for 
all  k  «  1,  2 .  For  this  finite  state  modal,  the  transition  probabilities  can  be  calculated  easily, 

2.3  A  Finite  State  Model  for  the  Target  Motion 

Throughout  this  section,  gates  are  assumed  to  be  generalized  rectangles  such  that  the  zero 
vector  0  (origin)  is  located  in  the  center  of  a  generalized  rectangle,  say  R§  (see  Figure  2.3.1). 


Figure  2.3.1.  Quantization  with  Generalized  Rectangles 


Let  the  lengths  of  the  sides  of  a  generalized  rectangle,  say  Rj,  be  gjj,  g^.  . ..,  gin.  These  lengths 
are  said  to  be  the  sizes  of  Gate  Rj.  Moreover,  the  quantization  levels  for  gates  are  assumed  to  be 
the  center  points  of  the  gates,  namely 

xq(‘)  £Q{x(*)}  *  xqi  if  x(-)  e  R^  (2.3.1) 

where  is  the  center  of  the  generalized  rectangle  (gate)  R^. 

Let  us  now  define  the  finite  state  model  which  approximates  the  target  motion  model.  The  flow 
chart  of  this  finite  state  model  is  in  Figure  2.3.2.  For  each  k,  the  disturbance  noise  vector  w(k) 


Figure  2.3.2.  The  Flow  Chart  of  the  Finite  State  Model 


Is  approximated  by  a  discrete  random  vector  Vj(k)  whose  possible  values  are  vgjtk),  VjjOO, 
the  corresponding  probabilities  are  pdl(k),  pd2(k),  •••»  Pj.  00*  l.e.* 


Wdok 


(k) 


Probtw^k)  «  v^lk))  •  pdl(k) 


1  »  1,  2,  .... 


where  m^  Is  a  positive  Integer,  and  the  subacrlpt  d  stands  for  discrete  (tee  Appendix  8).  Also  the 
Initial  stato  vector  x(0)  la  approximated  by  a  discrete  random  vector  x, (0)  whose  possible  values  are 
x^tO),  x,j2(0)»  •*•*  *^(0),  cllB  corresponding  probabilities  are  p^  <°>*  *»U2<0) . Pdn0(0>*  l>e-‘ 


Probix^O) 


xdl(0)) 


•*di(0) 


*  “  2 . nn 


where  ng  Is  a  positive  integer.  Further,  in  the  (target)  motion  model,  replacing  the  disturbance 
noise  vector  v(k)  and  the  Initial  state  vector  x(0)  by  the  discrete  random  vectors  wj(k)  and  xd(0) 
respectively,  and  then  quantising  the  suttee  by  Quantiser  (2.3.1),  the  target  motion  modal  is  reduced 
Co  the  finite  state  model 


where 


*  <k*l)  -  Q{£(k,  x^k),  u(k),  wd(k))) 


(2.3.2) 


Q(*> 

xq(k) 


is  Quantiser  (2.3.1) 

la  the  quantised  state  vector  at  time  k,  and  Its  possible  values  (l.e.,  the 
quantisation  levels  of  the  state  vector  x(k))  are  x„,(k),  x^jCk),  x  (k) 
where  n^  is  the  number  of  possible  quantisation  levels  of  tho  state  <,t“k 
vector  x(k) 


x  (0)  £  x.(0)-,(by  definition,  Xqi(0>  &  *dl^0^*  1  *  1*  2*  ....  no*  *n  ot,ler  words,  tho 
^  quantisation  levels  of  x(0)  are  eetumed  to  equal  the  possible  values  of  the  discrete 

.fandom  vector  x(0)> 


u(k)  le  a  known  pilot-command  vector  at  time  k. 


Throughout  this  chapter  whenever  the  target  motion  model  (or  the  state  equations,  or  tho  terget  motion) 
Is  mentioned  ve  refer  to  Model  (2.3.2)t  l.e..  It  is  assumed  that  tha  target  motion  Is  described  by 
(Finite  State)  Model  (2,3.2). 


The  transition  probability  itjg(k),  which  la  definad  by  the  conditional  probability  that  the 


Gate  R?,  given  that  the 
,  naoety, 


tt  .(k)  ■  Prob{x  (k+1)  -  x  |x  (k)  -  x  }  (2.3.3) 

js.  q  q*.  q  qj 

is  determined  as  follows  (see  Figure  2.3.1). 

Let  us  assume  that  the  quantized  state  vector  Xq(k)  is  equal  to  the  quantization  level  Xqj  for 
Gate  Rj  (i.e.,  the  target  is  in  Rj  at  time  k) .  The  transitions  from  this  quantization  level  to  the 
others  are  determined  by  the  discrete  random  vector  w<j(k)  and  the  function  Q{f(k,  x„(k)  ■  Xqj,  u(k), 
wg(k))}.  The  discrete  random  vector  wd(k)  can  take  any  value  in  the  set  {w^(k),  v^OO.  •••>  w^^OO) 

with  corresponding  probabilities  pdi(k),  pd2(k),  ....  Pdmi[(k).  Thus,  the  quantized  state  vector 

Xq(W-l)  may  be  equal  to  at  most  m^  different  quantization  levels.  If  the  function  f(k,  Xq(k)  «  Xqj, 
u(k),  wd(k))  maps  Xqj  into  another  gate,  say  R?  for  only  one  possible  value,  say  w<Ji(k),  of  the  discrete 
random  vector  wd(k),  then  the  transition  probability  Hj^fk)  (from  Gate  R?  to  Gate  Rj)  is  the  prob¬ 
ability  that  the  possible  value  wdi(k)  of  wd(k)  occurs,  i.e.,  J 


Vk)  ■  pdi(k) 

However,  if  the  function  f(k,  Xq(k)  -  Xqj,  u(k),  wd(k))  maps  Xqj  into  another  gate,  say  R$,  for  more 
than  one  possible  value  (say  vd2(k)  and  wd2(k))  of  v<i(k),  the  transition  probability  TTj^(k)  (from 
Gate  R?  to  Gate  rJ)  Is  the  probability  that  the  discrete  random  vector  w (k)  is  equal  to  one  of  these 
possible  values  (w^k)  or  wd2(k)),  i.e.,  d 

V(k)  ’  ^  Pdn(k)  "  pdl(k)  +  Pd2(k) 

n 

where  the  summation  Is  over  all  n  such  that 

Q{f(k,  *q(k)  -  x^.  u(k),  wdn(k)))  -  . 

Havlrtg  determined  the  finite  state  model,  we  can  represent  the  target  motion  by  a  diagram  called 
a  Trellis  diagram  for  the  target  motion. 

2.4  A  Trellis  Diagram  for  the  Target  Motion 

Let  us  assume  that  the  quantized  state  vector  Xq(k)  has  nk  possible  values,  say,  xqi(k),  x  2(k), 

•  ••>  XqngM  where  nk  la  a  positive  integer.  To  represent  the  target  motion  by  a  graph,  we  adopt  the 
following  convent  tone 

(1)  Each  possible  value  of  Xq(k)  Is  represented  on  the  ktk  column  by  a  point  (sometimes 

called  node)  with  the  corresponding  quantization  level  so  that  the  k1**  column  contains 

the  possible  quantisation  levels  of  Xq(k)  (in  other  words,  the  possible  gates  In  which 

the  target  can  lie  at  time  k)  where  k  «  0,  1,  2,  ... 

(2)  The  transition  from  one  quantisation  level  to  another  it  represented  by  a  line  having 
a  direction  indicating  the  direction  of  the  target  motion. 

Hence,  the  target  motion  from  time  zero  to  time  L  can  be  represented  by  a  directed  graph  shown  in 
Figure  2.4.1,  which  le  called  the  trellle  diagram  for  the  target  motion  from  time  zero  to  time  L. 

Definition  2.4,1  A  path  In  the  trellis  diagram  la  any  sequence  of  directed  lines  where  the  final 
vertex  of  one  la  the  initial  vertex  of  the  next  one. 

2.5  Approximate  Observetion  Models 

So  far  the  target  motion  model  hee  been  reduced  to  a  finite  state  model  which  usee  the  quantized 
stats  vectors  (Xq(O’s).  However,  the  observation  models  in  (2.1.1)  and  (2.1.2)  use  the  target  state 

vectors  (x(>)‘e).  Thus,  In  the  observation  models  In  (2.1.1)  and  (2.1.2),  replacing  the  state  vector 

x(k)  by  the  quantised  state  vector  x  (k),  the  following  approximate  observation  models  are  obtained 


quantized  state  vector  Xq(k+1)  will  be  equal  to  the  quantization  level  x„£  for  i 
quantized  state  vector  x^(k)  is  equal  to  the  quantization  level  x^  for  Gate  R“ 


»(k> 


g(k,  xq(k),  v(k)) 


In  clear  environments 


(2.5.1) 


g(k,  x^k),  I(k),  v(k» 


in  the  presence  of  Interference 


i  7-V 


time  1  (column  1) 


time  L  (column  L, 


End  (final) 

quantization  levels  (nodes) 


Figure  2.4.1.  The  Trellis  Diagram  for  the  Target  Motion 


From  now  on,  whenever  the  observation  model  (or  the  measurement  model  (equations))  is  mentioned, 
we  refer  to  the  models  in  (2.5.1).  The  observation  models  in  (2.5.1)  are  used  in  the  following 
analyses. 


Let  us  consider  the  trellis  diagram  In  Figure  2.4.1  where  It  Is  assumed  that,  without  loss  of 
generality,  the  target  will  be  tracked  from  time  zero  up  to  and  Including  time  L.  Therefore,  the 
trellis  diagram  Is  drawn  from  time  zero  to  time  L.  Time  zero  refers  to  the  Initial  state.  Let  us 
now  define  the  following  which  will  be  used  throughout  our  further  analyses. 


Is  the  mmber  of  quantization  levels  for  the  gates  In  which  the  target  may  lie  at 
time  1,  In  other  worda,  Che  number  of  possible  values  of  the  quantized  state  vector 
xq(l)  whore  1  »  0,  1,  2,  ....  L 

g(i)  Is  the  set  of  all  the  quantitation  levole  for  the  gates  tn  which  the  target  may  lie 
at  time  1,  namely, 


M 


8<i>  &  (*(}1(i),  *q2U)»  •••.  Xqn  (1)} 

where 

1*0,  1,  2,  ...,  I 

la  the  number  of  possible  paths  through  the  trellis  diagram;  this  number  le  lose  than 
or  equal  to 


L 

TT  «, 

j.O  J 


th 


le  the  n  path  through  the  trellis  diagram,  which  Is  indicated  by  a  thick  line 


«■(!)  Is  the  quantization  level  for  the  gate  In  which  the  target  liee  at  time  1  when  It 
**  follows  path  lla.  In  other  words,  the  possible  value  of  the  quantized  state  vector 
Xq(l)  which  the  mth  path  passes  through.  For  example,  in  Trellis  dlagrsa  2.4.1, 

x*(0)  -  x  ,(0),  x“(l)  «  x  (1),  x*(2)  •  x“(2),  ... 
q  q2  q  q2  q  q 

k?  Is  ths  probability  that  the  possible  value  of  ths  Initial  state  vector  xa(0)  from  which 
the  m*h  path  etarts  occur*,  namely,  ng  “  Prob(x.(0)  •  ^(O)}.  For  example,  in  Trellle 
diagram  2.4.1  q 


it"  ■  Prob(xq(0)  -  xq2(0)} 


is  the  transition  probability  from  the  (1-1)  gate  for  the  m  path  (i.e.,  the  gate 
from  which  the  target  passes  at  time  1-1  when  it  follows  path  Hjj)  to  the  1th  gate  for 
the  m1*1  path.  In  other  words,  it  is  the  transition  probability  that  the  target  will  be 
at  the  iih  quantization  level  (node)  of  path  Hjj  at  time  i  when  it  is  at  the  i-1 
quantization  level  (node)  of  at  time  i-1,  that  is,  nij  A  Prob{xq(i)  -  x5(i)  |x.(i-l)  - 
x®(i-l)}.  For  example,  in  Trellis  diagram  2.4.1 

ir“  -  Prob{xq(l)  -  x“(l)  |xq(0)  -  x“(0)} 

A  Prob(x  (1)  -  x  ,(1) |x  (0)  -  x  .(0)} 

“  q  q2  1  q  q2 

-  Prob{x  (2)  -  x“(2)|  x  (1)  -  xm(l)} 

2  q  q  q  q 

A  Prob(x  (2)  -  x  ,(2) |x  (1)  -  x  ,(1)} 

”  q  qz  q  q2 

Hq831  is  the  maximum  of  the  probabilities  that  the  quantization  levels  at  time  zero  occur, 
i  •  ©* , 

-5“  A  aSffo  Prob{xq(0)  -  a) 

it?8*  is  the  maximum  of  the  transition  probabilities  from  the  quantization  levels  at  time 
i-1  to  the  quantization  levels  at  time  1  (where  1  -  1,  2,  . ...  L),  that  is, 

it?8*  ■  max  Prob{x  (i)  -  a|x  (i-1)  •  b} 

1  aeg(i)  *  “ 

beg(i-l) 

x?*n  is  the  minimum  of  the  probabilities  that  the  quantization  levels  at  time  zero  occur, 
i.e., 

tt?*“  ■  max  Prob{x  (0)  -  a} 

0  aex(O)  q 

niin  t^e  the  transition  probabilities  from  the  quantization  levels  at  time 

1-1  to  the  quantization  levels  at  time  1  (where  i  -  1,  2 . 1),  namely, 

n?in  -  min  Frob{x  (i)  ■  a|x  (i-1)  »  b) 

1  aex(i)  *  1 

beg(i-l) 

x®  A  (x“(0),  x“(l),  ....  Xq(L))  which  is  the  sequence  of  the  quantization  levels  (nodes) 
which  the  m*"  path  passes  through,  obviously, 

x“(i>  c  g(i)  i  •  0,  1,  2 . I 

tL  «  {z(l),  z(2),  ...,  n(L) )  is  the  observation  sequence  from  time  1  to  time  L 

I1*  A  { I (1) ,  1(2),  ...,  1(D)  is  the  interference  sequence  from  time  1  to  time  L. 


Obviously,  the  target  motion  occurs  along  one  of  the  possible  paths  In  the  trellis  diagrsm.  Hence 
our  aim  is  to  decide  a  path  in  the  trellis  diagram,  which  is  moat  probably  followed  by  the  target, 
by  using  the  observation  sequence  zK  Because  of  randomness  in  the  models,  our  approach  must  be 
statistical,  I.e.,  a  statistical  optimisation  problem.  Based  on  the  observations,  we  shall  guess 
which  path  was  followed  by  the  target.  Hence,  a  criterion  la  needed.  For  a  tracking  problem,  a 
suitable  criterion  may  be  the  minimum  error  probability  criterion,  which  Is  a  special  case  of  Bayes 
criterion  In  Detection  Theory.  Using  this  criterion  reduces  the  problem  of  finding  the  path  moot 
probably  followed  by  the  target  to  a  (Composite)  Multiple  Hypothesis  Testing  problem. 

2.6  Minimum  Error  Probability  Criterion 

In  the  previous  section,  we  labeled  the  M  possible  paths  through  the  trellis  diagram  Hj,  Hj,  . . . i 
H^.  Sometimes  these  paths  ars  referred  to  as  Hypotheses.  Hence,  using  the  minimum  error  probability 
criterion  and  the  observation  sequence,  we  would  like  to  decide  which  hypotheses  is  true  (in  other 
words,  we  would  like  to  find  the  path  most  porobsbly  followed  by  the  target).  To  accomplish  this 
we  develop  a  decision  rule  that  assigns  each  point  in  the  observation  space  D  to  one  of  the  hypotheses. 
Therefore,  we  can  view  the  decision  rule  as  dividing  the  whole  observation  space  D  Into  M  subspaces 
D},  Dg,  and  Du  (sea  Figure  2.6.1).  If  Che  observations  fell  In  the  subspace  Dj,  we  decide  that 
the  target  followed  path  H*  (i.e.,  ti,  is  true).  Subspace  D{  is  callod  the  decision  region  for 
Hypothesis  Hj.  Therefore,  we  must  choose  the  decision  regions  Di,  D2,  ...,  and  D^  in  such  a  way  that 
the  overall  error  probability  is  minimized. 

Ths  ovsrsll  error  probability,  sometlmas  called  the  Bayee  risk  (R),  is  deflnsd  by 

*  A  5  [  ([  ,  P<“i>  P*<*U|K  )  dzL) 

1-1  j-1  VcD.  3  J 

Jfi  1 


(2.6.1) 


Figure  2.6.1.  Observation  Space  and  Decision  Regions 


where 

(  P(*L|«j) 
p'C^lHj)  -i 

[  |  p(*L|Hj,1L)  p(1L)  diL 


in  clear  environments 


in  the  presence  of  interference 


(2.6.2) 


poy 

p(il|hj,il) 

P(1L) 


is  the  probability  that  Hypothesis  It*  (the  path  Hj)  is  true,  and  it  is  called 
the  a  priori  probability  of  Hypothesis 

is  the  conditional  probability  of  the  observation  sequence  *L  in  clear 
environments,  given  that  Hypothesis  Hj  is  true  (l.u.,  the  target  followed 
the  path  Hj) 


la  th*  conditional  probability  of  the  observation  sequence  t1,  in  the  presence 
of  Interference,  given  Hypothesis  H^  and  the  interference  sequenco  IL 

it  thu  Joint  density  function  of  the  interference  sequence  I*"  . 


In  order  to  find  the  optimal  decision  rule,  ve  vary  the  decision  regions  1>1,  Dg,  ....  and 
ao  that  the  risk.  R  is  minimised.  It  is  well  known  that  the  optimum  decision  rule^4  is 

choose  Hj  if  p(Ht)  p* C»l*i«t)  >  p(H  )  p'(*L|Hj)  for  ail  J  *  1  (2.6.3) 

For  a  given  obeervstlon  sequence  iL,  if  the  inequality  in  (2.6.3)  becomes  sn  equality  for  one  or 
more  Hypotheeee,  Uj's,  any  one  of  these  Hj's  end  H^  can  be  chosen  as  the  decision.  This  does  not 
change  the  average  error  probability,  throughout  thia  chapter  for  all  observation  sequences,  iL's, 
for  which  the  inequality  in  (2.6.3)  becomes  an  equality,  the  decision  is  made  at  random  (e.g.,  by 
the  flip  of  a  fair  dime)  among  the  hypotheses  satisfying  the  equality.  Hence  the  optimum  decision 
region  for  Hypothesis  H^  becomes 

D,  ^  C*Lt  p(H^)  p*  <*L|Mi)  >  p(H*)  p,(tL|Hi)  for  all  Jdi,  end  all  the  observation  sequences, 
4  "  tL's,  for  which  the  inequality  In  (2.6.3)  becomes  sn  equality  and  then  Hypothesis 

Hj  has  been  eho«*  n).  (2.6.4) 

It  should  be  noted  that  the  declelon  regions  are  nonoverlapping,  namely, 

O^OOj  •  ♦  for  1  4  J 

where  O  and  6  stand  for  lnterssctlon  and  the  empty  set  reapsetlvely.  However,  the  union  of  all  the 
decision  regions  cover  the  whole  observation  space  D,  that  le, 

H 

D  -  t'  D. 
l-l  1 


Hence,  the  optimum  decision  rule  may  be  interpreted  as  follows:  If  the  observation  sequence  zL 
falls  within  the  optimum  decision  region  D^,  then  choose  Hypothesis  (path)  as  the  decision,  i.e., 

choose  H^  if  i**  e  (2.6.5) 

Having  determined  the  optimum  decision  rule  (2.6.3)  (or  (2.6.5))  with  respect  to  the  minimum  error 
probability  criterion,  we  apply  it  to  tracking  problems  in  the  next  section. 


2.7  Optimum  Decision  Rule  for  the  Target  Paths 


Let  us  consider  the  motion  models  in  both  (2.1.1)  and 
(2.5.1).  The  a  priori  probability  of  Hypothesis  H^  can  be 


(2.1.2)  and  the  observation  models  in 
rewritten  as 


pou  -TT  \ 

1  k-0  “ 


(2.7.1) 


since  the  disturbance  noise  vector  w(k)  is  assumed  to  be  independent  of  w(j)  and  x(0)  for  all  j  +  k, 
where  is  as  defined  in  Section  2.5.  Further,  using  the  assumption  that  the  interference  vector 
I(k)  is  Independent  of  I(j)  for  all  k  i  ),  we  can  rewrite  the  joint  density  function  of  the  interference 
sequence  IL  as 


P<IL)  -  ft  p(Kk)) 
k-1 


(2.7.2) 


where  p(I(k))  is  the  probability  density  function  of  the  interference  vector  I(k).  Moreover, 
recognising  that  the  sequence  x“,  defined  in  Section  2,5,  describes  Hypothesis  completely,  and  using 
(2.7.2)  and  the  assumption  that  the  observation  noise  is  independent  from  sample  to  sample,  the 
function  p'(zMHi)  in  (2.6.3)  can  be  rewritten  as 

P'U1!^)  -  p'(xl|*J) 


■  ft  p'(s<k)|x*(k» 
k-1  * 


(2.7.3) 


where 


in  clear  environments 


p'(t(k)|**(k)) 


(2.7.4) 


f  p(*(k)|xj(k}> 

I  p(t(k)|x* (k),  l(k))  p(I(k))  dl(k)  in  ths  presence  of  Interference 
;l(k)  < 

p(s(k)  jx^k))  is  the  conditional  probability  of  ths  observation  *(k)  In  clear  environments  in 

"  (2.5.1),  given  that  x  (k)  •  x*(k),  i.e,, 

q  q 

p(*(k)l»J(k))  ^  p(t(k)ix  <k)  -  xj(k)) 

p(t(k) |xl(k) ,  l(k))  la  the  conditional  probability  of  the  observation  t(k)  in  the  presence  of 
*  interference  in  (2,5.1),  given  that  x  (k)  -  x*(k)  and  l(k),  that  la 

q  q 

p(*(k)|x‘(k),  t(k))  -  p<t(k)|x  <k>  -  x*(k),  l(k)> 

Let  ua  now  consider  the  function  p*(t(k) |x*(k))  in  the  presence  of  interference,  that  is 


p'(t(k)|x‘(k))  -  [  p(i{k)txj(k>.  I(k))  p(I(k))  dl(k) 
<  'l(k)  ** 


(2.7.5) 


whether  or  not  this  integral  can  be  evaluated  in  a  closed  form  depends  on  the  function  g(k,x-(k)  * 
xi(k),  l(k),  v(k)),  and  tha  atatiatlcs  of  the  it  erference  I(k)  and  the  observation  nola*  v(8).  In 
many  casts,  a  numerical  Integration  might  be  used  to  evaluate  it.  Howsvtr,  throughout  this  chapter, 
approximating  the  Interference  vector  I(k)  by  e  discrete  random  vector  I<i(k)  whose  possible  values 
are  Idl(k),  Id»(k),  ...,  X<irv(k),  with  correspoodlng  probabilities  p(ld|(k)),  p(Id}(k)),  ....  and 
PUdrgtk)),  i.e.. 


Prob(td(k)  -  ldt(k)}  -  p(Idl(k)) 


the  Integral  in  (2.7.5)  la  raduced  to  a  aiamutlon 


(2.7.6) 


[  p(e(k>|x*<k),  I(k))  p(I(k))  dl(k)  =  l  p(x(k)|x*(k),  I  (k»  p(1 ,5(k» 

■'l(k)  q  2-1  q  dl  di 

where  r^  is  the  number  of  possible  values  of  the  approximating  discrete  vector  I,j(k).  In  other  words, 
by  chainging  the  interference  I(k)  to  I<j(k)  we  make  another  approximation  for  the  observation  model 
in  the  presence  of  interference  in  (2.5.1).  The  observation  model  becomes 


*<k)  -  g(k,  x  (k),  I(k)  -  I.(k),  v(k)) 
q  d 

A  g(k.  x  (k),  Id(k),  v(k)).  (2.7.7) 

From  now  on,  throughout  the  chapter,  if  it  is  not  easy  to  calculate  the  integral  in  (2.7.5),  the 
integral  will  be  approximated  by  (2.7.6).  In  other  words,  vnen  the  integral  in  (2.7.5)  can  not  be 
easily  evaluated.  Observation  model  (2.7.7),  (instead  of  the  observation  model  in  (2.5.1)),  will  be 
used  for  further  analyses. 

Substituting  (2.7.1)  and  (2.7.3)  into  Optimum  Decision  Rule  (2.6.3),  we  obtain 


However,  it  is  frequently  more  convenient  to  perform  sussations  than  multiplications.  Since  the  In 
function  is  a  monotone  increasing  function,  taking  the  natural  logarithms  of  both  side*  of  the 
inequalities  In  (2.7.8),  we  get 


for  all  j  i  i  (2.7.9) 

where 


/P<*(k)|*‘(k)> 

q 


in  clear  environments 


(2.7.10) 

\  In  the  presence  of  interference 


p*(«(k)|x*(k»  *<  j  p(a(k)|x‘(k),l(k))  p(Uk»  di(k)  using  (2.5.1) 

T k  . 

p(*(k)|x‘(k),ldt(k))  pUdt(k»  using  (2.7.7) 

p(s(k)|x*(k),l  (k)  «  p(*(k)|a  (k)  •  a*(k),l(k)  *  t .  (k))  which  ia  the  conditional  probability  of  «(k) 

q  q  q  “in  (2.7.7),  given  that  *-(k)  •  x*(k)  and 


id(k)  -  ^(k). 


tlthar  one  of  the  expressions  in  (2.7.8)  and  (2.7.9)  with  the  convention  in  Section  26  ie  the  optimum 
dec la  ion  rule  for  deciding  the  psth  meet  probably  followed  by  the  target. 

Sow  ve  ere  going  to  verify  the  following  equalities  for  the  observation  models  in  the  presence 
Of  interference  in  (2.3.1)  end  (2.7.7). 


p*(.(M|*J(k))  -  p(»(k)|x“(k)) 

p{tL|a.)-Tt  p<*(k)|x‘<k)) 

1  k-i  q 

-  p»(mt|#1)  (2.7.11) 


where 

p(t(k) |x*(k))  ie  the  conditional  probability  of  the  obeervetion  s(k)  in  the  presence  of 
q  interference,  given  that  x  (k)  «  xMk) 

q  q 

pfa^lH.)  is  the  conditional  probability  of  the  observation  sequence  «"  in  the  pretence 

of  interference,  given  that  Uypotbasia  a*  in  true  (i.e.,  the  target  followed 
the  path  a  ), 


Lee  us  see  this  fact  for  the  observation  model  In  the  presence  of  Interference  In  (2.5.1).  Proa 
(2.7.10),  ve  have 


i 


p'(z(k)|x*(k))  -  f  p(x(k)|x*(k),  I(k)>  p(I(k))  dl(k)  (2.7.12) 

q  -f  I(k)  q 

Using  Bayes'  rule  end  the  assumption  that  the  interference  I(k)  is  Independent  of  the  initial  state 
vector  x(0)  and  of  the  disturbance  noise  vector  v(j)  for  all  j,  k.  Ve  obtain 

p(z(k) ,  x*(k),  l(k)) 

p(z(k)|x*(k>,  I(k))  - - ; - 3 -  (2.7.13) 

q  p(x*(k))  p(I(k)) 

Hence,  substituting  (2.7.13)  into  (2.7.12)  and  recalling  that  the  Integration  is  taken  over  all  the 
sample  space  of  I(k),  yields  the  first  equality  in  (2.7.11).  The  second  equality  in  (2.7.11)  follows 
from  the  assumption  that  the  interference  and  the  observation  noise  are  independent  from  sample  to 
sample.  The  same  equalities  in  (2.7.11)  can  be  verified  for  Observation  Model  (2.7.7).  Hence  for  all 
observation  models  already  considered,  the  function  p* ( » j • )  is  a  conditional  probability  density 
function. 

Let  ua  give  soma  definitions  which  will  be  used  later  on 

Definition  2.7.1  The  metric,  denoted  by  HN(x^(0))t  of  the  initial  node  x^fQ)  Is  defined  by 

MS(s  .(0))  -  iafProbU  (0)  -  x  .  (0) ) ]  (2.7.14) 

qi  q  qi 


Consequently 


MH(x*(0))  -  In 
q  0 

Definition  2,7.2  The  metric,  denoted  by  Hfx^jU-l)  *  x^jlk)),  of  the  branch  which  connects  the 
quantisation  level  (node)  x^(fc-l)  to  the  quantisation  level  x^(k)  is  defined  by 

M(x  ,<k-l)  *  x  ,(k)>  A  l«(Prob(*  (k)  •  x  Ck)|x  (k«l) 

<U  qi  -  1  qv  q 

•  x  ,(k-l)H  ♦  in  p'(s{k)jx  ,(k»  (2.7.13) 

qi  qi 


Definition  2.7.,t  The  metric  of  a  path  from  time  sero  to  time  i  1#  the  summation  of  the  wetrlc 
of  the  initial  tvd*  from  which  the  path  starts  and  of  the  metric*  of  the  branch**  which  the  path 
consist#  of.  for  example,  the  metrie,  denoted  by  «(**(!)>,  of  the  portion  between  the  nodes  x*(0) 
and  *®(D  of  the  path  Uypothesis)  I  it  "  q 


Consequently,  the  metric,  eometime*  leiwted  by  of  the  path  (through  the  trellis)  it 

K(«t  >  4  N  («*(«) 

•  ln(p(H  )  p’fs^lu  )J  (2.7.17) 

■  a 

where  **(U  la  the  end  node  of  the  path  H,,  p(U,)  rad  p'it^jiL.)  are  given  by  (2.7.1),  (2.7.))  and 
(2.7.10?. 

Definition  2.7.4  The  error  probability  of  a  path,  eey  Ng,  In  a  trellis  diagram  T  with  M  possible 
paths  Hj,  Hj,  ...,  %  is  the  probability  of  deciding  a  path  which  ia  different  than  H,  a*  the  one 
most  probably  followed  by  the  target  when  the  target  actually  followed  the  path  t^.  This  error 
probability  la  denoted  by  either  Hj,  ....  W^)  or  ft  (T)  where  subscripts  t  and  m  stand  for 

error  and  the  »1^  path.  Hence 


P£  (Hj,  ....  Hjj)  &  P£  (T) 
m  m 

^  Prob{zL  c  dJ^} 


-  f  p(*L|H  )  dzL  (2.7.18) 

z  £D  “ 

m 

where  5n  is  the  complement  of  the  decision  region,  for  the  path  Hj,,  and  pU^jH,,)  is  i-1-  probability 
density  function  of  the  observation  sequence  when  the  target  actually  followed  the  pain  (h..  Hence 
from  (2.6.1),  the  overall  error  probability  for  the  detection  of  the  patn  most  probably  followed  by 

the  target  (denoted  by  8,  Pj-,  Pg(Hj . Hjj),  or  Pg(l))  can  be  expressed  in  terms  of  the  path  error 

probabilities  as  follows 


M 

PE  '  l  P(V  PE  (H1>  U2' 
o 


V 


(2.7.19) 


where  p(H  )  is  given  by  (2.7.1). 

© 

Definition  2.7.5  The  d-mslty  function  of  the  observation  sequence  a^“  when  the  target  actually 
followed  the  path  il,,  (t.e.,  pCz^j)^))  is  referred  to  as  the  likelihood  function  for  the  path  (hypothesis) 


Therefore,  the  opticus  decision  rule  for  deciding  the  path  most  probably  followed  by  the  target  from 
time  a-er*  7s?  t fee  L  lead  us  to  choosiug  the  path  (free  time  sere  to  tine  L)  with  largest  metric  (t.i 
the  tyollis  diagram).  This  can  be  handled  by  using  the  Viterbi  Decoding  Algorithm  (Vt>A)^>*°,  which 
ie  the  optimum  decoding  algorithm.  The  algorltte  which  obtains  the  trellis  diagram  for  the  target 
motion  (model),  a*  described  before,  and  which  finds  the  path  most  likely  followed  by  the  target  by 
using  VDA  is  referred  to  as  Optimum  Decoding  based  Smoothing  Algorithm  (QPSA). 

2,-8  OPTIMUM  DECODING  BASED  SHXJTiUNG  AUXlRlTWf 

initial  Step  -  reducing  the  target  motion  model  to  a  finite  state  model,  as  described  before, 
obtain  a  trellis  diagram  for  the  target  motion  (model)  from  time  aero  to  the  time,  say  L. 
until  whleh  the  target  will  he  tracked.  Then  assign  to  each  node  its  metric. 

first  Step  -  for  each  node  at  time  one!  Using  the  observation  *<1),  evaluate  the  metrics  of  the 
branches  connecting  'he  initial  nodes  to  the  node  at  time  one;  adding  these  metrics  to  the 
metrics  of  the  initial  nodes  from  which  the  branches  start,  find  the  metrics  of  the  path* 
merging  at  the  node  at  time  one,  and  label  the  path  with  largest  metric  (which  is  called  the 
best  path  for  the  node  at  time  one),  then  discard  the  other  paths,  finally  assign  the 
largest  metric  to  the  <uxl«  at  time  one  (which  i*  called  the  metric  of  the  node  at  time  one). 

th 

k  Step  “  for  each  node  at  time  hi  Using  the  observation  at  time  k,  calculate  the  metrics  of 

the  branches  connecting  the  nodes  a'  lie*  k-1  to  the  node  at  time  k;  adding  these  metrics  to 
tne  metrics  of  the  nodes  at  time  V-i  from  which  the  branches  start,  find  the  metrics  of  the 
path*  merging  at  the  node  at  lime  k  and  label  the  path  with  the  largest  metric  (which  is 
called  the  best  path  for  live  node  at  time  k),  then  discard  the  other  paths,  finally  assign 
the  largest  metric  to  the  node  at  time  k  (which  is  called  live  metric  of  the  node  at  title  k). 

If  k  •  1,  Stop,  and  choose  among  the  nodes  at  time  k  the  one  with  the  largest  metric,  then 

decide  the  beat  path  for  this  node  as  the  path  followed  by  the  target. 

The  following  section  illustrates  the  optimum  decoding  based  uuvtHitg  algorithm  by  on-exaapie. 

2.1.1  An  example 

let  us  consider  a  target  whose  motion  from  time  aero  to  tine  2  Is  described  Ly  figure  J.S.i. 

Using  GBSA,  we  would  like  to  find  the  path  in  the  trellis  diagram  which  was  most  probably  followed 

by  the  target  from  time  aero  to  tine  2. 

Initial  Step  -  To  each  node  at  time  aero.  Its  metric  is  assigned,  l.e.. 


fOi(a  . (0))  •  Probtx  (0)  •  s  (0)1  .  I  «  1,  2.  i 

qi  q  qi 

Pram  mow  on,  the  metric  of  the  mode  x  , (k)  is  represented  by  XX(x  , (k)). 

qi  qi 

First  Step  -  Consider  the  node  s.jO).  The  branches  s^jfO)  s^i(l)  and  x.j(O)  x-fU)  are  the  only 
ones  connecting  the  nodes  at  time  *»ro  to  x„i(i).  Hence  calculating  the  metrics  of  these 
branches  and  then  adding  these  metrics  to  the  metrics  of  the  nodes  x  ,(0)  and  »  .(0),  the 
following  are  obtained  * 


AU  *  v(1»  ♦  ““V0” 

*12  fi*(*ql(0)  •  s^fl))  ♦  NJI(Xgj'O)) 
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Time  Zero  Time  2 


e.  Trellis  Diagram 
for  the  Target 
Motion  from  time 
zero  to  time  2. 


Time  Zero  Time  2 


b.  Diagram  at  the  end 
of  the  first  step. 


Time  Zero  Time  2 


e.  Diagram  at  the  end 
of  the  second  step. 


Figure  2.8,1.  Diagrams  for  the  Exmaple  for  the  Optimum  Decoding  based  Smoothing  Algorithm 


Further,  assuming  that  An  >  An.  the  path  x  2(0)  x„,  (1)  is  chosen  ,.s  the  best  path  for  the 
node  XqjU),  and  *n  is  assigned  to.  the  node  *  ^(1)  as  its  metric,  i.e., 


VJ)>  ■  AU 


then  the  path  x-iCP)  x_|(l)  is  discarded,  let  us  now  assume  that  the  following  are  similarly 
found  for  the  node  x  -(1). 


«ql<0>  Xq2d)  i»  thu  best  Path  for  x^2(i) 

MK<xq2U))  -  MU^lO)  -  xq2a))  +  KN(X(ji(0)) 

Hence,  w*  have  Figure  2.8.1.b  at  the  end  of  the  firat  etep. 

Seeowi  Step  -  Consider  the  node  xql(2).  The  branches  x^O)  xqi(2)  and  xq2v  xql (2>  are  the  onea 
connecting  the  nodes  at  time  one  to  the  node  x«iw .  Hence  calculating  the metrics  of  thtaa 
brunches  end  aiding  these  metrics  to  the  metrics  of  the  nodee  x  ,{1)  and  x  al).  the  following 
ere  obtained 


An  4  Xfx^U)  *  x^U))  ♦  Mifx^U)) 

*22  4  H(x^U)  -  Sql<2)>  ♦  HJKx^U)) 

Further,  asaumlng  the*  A|2  *  *21*  th*  P*1*  M<0)  Aajd)  U  choatn  aa  tha  beat  path 

for  the  node  xqi<2),  end  *22  la  aaaigned  to  the  node  *qlU)  *a  lta  metric,  that  la. 

then  the  path  xqj(0)  x^U)  *^j(2>  it  dlacardtd.  let  ua  now  attorns  that  the  following  art 
similarly  found  for  the  node  x  ,(2), 

d* 

x^jfO)  x^U)  *q2<2)  i*  the  beat  path  for  x^j(2) 

Wtk^U))  *  Kfx^Cl)  -  x^{l»  +  WHx^Oi) 

Haoce,  we  have  Figure  2.8. I.e  at  tha  end  of  tha  second  step,  lo  addition,  assuming  that 
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MN(xq2(2))  >  MN(xql(2)) 


the  path  x_2(0)  xa,(l)  x  ?(2)  is  chosen  as  the  path  followed  by  the  target  from  time  zero 
to  time  2.  4  H 

Having  defined  the  optimum  decoding  based  smoothing  algorithm,  its  performance  should  be  deter¬ 
mined.  This  is  discussed  in  the  next  two  sections. 

2.8.2  An  Upper  Bound  for  the  Overall  Error  Probability 

Let  us  consider  a  target  whose  motion  from  time  zero  to  time  L  is  described  by  a  trellis  diagram 
with  M  possible  paths  H,,  Hj,  ...,  Hj^  from  time  zero  to  time  L.  The  evaluation  of  the  overall  error 
probability  Pg  for  the  detection  of  the  path  followed  by  the  target  from  time  0  to  time  L  is  conceptual¬ 
ly  easy;  however,  it  is  in  general  computationally  impractical  since  it  contains  multidimensional 
integrals.  On  the  other  hand,  upper  bounds  on  Pg  are  available  which  in  some  cases  approximate  Pg 
quite  well.  One  of  these  bounds  is  presented  below. 

Let  ri  be  a  subset  of  the  observation  space  D  such  that 

A  {zL:  M(Hj)  >  Mtt^)  for  some  j^i}  (2. 8. 2.1) 


where  MCH^)  is  the  metric  of  the  path  (hypothesis)  H^  and  it  is  given  by  (2.7.17).  Then  IL  contains 
the  complement,  D^,  of  the  optimum  decision  region  (since  the  observation  sequences  (zL+s)  for 
which  the  inequality  in  (2.6.4)  becomes  an  equalitv  are  resolved  at  random  into  the  decision 
regions  satisfying  the  equality).  It  then  follows  from  (2.7.18)  the  error  probability  of  the  path 
can  be  upper  bounded  by 


?£  (Hx,  H2,  ....  Hm)  <  j  L  p(*L|Ht)  dzL 


1 


vh«r*  the  function  6(0  is  defined  by 


♦U1*)  & ' 


,  p(zL|H  )  (j>(zL)  dzL 

* \ri  x 


1  if  *  E  r. 


0  elsewhere 


(2. 8. 2. 2) 


(2.8.2..*) 


Furthermore,  4(t  )  can  be  upper  bounded  «•  follows! 

If  a1*  e  rt,  it  then  follows  from  (2. 8.2.1)  that  for  some  j  i  i,  M(Hj)  -  >_  0.  Hence 


exp  a(H(Hj)  -  HO^))  >  1 

for  *oec  j  i  1  and  any  non-negative  masher  a.  Thus,  for  sny  non- negative  numbers  a  and  p  we  have 


(  J  exp  -  M(H.)1)P  »  1  for  any  a,  p  >  C  (2. 8.2.4) 

)#l  J  1 

On  the  other  head,  if  iL  f  T,,  then  the  expression  on  the  left  hand  aide  of  the  Inequality  in  (2. 8, 2. 4) 
la  at  laeat  a  non-negative  number.  Therefore,  for  all  a*-,  we  obtain 


♦<*H  <  (  ]  vep  a(M(U  )  »  M(H.)J)P  for  any  a,  p  >  0  (2.8. 2. 5) 

~  J?1  3 

Further,  aubatltutiag  (2. 8.2. 5)  into  (2.8.2. 2)  yields 


tap  a(K(Hj) 


M(ttt)])p  diL 


(2.8. 2.6) 


the  integrand  in  (2. 8. 2. 6)  la  obviously  non-negative.  Hence  enlarging  the  domain  of  the  integration 
in  (2, 8. 2. 6)  makes  the  valua  of  the  Integral  Larger.  Therefore,  the  error  probability  of  can  be 
further  upper  bounded  by  taking  the  integration  over  the  whole  observation  apace 
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P  (»!«  ....  Vlf 


L  P(zLiHi> 


exp  -  ap  M(H  ) [  l  exp  a  M(H  )] 
1  jf*i  j 


p  dzL 


for  any  a,  p  >  0  (2. 8. 2. 7) 


Since  a  and  p  are  any  arbitrary  non-negative  numbers,  a  can  be  set  equal  to  l/(l+p).  Hence  taking 
a  »  l/(l+p)  and  further  using  (2.7.17),  (2.7.11),  and  the  following  equality 


b  blna 
a  »  e 


for  any  a,  beR*a>0 


the  error  probability  of  the  path  can  be  bounded  by 


VH1’H2* 


•Vi 


L 

(IT 

k-0 


_e_ 

j_a+p 

V 


L 

ITT 

k-l 


J_  L  _L_  L  _1_ 

p'(z(k)|x*(k))]1+p  {  l  (TT^)1+P  [JT  P'(z(k)|xJ(k)))1+p}p  dzL 


j#ik-0 


k-l 


for  any  p  >  0  (2. 8. 2. 8) 


vhere  p'  (z(k)  |xq<k)>  is  given  by  (2.7.10)  and  tiJ  is  defined  in  Section  2.5.  The  bound  in  (2. 8.2. 8) 
is  chat  of  Callaser's  type.  Sub  cituting  this  bound  for  the  error  probability  of  the  path  %  in 
(2.7.19)  yields  an  upper  bound  on  the  overall  error  probability  for  the  detection  of  the  path  followed 
by  the  target. 

2.8.3  An  Entembie  Upper  Bound  for  the  Overall  Error  Probability 

Let  ua  consider  a  t.-rget  whose  motion  ia  described  by  a  trellis  diagram  T  with  M  possible  paths 
Hi,  Hj,  ....  Hy  fr .i  time  zero  to  time  L  and  let  pass  through  the  quantization  levels  xi(0),  xj(l), 
....  x|(l)  (see  Figure  2.4.1).  In  order  to  derive  an  ensemble  bound,  let  us  start  defining  the  q 
following  symbols,  which  will  be  used  in  the  following  analyses. 


X®  is  the  set  of  all  possible  quantization  levels  from  time  one  to  time  L,  namely 


X®  A  {all  possible  values  of  x  (k)  for  k  »  1,  2,  ...,  l) 

the  number  of  elements  in  X® 
the  set  of  all  L-tuples  of  X® 

tha  ensemble  (or  set)  of  all  M-tuples  of  B*.  Hence  f  contains  (H®)1^  elements  in  it 

the  set  of  all  quantisation  levels  which  the  path  Hj;  peases  through  from  time  one 
time  L,  l.c., 


Hj  A  (x*(1),  x*(2) . X*(L)}  e  H* 

which  ia  an  L-tuple  of  X® 

T®  la  the  aet  of  all  quantisation  level*  from  time  one  to  time  L  in  T.  In  other  words. 


T®  A  <8*.  H®,  ....  H ®)  c  £ 

(H  la  the  enaambl*  of  all  poaalbl*  trelli*  diagrams  with  M  poaalble  path*  from  tlms  aero 

to  time  L,  which  are  obtained  from  the  trellis  diagram  T  by  replaclns  only  T*  by 
elamanta  of  (,  Htnca  this  enaabla  contalna  (N*)1"  elements  in  it.  cM  it  refarred  to 
a*  the  anaambla  of  each  motion  (or  trail ta  diagram)  in  ltmelf.  Obviously  eM  la  the 
ensemble  of  T  too  (since  T  c  £M) . 

The  exact  expressions  for  both  the  error  probability  and  tha  upper  bound,  givan  in  tha  pravloua 
taction,  contain  aultldlatnslooal  integral*  which  are  generally  vary  complex  to  evaluate.  Therefore, 
Instead  of  evaluating  these,  we  consider  an  average  error  probability  over  the  ensemble  c.  Averaging 
an  error  probability  over  thla  ensemble  la  referred  to  ea  "Random  Coding"  which  la  the  central  tachnlque 
of  Information  Theory.  An  upper  bound  averaged  over  tha  ensemble  &  la  celled  an  ensemble  bound,  which 
turns  out  to  b*  quite  simple  to  evaluate.  Obviously,  at  least  one  trelli*  diagram  In  £K  must  have 
an  error  probability  ae  email  as  thia  enaambl*  bound  (by  convention,  both  error  probsbllltle*  end  error 
probability  bounds  ere  assigned  to  the  related  trellis  diagrams).  In  other  words  an  ensemble  upper 
bound  will  give  ua  an  upper  bound  on  tha  error  probability  for  the  beet  trellis  dlegram  IncH  (l.e., 
tha  trellis  diagram  with  minimi*  error  probability  ln£tO. 

In  order  to  derive  an  ensemble  error  probability  or  an  anmmablm  bound,  first  e  probability 


B*  is 

€  ia 

H®  la 

1  to 
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density  function  Qe(*)  is  defined  on  the  ensemble  £  such  that 


Qe(Te)  A  TT  Q(H <) 
i-1  1 


Q(H®)  A 


L 

TT 

k-1 


d(xj(k)) 


(2. 8. 3.1) 


where  q(>)  is  an  arbitrary  probability  density  function  on  Xe.  Hence  Q(-)  is  a  probability  density 
function  on  the  set  ge.  Then  an  error  probability  or  an  error  probability  bound  is  averaged  with 
respect  to  Qe(*)  over  the  ensemble. 

The  ensemble  error  probability,  denoted  by  either  PE  or  PE(T) ,  for  the  detection  of  the  path 
(through  a  trellis  diagram  T  in  CM)  most  likely  followed  by  the  target  (associated  with  T)  is  defined 
by 


PE  -  l  QS(Te)  Pe(T) 
Te£(? 


-  I  Qe(T6)  P„(T)  (2. 8. 3. 2) 

T  £& 

where  PE(T)  is  the  overall  error  probability  for  the  detection  of  the  path  (through  T)  most  likely 
followed  by  the  target  (associated  with  T)  and  Te  is  the  set  of  all  quantization  levels  from  time  one 
to  time  L  in  T.  Substituting  (2.7.19)  into  (2. 8, 3. 2)  and  changing  the  order  of  summations,  the 
ensemble  error  proabillty  can  be  rewritten  in  terms  of  the  path  ensemble  error  probabilities  as 

M  _ _ 

Pp  -  l  p(H  )  P  (T)  (2. 8. 3. 3) 

E  i-1  1  E1 

where 

pTItT  -  l  Qe(T8)  Pp  (T)  (2.S.3.4) 

Ei  To{.  g  Ei 

where  PE.(T)  is  the  error  probability  of  the  path  Hj,  and  PE. (T)  is  referred  to  as  the  ensemble  error 
probability  of  the  path  H^.  As  being  noticed,  a  bar  at  the  top  of  a  quantity  (symbol)  denotes  the 
ensemble  average  of  that  quantity.  Ensemble  upper  bounds  for  the  detection  of  the  path  most  probably 
followed  by  the  target  can  be  obtained  by  averaging  upper  bounds  for  the  path  error  probabilities 
over  the  ensemble.  Let  PgB  (T)  be  an  upper  bound  for  the  error  probability  of  the  path  H^,  i.e.. 


P,,  (T)  <  P„.  (T)  (2.8, 3. 5) 

fci  ~  EBi 

Substituting  this  bound  for  the  error  probability  of  in  (2. 8. 3. 4)  yields  the  following  bound  for 
the  ensemble  error  probability  of  the  path 

P7TfT  <  t  Q*(TC)  P„  <T> 
i  r*c£  1 

4P^1tT  (2.8.1.6) 

where  Pgg^ (t)  it  referred  to  es  an  enameble  upper  bound  for  the  error  probability  of  the  path  Hi. 
Further,  substituting  the  bound  in  (2. 8. 3.6)  tor  the  ensemble  error  probability  of  tha  path  Hi  in 
(2. 8. 3. 3)  ylslds  tha  following  bound  for  tha  aosemble  error  probability 

H  _ 

P,  <  I  p(H.)  P_,(T)  (2. 8.3.7) 

1  ~  1-1  1  “i 

Let  us  now  darive  an  anstmbla  bound  for  tha  ovarall  error  probability  by  uelng  the  bound  in 
(2. 8. 2.8).  Substituting  the  bound  in  (2. 8. 2. 8)  end  (2. 8. 3.1)  into  (2. 8. 3, 6),  we  can  upper  bound  the 
ensemble  error  probability  of  tha  path  U1  as 
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.  _e_ 


<T>  £  l  “•  I  Q(H*)  Q(H*)  ...  Q(H^)  J  L  ^  1+P  [p'(*L|H1)]1+P  {  I  [p'(iLjH  )]1+P}P  dzL 

1  H®eg8  H^£Be  2  j’li 


for  any  p  >  0 


where  p’Ci^lH^)  Is  defined  by  (2.7.3)  ami 


bi  £  ft 

1  k-0  K 


(2. 8. 3. 8) 


(2.8. 3. 9) 


Changing  the  order  of  sumations  and  integration,  (2. 8. 3. 8)  can  be  rewritten  as 

p  1  1  1 

<  f  L  bt  1+P  {  l  Q<H?MP,(*1,|H1)11+P}{I  •••  l  (IT  Q(H?)) (  l  b^  (p,(zL|H  )1+£>]P}  (2.8.3.10) 

i  ~  !z  „e_„e  1  jdi  1,11  J  jdi  J  J 


H'eH 


V" 


where  p  is  an  arbitrary  non-negative  nuaber.  If  we  restrict  the  parameter  p  to  lie  in  [0,1],  then 
the  tern  in  the  last  braces  can  be  farther  upper  bounded  by  using  Jensen's  inequality.  Let  f (R)  be 
the  term  In  the  last  brackets,  namely, 


1  1 

f(R)  ARP£  [  l  b*+p  (p'(*L|K  ))1+p  ]p  (2.8.3.11) 

jdi  J  3 

where 

1  1 

R  L  ^  ^  (p'(*L|»j))1+°  (2.8.3.12) 


Then,  f(R)  is  a  convex  nfunctio#  for  any  p  c  [0,1]  since  Rp  la  a  convex  n function  for  any  p  c  [0,1). 
Furthermore,  the  term  in  the  last,  braces  In  (2.8.3.10)  1«  the  expectation  of  f(R)  with  respect  to  the 
following  probability  density  function 


l  I  <Tf  Q«»!»  (2.8.3.12) 

Jt»l  J 

19 

Therefore,  using  Jenaen'a  inequality  (If  R  la  a  random  variable,  f(R)  la  a  convex  f>  function  of  R, 
and  B(R)  la  finite,  than 


K{f(R))  <  f (E(r)>  (2.8.3.13) 

where  E  stands  for  the  expectation)  and  recognising  that  H*  la  summed  over  the  seme  apace  g*  for 
all  Jt  and  J 

l  Q(«!)  -  1  (3.8.3.14) 

we  obtain  the  following  bound  for  the  term  In  the  last  braces  la  (2.8.3.10). 

1  .  JL  _I_  _L 

I  l  (IT  0w!))(  l  bj*  (p,(n‘'|H,»1*V  <  {  l  bj4*  ]p  (  [  QC8*)(p'(«Li81))l+0]°  (2.8.3.13) 

jdl  iH  J  jdl  J  J  “  Jdl  i  ^e  1  1 

1  for  any  p  C  [0,1] 


Substituting  this  bound  into  (2.8.3.10),  we  gat 
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1  1  1 

pTTtT  <  b  1+0  [  l  bj+e]p  f  ,[  l  Q(U*)  (p'(*L|H.))1+P]P+1  dzL  (2.8.3.16) 

*  '  i"  1  «;*•  1 

Farther,  using  the  following  Inequalities 

IT  ^  £  b  <  TT  it®"  for  all  i  (2.8.3.17) 

k-0  *  1  k-0  K 

the  tern  outside  the  integral  in  (2.8.3.16)  can  be  further  upper  bounded  by 


-  JL  L  TT®“  p 

b.  1+9  [  I  b*+p]p  <  (H-l)p  TT  for  all  1  (2.8.3.18) 

1  Jdi  J  k-0  Tt“in 


Let  us  now  consider  the  term  In  the  last  brackets  In  (2.8.3.16).  Substituting  (2.8.3. 1)  and  (2.7.3) 
into  this  tent,  and  changing  the  order  of  sumatlona  and  nultiplications  yield 

1  ,  1 


h*eh’ 

s  k-l 

L 

TF 

(  l 

k-l 

xi(k)eX< 

<1 

L 

TT 

<  I  9 

k-l 

xeX* 

.l+o\ 


(2.8.3.19) 


the  laat  equality  follow*  from  the  feet  that  for  all  1  and  k,  xl(k)  1»  a  touted  over  the  awe  apace  Xs. 
Substituting  (2.8.3.18)  and  (2.8.3.19)  Into  (2,8.3.16)  yields  tha  following  saeaable  upper  bound 


for  the  error  probability  of  tt(. 


t  »***  P  t  ,  J_ 

om)p<TT  TT  (  l  l  «iU)  (p'(«<w)|x))l4°rl) 

fcl  “  k-0  k-1  'e(k)  # 


for  all  1  and  any  »  t  (O.U  (2.8.3.20) 


where  p'(x(k)(x)  la  given  by  (2.7.10),  *"*  and  v?1”  are  defined  In  Section  2,5,  and  q(>)  1*  an 
arbitrary  density  function  on  X*.  Substituting  the  bound  in  (2.8,3.20)  Into  (2. 8. i. 7),  and  recognising 
that  this  bound  does  not  depend  on  1  (that  la,  the  paths)  yield 

K<  !  pOuITt? 

6  l-l  1 


»(t)  l  p(a.)  *  «tr7 
i-i  1 


(2.8.3.21) 


Hence,  the  bound  In  (2.8.3.20)  U  also  an  upper  bound  for  the  rateable  error  probability  (or  the 
detection  of  the  path  scat  probably  followed  by  the  target. 

It  the  function  g(k,  .,  .)  In  the  oheervatlon  model  being  considered,  and  the  statistics 
of  the  observation  noise  v(k)  and  the  interference  t(k)  (In  the  pretence  of  interference)  ere  tlae- 
Invarlent,  then  the  tare  in  braces  la  (2.8.3.20)  la  tie*- Invariant.  Hence,  la  this  case,  the  rateable 
upper  bound  ta  (2.8.3.20)  becoaea 

l  **•*  JL  _i_ 

?,  <  (H-l)p  (  TT  4;)W  i{  t  I  9(*>  (p,(a(h)|«»W>l‘*l»L  for  eay  o  c  (0,1)  (2.8,3.22) 
s“  k-0  ej10  ;*(k)  ^a 

Dalai  the  telatloa  that  expUaal  •  a  for  aay  a  >  0,  tha  bound  la  (2.8.3.22)  can  he  rewrlttea  as 

f,  <  eap-tU^tiJ.g)  -  d  -  •jjjj  for  aay  p  c  (0,lj  (2.8.3.23) 
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where 


EflCP.q)  &  -ln{| 


z(k) 


[  l  q(x)  (P'(z<k)|x))1+p]pH} 


xeX 


(2.8.3.24) 


ga  TT 

k-0 


max 

\ 

min 

\ 


%(pi<l)  is  referred  to  as  the  Gallager  function  since  It  vas  first  defined  by  Gallager  .  Recalling 
that  p  is  any  arbitrary  nusber  in  [0,1]  and  g(*)  Is  an  arbitrary  probability  density  function  on  Xe 
leads  us  to  obtaining  the  tightest  bound  on  PE  by  minimizing  the  right  hand  side  in  (2.8.3.23)  over 
p  and  q.  this  gives  us  the  following  bound 

P_  <  exp-LE(M,G,L)  (2,8.3.24) 

£> 


where 


E(M,C,L)  A  max  max  [E  (p,q)  -  p  - - 

3  pttO.l)  0  '  h 

■  max  [max  En(p,q)  -  P  - 

PC  [0,1]  3  g  '  1 


0  InC. 
1+p  1  1 

p  InG. 
1+p  L  1 


(2.8.3.25) 


As  it  has  been  noticed,  the  aaxlaisation  is  taken  over  all  p  c  [0,1]  and  the  set  of  all  possible 
probability  density  functions  on  X*.  In  order  to  evaluate  E(M(G,L),  it  is  necessary  to  analyse 
Eq(P,9)  as  a  function  of  p.  The  Important  propartles  of  this  function  are  stated  in  the  following 
theorem.  The  proof  of  this  theorem  is  presented  in  Reference  [17). 


Theorem  2.6.3. 1 

Assume  that  the  average  mutual  information,  denoted  by  1 (q) ,  which  is  definad  by 


Kq)  A  I  [  q(x)  p*(a(k)|x)  Io[-  .  - )  da(k) 

'a(k)  ye  l  q(s)  p'(s(k) jx) 

xtX* 


(2.8.3.26) 


is  nonsero  (in  fact,  Kg)  is  always  non-negative).  Then,  Eo(o.q)  has  the  following  properties 


yo.g) 

•  0 

lor 

o  «  0 

(2.6.3.27) 

^(o.g) 

>  0 

lor 

p  >  0 

(2.6.3.28) 

36„<*.q) 

Sp“ 

>  0 

for 

p  >  0 

(2.6.3.29) 

^(e.j) 

dp 

1  •  Kq) 

‘p-o 

(2.8.3.30) 

^(o.q) 

<  0 

(2.6.3.31) 

with  equality  in  (2.6.3.31)  if  and  only  if 


In [ - P-teMi*) - j  „  (2.6.3.32) 

l  m  qU)  p*(a(k)  [*) 
a a 

for  ell  x  c  X*  sed  all  t(k)  la  the  specs  of  ell  passible  observations  at  time  k  such  that 

q(a)  p'(s(k)|»)  >  0.  Therefore ,  for  a  gives  q,  Iglo.q)  is  s  positive  i sc rasa  lag  cosvas  n  function 

of  p  c  (0,»)  with  g  slops  at  the  origin  equal 'to  l(q)T  Also,  the  following  function 
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is  a  convex  O  function  of  p  e  [0,<»>.  Hence  we  can  easily  perform  the  maximization  in  (2.8.3.25) 
over  p  e  [0,1]  for  a  given  q(’)  so  that  E(H,G,L)  can  be  expressed  parametrically  as^,17 


0 


If  R  >  C 


E(M,G,L)  -  < 


max 

3 


.  ,  ln(M-l)  PlnG 

E0(P,q)  -  P  ^ 

„  s  InCM-l)  InC 

V  Eo(1*S>  “  “l  '  IT 


If  '5p[mqX  E0(p*3)] 


P-1 


<  R  <  C  (2.8.3.33) 


I£  E0(P*3)J 


P-1 


where 


RAlEGhil  +  M 

*  L  4L 

C  A  max  l(q) 


(2.8.3.34) 


The  maximization  of  the  Gallager  function  Eo(p,q)  and  the  average  mutual  information  I(q)  over  the 
apace  of  all  possible  probability  density  functions  on  Xe  has  been  treated  in  the  literature.  Two 
theorems  relstcd  to  this  maximisation  are  stated.  Their  proof  can  be  found  in  References  [16]  and  (17). 


Theorem  2. 8. J. 2 

A  probability  density  function  qgW  an  Xc  maximizes  the  function  E<)(p,q0)  for  a  given  p  >  0 
if  on  only  If  the  following  holds  w  ~ 

f  [p*(z(k)|x))l+0  [o(z(h),q  ))°  >  f  [a(*Ck),q  ))1+p  for  all  x  cX°  (2,8.3.35) 

z(h)  'a(k) 

with  tquality  tor  alt  x  t  X®  for  which  q^U)  >  0  where 

1 

a(e(k),q  )  &  l  q-(s)  [p'fzfk) (x))1'*0  (2.8.3.36) 

'u  xtX*  ” 


TfSOfi*  .8.3,3 

A  p.  ..  „c,4,  .>•  danstty  .unction  qg(-)  on  X*  maximizes  tha  average  mutual  information  J (q. j  if 
and  only  if  tha  following  ho  Ida 


p*(a(k)|x)  la[ - - — ]  <  i<.  )  for  .u  „  t  *«  (2.8.3.37) 

*(k)  l  <h. (x)  p‘(a(k)|x)  -  ^ 


equality  for  all  x  for  which  «^(x)  >  0. 

tt  than  followe  that  neither  of  these  theorems  la  very  useful  In  finding  the  maximum  of  the  Cellager 
function  or  the  everege  mutual  information,  tut  both  are  useful  in  verifying  that  e  hypothesized 
solution  le  Indeed  e  solution.  For  example,  using  thean  theorems.  It  can  he  verified  that  the  uniform 
distribution  on  Xa,  that  in 


qU>  -  “  for  all  x  t  X*  (2.8.3.38) 

r 

le  not  thn  optimum  (maximising)  distribution  for  nlther  Efl(p.q)  or  I(q).  In  general  the  maximization 
over  the  probability  density  functions  on  X*  must  he  performed  numerically.  Even  If  tha  optimum 
(aaximlzUg)  probability  density  function  in  known,  thn  evaluation  of  tha  Cellager  function  or  thn 
overage  mutual  information  le,  In  general,  net  easy  at  ell  since  the  related  expressions  contain 
multidimensional  Integrals,  hence  the  evaluation  of  than  must  be  performed  numerically,  throughout 
thin  chapter,  sa  tha  ensemble  upper  hound,  the  bound  using  p  «  1  end  the  uniform  distribution  for 
q(-)  in  (2.6.3.20)  In  weed  by  virtue  of  thn  Men  feature  arising  from  tha  watfem  distribution  (which 
in  statad  in  thn  nnxt  theorem)  and  the  feet  that  tha  a  implant  function  to  calculate  among  t<)(P,q'  U 


EQ(l,q).  Obviously  this  bound  is,  in  general,  not  as  tight  as  the  bound  in  (2.8.3.24).  Substituting 
p  “  l'and  (1/Ne)  for  q(x)  in  (2.8.3.20),  we  obtain  the  following  bound  for  the  ensemble  upper  bound 
for  the  overall  error  probability 


i 


where 


PEiD 


TT  {(  [  I  (p’(t(k)|x))1/2l2) 

k«l  Ji(k)  xeXe 


a  „e 

&  B 


(2.8.3.39) 


L 

DA  (M-l)  [  TT 

k-1 


aax 

<-4s)1/2}  <-^>2L 

Tr"ln  N* 


(2.8.3.40) 


If  the  function  g(k,  .,  . ,  ,)  in  the  observation  model  being  considered  and  the  statistics  of 
the  observation  noise  v(k)  and  the  interference  l(k)  (in  the  presence  of  Interference)  are  time- 
invariant,  then  the  bound  in  (2.8.3.39)  becomes 


<  D  (j 


t(k) 


[  l  (p’UMlx))1'2!2}1* 

xeX« 


(2.8.3.41) 


Let  us  now  prove  the  following  theorem  which  gives  ue  the  reason  that  a  uniformly  weighted 
ensemble  bound  (like  (2.8.3.39))  is  used  as  the  performance  measure.  A  uniformly  weighted  ensemble 
bound  is  an  ensemble  bound  obtained  by  using  the  uniform  density  function  Q*(>)  on  the  ensemble 
l.e., 


Uence 


for  all  x  C  X® 


Qtt®) 


J)™ 


for  all  T*  e  S 


(2.8.3.42) 


Theorem  2. 8. 3.4 

For  a  given  uniformly  weighed  ensemble  upper  bound  1*  for  the  overall  error  probability,  there 
exists  a  subset,  denoted  by  f  H#,  of  the  ensemble  €h  such  that  £«*  cone  tain*  at  least  hatf  of  the 
elements  In  vH,  and  every  element  (trellis  diagram)  in  must  have  an  overall  error  probability 
which  ia  less  than  or  equal  to  two  times  8*,  l.e.. 


P-lt)  <  28*  for  mil  T  t  (2.8.3.43) 


where  P|(f)  Is  the  overall  error  probability  of  T  (l.e.,  the  overall  error  probability  for  the  detection 
of  thm  path  through  ¥  most  likely  followed  by  the  target  associated  with  the  trellie  diagram  t>. 


Proof:  Ut  us  assume  that 1*  the  comf iwseet  of  fit,,  l.e.. 


Pfi(T)  >  28*  for  all  t  *  £it 


(2.8.3.44) 


t 


tha  suain,  demoted  by  8,  of  elements  la  ^  is  greater  than  half  of  the  number  of  eleueote  In 
N  (otherwise  there  le  nothin  te  prove),  l.e.. 


It  > 


(2.8.3.45) 


Since  both  Q*(t*)  «ad  f.(t)  are  positive  far  ell  T  in  and  ? H,  le  contained  in  the  ensemble 
arm  prohaklllty  Pg  defined  by  (2.8, .1,2)  cam  he  lesser  bounded  by 

Fg  I  I  Q*(T*)  »g(T) 
tt?5^ 


(2,8.3.48) 


Substituting  the  bound  in  (2.8.3.44)  for  the  overall  error  probability  of  T,  t!  ensemble  error 
probability  can  be  further  lower  bounded  as 


P  >  2B®  [  Qe(T*) 

l*e  *H~ 
s 


(2.8.3.47) 


Substituting  (2.8.3,42)  into  (2.8.3.47)  (since  the  uniformly  weighted  ensemble  bound  is  considered) 

ve  get 


P 


8 


(Ne) 


IK 


-  2B8  K  -~rj  (2.8.3.48) 

(N*)1* 

Further,  substituting  the  lower  bound  tn  (2.8,3.45)  for  K  in  (2,8.3,48),  the  bound  in  (2.8.3.48) 
can  again  be  lower  bounded  so  chat  ve  have 


> 


This  contradicts  the  assumption  that  B8  is  an  ensemble  upper  bound.  This  completes  the  proof. 

If  the  ttellia  diagram  associated  with  the  target  being  considered  is  a  member  of^Hg,  then 
the  overall  error  probability  of  this  trellis  is  upper  bvaruded  by  2b«;  otherwise,  we  do  not  have 
any  idea  about  this  overall  error  probability.  SineeCH*  contains  at  least  half  of  tli>:  *  levin's 
jtn  the  ensemble  cN.  there  is  a  good  chance  (hat  the  trellis  diagram  being  considered  belongs  to 
ctoa.  However,  for  nonuni forrniy  weighted  ensembles,  a  large  subset  (of  CM),  every  element  of  which 
has  an  overall  error  probability  bounded  by  a  constant  bound,  say  nut  easily  be  obtained  since  the 
arsunv-'Vt*  must  include  the  effect  of  nanuttifora  weighting.  That  is  why  uniformly  weighted  ensemble 
bounds  are  used  as  the  performance  measure. 

2.9  mat  SdiQdlvrtAL  OtCODHiC  BASHD  SNOOTMING  AlfiOfcltSiH 

In  tracking  a  target  from  time  to  Use  l,  by  using  the  optimum  decoding  baaed  smoothing 

algorithm  <00$A),  the  path  sent  likely  followed  by  the  target  imultiple  (coopnsite)  hypothesis  testing 
or  dcodlng  problem)  1*  decided  by  simply  finding  the  path  with  the  largest  metric  through  a  trellis 
diagram  from  time  tern  to  time  t.  UOSA  does  this  by  using  the  Vi  ter'-' 1  decoding  algorithm,  which 
systematically  examine*  (searches)  alt  possible  paths  in  the  treHis  diagram.  Hence,  if  the  number 
of  possible  paths  in  the  trellis  diagram  is  eery  large,  OOSA  require*  a  huge  amount  of  memory  and 
computation. 

If  there  were  a  way  to  guess  the  correct  path  without  calculating  the  spirit  of  eveiy  path  in 
the  trellis  diagram,  most  of  the  computation  and  memory  yeguiftment  in  sH)SA  cnnld  bo  avoided,  line 
way  to  do  so  is  to  use  a  smoothing  algorithm  using  a  stack  sequential  decoding  algorithm  (which  is 
subopcimum,  t.e.,  it  doe*  not  minimise  the  overall  error  probability)*^**.  Iweh  a  worthing  algorithm, 
which  at  any  time  (step)  stores  a  "stack**  «i  already  searched  paths  of  varying  length  ordered  according 
to  their  metrics,  is  presented  below.  This  algorithm  is  referred  u  as  Stack  Sequential  Decoding  based 
Smoothing  Algorithm  (SSD5A). 


initial  Step  -  deducing  the  target  notion  model  to  a  finite  state  mods'),  as  described  before, 
obtain  a  trellis  diagram  for  the  target  morion  model  from  time  aero,  to  tine,  say  U  until 
which  the  target  will  be  tracked.  Calculate  the  metric*  of  all  Initial  paths  (by  convention, 
an  initial  nod*  and  it*  metric  are  referred  to  as  an  initial  path  and  the  metric  of  tbi* 
Initial  path  respectively).  Then  store  these  paths  and  their  metric*  and  order  the* 
according  to  their  metrics, 

Recursive  Step  ~  Compute  the  metric*  of  the  path*  which  are  the  elngle*br«nch  continuations 

of  the  hast  path  in  the  stack  (sse  Definition  2.9.1  below)  snd  replace  the  best  path  and  It* 
metric  by  these  path*  and  their  metric*,  if  any  of  the  newly  added  paths  merge*  with  a  path 
already  In  the  stack,  discard  the  one  with  smaller  metric.  Then  reorder  the  remaining 
path*,  if  the  best  path  in  the  Stack  terminates  in  a  final  node  of  the  trellis  diagram, 
stop  and  choose  the  best  path  sis  the  path  meet  probably  followed  by  the  target;  otherwise 
repeat  the  process  (i  e. ,  continue  to  search  by  extending  the  best  path  to  the  stmek). 

Definition  2.9.1  The  best  path  la  s  stack  of  alteady  searched  "aths  of  varying  length  Is  tbs  on* 
with  the  Urgeet  metric.  If  there  la  mors  than  one  path  with  the  asms  largest  metric,  than  cha  best 
path  la  the  ont  with  tht  longest  length,  if  there  is  more  than  one  path  with  the  same  largest  metric 
•ad  lergth,  then  the  beat  path  la  only  one  of  theme  paths  (this  1*  chosen  at  random). 


The  following  example  illustrates  the  stack  sequential  decoding  baaed  smoothing  algorithm. 
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2.9.1  An  example 

Let  us  consider  a  target  whose  notion  frcci  tine  zero  to  time  2  is  described  by  Figure  2.9.1. 
Using  SSDSa,  we  would  like  to  find  the  path  (through  the  trellis  diagram)  nost  probably  followed  by 
the  target  from  tine  zero  to  tine  2.  Let  us  first  adopt  the  following  conventions. 

Tine  Zero  Tine  1  Time  2 


1  J  I 


Figure  2.9.1.  The  Trellis  Diagram  for  the  Example  for  the  Stack  Sequential  Decoding  baaed 

Smoothing  Algorithm 


1.  In  a  stack!  *  searched  path  Is  represented  by  the  node  sequence*  that  the  path  pasees 
through  followed  by  Its  metric.  The  sequence  ami  the  metric  are  separated  by  a  comma. 

The  metric  of  the  path  ie  denoted  by  KSC-)  where  the  term  in  parentheses  U>  the  final 
oad*  of  the  path.  Further,  different  eearched  patha  are  eeparated  by  eemlcelena. 

U.  A  stack  Is  ordered  In  such  a  way  that  the  beet  path  1*  placed  at  the  end  of  the  stack  and 
the  path  with  the  second  lsrqeet  metric  Is  placed  before  the  best  path,  etc. 

then  the  path  mast  likely  followed  by  the  target  te  obtained  as  follows. 

Initial  Step  -  Assuming  that  the  metric*  of  the  initial  path#  (i.e..  Initial  node#)  3.,(0), 
and  *^(0)  are  such  that  s 


KSU^CO))  l  H$(»^(0>>  >  KS  (*  <«))  (2,9.1. 1) 

where  KS(‘)  A  WHO  la  defined  by  (2.1. It),  the  following  stack  la  obtained 

*,!«».  Md^Wh  KSfa^WJi  *^(0),  IUU  (0»  «. 9.1,2) 

Flrat  Step  -  the  path*  X-j(0)  x^jU)  snA  *q}C0)  a-jil)  are  the  single-branch  continuation*  of 
the  bast  path  a^j(0j  In  (2. 9. 1.2).  Hence  calculating  the  metric  cf  these  paths,  that  ie, 

XSia  .CD)  -  *(*.«>  *  a  .»))  ♦  HSU  ,(0)) 

qi  q*  ql  q2 

Ittta^CD)  -  HU^CO)  *  *,jO»  ♦  HbU^jCO))  (2.9.1.  J) 

where  H(**y)  U  defined  by  (3.  MS),  sad  replacing  the  beet  path  and  tta  metric  tl.e., 

*,2(0).  HSU^j(O)))  by  thaae  pacha  and  th*u  metric*,  we  get 

*<l(0)4eU^s«))tat3<0)^SU1j(0»;a<?2(0)g({lU),HS(a^a))}a<J(0)a4ja),KS(*oa))  (2.9.1.*) 

tee  assuming  that 

(OU^CO)}  *  HSCa^jd))  >  HSu^U))  >  W(s^(0)), 


(2.9.1. 5) 


I  v-i'» 


and  then  reordering  the  paths  in  (2. 9.1.4)  according  to  their  metrics  -e  obtain  the  stack 


xql<0),M$(xql(0)){xq2<0)*qia),KS(xqlU));xq2(0)xq2U),HS(*q2(l));xq3(0).MS(xq3<0))  (2. 9. 1.6) 

hence,  the  best  path  Xq3(0)  does  not  terminate  in  a  final  node  in  Figure  2.9.1.  Therefore, 
we  shall  continue  to  search  by  extending  the  best  path  xq3<0)  similarly. 

Second  Step  -  The  paths  Xq3<0)  Xal(l)  and  Xq3(0)  Xqjd!  are  the  single-branch  continuations 

of  the  best  path  Xqj(u)  in  12.9.1.6).  Thus  calculating  the  metrics  of  these  paths  and  then 
replacing  the  best  path  and  its  metric  in  (2. 9. 1.6)  by  these  paths  and  their  metrics  yields 


V<0)'}iS(Xql(0))i!‘q2<0)8ql(1)'HS(Va));Xq2l0)V(1)*KS{sq2<1)):Xq3(0)Xql(1)-MS<Xql(1)):y<l3{0)x<,2a)* 

MS (x  ,(!)).  (2.9. 1.7) 

9* 

Hence  the  newly  added  paths  CO)  *qi(l)  and  x  3(0)  Xqjfl)  Serge  with  the  paths  Xq2(0)  XqjCl) 
and  Xqif.O)  Xqjd)  (which  are  already  in  the  stack)  respectively.  Assuming  that 


MS  (a  ,<U)  of  x  ,(0)  *  ,<1)  >  HS(x  ,<!))  of  x  (0)  x  .(’) 
and 

HS(x  2<1»  Of  xq2(0)  xq2(l)  >  KStx  a(D)  of  *q3<0)  xq2(i)  . 

the  paths  Xqj(Q)  *qj(l)  end  XqjCQ)  Xqj(l)  and  their  metric*  are  discarded,  heoee  we  have 
the  stack 


«  .(0>*  HS(xqlW)i  xqJ(0)  xqia>,  HS(xqlU))S  *^2(0>  *q2d).  XS(xq2U))  (2.9. 1.3) 

Still,  tfcu  best  path  *qj(0)  SqjU)  l«  (2.9. 1.8)  does  not  terminate  In  one  of  the  end  nodes 
<»l  the  trellis  dlsgtss*.  Hence  the  beat  path  xqJC05  *q,(U  is  extended. 

third  Step  -  Computing  the  metric*  oi  the  path*  *qj<0>  Xqjtl)  xqi(2)  and  *qj(0)  »qj(l)  XqjO), 
nastely,  . 


«Stxi(.U»  -  M(s  .(1)  -  *,,<2))  ♦  XS(*  .(!)) 

91  9*  9*  9* 

HS(*qJ(i»  «HC*q2(l)  -  *qjU>)  ♦  KSCs^H))  .  (2.9. 1 .9) 

replacing  the  best  path  end  it*  settle  In  (2. 9. 1.8}  by  th*-«e  psths  and  their  settles,  and 
then  reordering  all  the  paths  according  to  their  settle*,  let  «*  «ss<ste  that  the  following 
stark  Is  obtained 

e4|l(0).HS<*|)l(0))j«)tJC0)«qlU),HS(*qi(l))?*<{?{0)*^a)a|iJl2),KSUq.(2));«ql{O)»q2«)a)}l(i),«isql(2)). 

<2.9.1.10) 


Since  ths  best  psth  *«j(0)  x-jd)  *qj<2)  terminates  la  the  final  node  *«)(2)  In  the  trellis 
diagrse.  It  is  decided  thst  we  peib  *  2<9)  »  j(l>  *<jl{1*  w“  09,1  Prswly  followed  by  the 
target.  9  9«  •» 

Hirlig  established  the  stack  sequential  decoding  based  smoothing  algorithm.  Its  performance  is 
going  to  be  discussed  in  the  following  two  tact Iona, 

2,9.2  An  tipper  bound  (or  ths  Orerall  Error  Probability 

Let  us  consider  s  target  whose  motion  from  tine  eeto  to  time  L  is  described  by  s  trellis  diagram 
with  H  possible  paths  lij,  Hj,  ....  and  %  from  tint  taro  to  time  L.  Let  H*  and  be  two  path*  through 
the  trellis  diagram  such  that  is  the  correct  psth  (l.e. .  It  is  the  one  actually  followed  by  the 
target)  and  1C.  Is  the  Incorrect  one  (see  Figure  2.9.2).  It  is  esetly  verified  that  since  et  «ach 
step  the  stack  sequential  decoding  based  smoothing  algor  1 ton  ($5 OJA)  extends  only  the  best  path  In 
tbs  stack  by  only  on#  branch,  the  path  i^t  cannot  be  cfcoeon  as  the  one  most  probably  followed  by  the 
target  (as  the  decision)" 


If  N(s*(l»  >  M(n*(J))  (or  *11  I  r  S  and  son#  )tS 


(2.9. 2.1) 
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Tine  Zero 


\i<°> 


J 


Tine  One 

J 


Time  i  Time  j  Time  L 

I  l 

•  •  • 


Initial  nodes  End  nodes 

Figure  2.9.2.  The  Trellis  Diagram  for  the  Performance  Analysis  of  the  Stack  Sequential  Decoding 

based  Smoothing  Algorithm 


or 

If  Y  >  M(xm  (j))  for  some  j  e  S; 

ttt 

however,  the  path  H^,  may  be  chosen  as  the  decision 

If  M(x“'(j))  l  Ym  for  all  jcS  (2.9. 2. 2) 

where  M(x™(i))  is  the  metric  of  the  portion  between  the  nodes  xn(0)  and  x°(i)  of  the  path  H  ,  which  is 
defined  by  (2.7.16),  and  q  q  m 

S  ^  (0,  1,  2,  ....  L) 

Yb  £  «in  M(x“(i»  (2.9.2. 3) 

ixS 

let  r  be  a  subset  of  the  observation  space  D  such  that 
m  • 

r  A  {*Li  M(x*j'(j))  >  y  for  all  jcS  and  all  m'j*m}  (2.9.2.A) 


Since  any  incorrect  path  H,,!  (the  one  which  waa  not  followed  by  the  target^  can  be  chosen  as  the 
decision  only  if  (2. 9. 2. 2)  in  valid,  the  set  rn  contains  the  complement,  D»,  of  the  decision  region 
Dq  for! the  path  Hm  ( the  decision  region  Dq  is  by  definition  the  subset  of  the  obeervttlon  epece  such 
’  thst  whenever  th*.  observation  sequence  fsllg  within  this  subset,  SSDSA  decides  the  path  as  the 

one  most  likely  followed  by  the  target).  Let  fn  be  another  eubeet  of  the  observation  space  D  euch  that 

f  -  {sLi  M(x“  &))  >  Y_  tor  all  m'dm)  (2. 9. 2.5) 

m  x  ”18 

Since  the  inequality  in  (2. 9. 2.4)  implies  the  inequality  in  (2, 9. 2. 5)  (the  converse  it  not  true), 
ra  le  contained  in  fn.  Hence  5™  is  a  subset  of  Fa.  Therefore,  the  error  probability  of  the  path 
(2.7.10))  can  be  upper  bounded  by 


V»i*  h2 . vi  kef  d*L 

A  J  L  P(*L|V  9(aL)  d«L 


(2. 9. 2. 6) 
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where 


4>(*L)  A  < 


|  0  elsewhere 

Moreover,  Che  function  $(zL)  can  be  upper  bounded  by 


(2,^.2. 7) 


9(*L)  <  t  I  exp  ct(M(x®  (L))  -  Y_)]P  for  any  a,  p  >  0  (2. 9. 2. 8) 

“  m'*a  q  « 

The  reason  that  (2. 9. 2. 3)  Is  valid  Is  as  follows:  If  *L  e  fa,  It  follows  froo  (2. 9. 2. 5)  that  there 
exists  at  least  an  a'  i  a  such  that 

M(x*'(i.)>  -  Y  >  0 

q  a  — 

Consequently 


exp  a(M(x“  (L))  -  Y  )  >  1 
q  a  — * 


for  any  a  >  0 


Therefore  the  swats t Ion  In  brackets  In  (2. 9. 2. 8)  Is  greater  than  or  equal  to  one;  obviously  any  non- 
negative  power ,  say  p,  of  It  la  greater  than  or  equal  to  on*.  On  the  other  hand  If  tL  t  fa,  the 
tern  In  brackets  In  (2, 9. 2. 8)  la  at  least  non-nogatlve.  Hence  (2. 9. 2. 8)  la  valid.  Further, 
substituting  (2. 9. 2. 8)  into  (2. 9. 2. 6)  yields 


PK  <U1,H2,...,y  <  |  L  p(*L|Hb)  (exp  -  OP-  H  exp  a  H(x“'(l)>)P  UL 
Also  froi  (2. 9. 2. 3),  we  have 


for  any  u,p  >  0 


(2.9. 2. 9) 


l 

exp  -  oPY  <  l 

*  1-0 


exp  -  oc  M(x“(l)) 


(2.9.2.10) 


sluca  at  least  one  tore  In  the  suaaatlon  la  equal  to  the  left  hand  side  of  the  Inequality  and  the 
other  terns  In  the  sunaatlon  are  at  least  non-negative.  Substituting  (2.9.2.10)  Into  (2. 9. 2. 9),  ve 
get 


pE  <v»2. 

flk 


exp  a  M(x*’(t.)>lpdiL 


for  any  a.o  >  0  (2.9.2.11) 


Moreover,  using  (2.7.11),  (2.7,16)  sod  the  equality 


exp  (lna)  •  a  for  all  a  0  , 


(2.9  2.11)  can  be  rewritten  as 


*•«»!»>):♦  I  (»S>'QP  •  IT  («v* 

*>,  1  *  UL  0  k-1  <  1-1  0  k-l  * 


•  (p'fslkllx^k)))1^  .  ft  pU*U>l«£U)>1  •  (  l  l«!'  ft  «?'  P,(*(h)tx*'{k))l(,)p  da1- 
9  j-iel  *  e'fW  u  lt-1  * 


for  any  o,p  >  0  (3.9.2.12) 


Morer  tr,  using  the  following  Inequalities 


win 

k  -  k  —  k 


for  ell  •  and  k 


(2.9. 2. 11) 


the  bound  In  (2.9.2.12)  can  be  upper  bounded  further  to  that  we  obtain  the  following  bound  for  the 
error  probability  of  tha  pact  8#. 
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PE  <VH2 . V  -  I  L  *  TT  P*(t(k)|«J(k))  +  l  (it! 

m  ‘  z  k«l  H  i-1  u  k“l 


min. -op 
k  ' 


•  (p'utwixjtk)))1-^  TT  p*(«(j)|x«(j))]  {  i  [,»“TT  rp-ww^'wnVd*1 

q  J-i+1  q  m'*«  0  k*l  k  <1 


for  any  a,p  _>  0  (2.9.2.14) 

whore  p  (z(k)lxq(k))  is  given  by  (2.7.10)  and  ^  #  and  n***  are  defined  in  Section  2.5.  Moreover, 
substituting  this  bound  into  (2.7.19),  we  obtain  an  upper  bound  for  the  overall  error  probability. 

Since  it  contains  multidimensional  integrals  as  in  (2.9.2.14),  it  can  not  be  evaluated  easily. 

Therefore,  an  ensemble  upper  bound  on  the  overall  error  probability  is  considered  in  the  next  section. 

2.9.3  An  Ensemble  Upper  Bound  for  the  Overall  Error  Probability 

Setting  a  equal  to  l/(l+p),  the  bound  for  the  error  probability  of  the  path  H„  in  (2.9.2.14) 
can  be  rewritten  as 


PE  (T)  i  J  L  V*L>  {  J-  V<*L>)P  d*L  any  p  >  0 


(2.9.3. 1) 


_  P  j  i  _  _P_  j  p  1 

y*L>  k  (^ln)  **  TT  p'(r(k) |x*(k))  +  :  {(a“in)  1+*  TT  p'U(k)|x“(k))1+p 

k-1  H  i»l  k»l  9 


*  IT  P’(*(j)|x*(j))) 
j»i+l  q 


!L.<«u)A(«r>1+0  Tt  O140 

u  k-1  *  9 


(2. 9.3.2) 


Averaging  this  bound  over  the  eneeoble  #«a  la  Section  2.8.3,  w*  obtain  the  following  bound  for 
the  ensemble  error  probability  of  the  path  H^, 

- 1 A  .1 .  ""I11  f  L  V* <  L  -L 

*  ■  ** 

•JL(  l  Q(H*)  AuUSjfIT  (  l  Q(U?)  {  l  B  ,(»**)  1P)  daL  (2, 9.3. 3) 

*  U^*  £  <«#* 

where  p  is  eny  non-negative  number.  Hence,  reetrlctlng  p  in  the  interval  (0,  1),  and  using  Jensen's 
inequality  (as  iu  Section  2,8,3),  the  tern  in  braces  in  (2. 9. 3. 3)  cen  be  upper  bounded  an  follows 

<  |  I  Q«£>  y*L)l  (  J.  I  Q(H*,>  lBt(sL)J°  d*1 

V 

•  (M-L)0  j  t  (  l  Q(H*)  A^sSil  I  Q(H*,)  8#,(aS)P  dtL  for  any  p  €  (Q,l|  (2.9,3.*) 
*  < 

The  lent  equality  follow  from  the  fact  that  the  summation#  run  ovar  tbs  same  space  H*.  Further 
using  (2. 8. 3.1)  with  an  argument  similar  to  the  one  in  Section  2.8.3,  we  can  aaally  obtain  the 
following  equalities 

l  q<<)  y«b  •  ft  FWiJTiy  +  I  ^  TT 

-•  k-1  i-1  k-1  * 

- — r  L 

•  p'ucwlm)^  TT  TOOT 

J-iH 


(2.9.3. 5) 
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and 


where 

p'(z(k)|x)  A  l  q(x)  p'(t(k)jx) 
xeX* 

X  jl 

p‘f*(!;)|xr+0  A  l  z(x)  p’(z(k)|x)1+p 
xtXe 


for  any  p  e  [0,1] 


(2. 9. 3. 6) 


where  p' (r(k) |x)  is  given  by  (2.7.10),  q(>)  is  an  arbitrary  probability  density  function  on  the  set 

X®  and  X*  ic  defined  *n  2.8.3.  Substituting  these  equalities  into  (2. 9. 3. 4),  and  changing  the  order 

of  Integrations  and  multiplications,  we  obtain  the  following  bound  for  the  ensemble  error  probability 

of  the  path  H 
o 

p_  (T)  <  F  .  ni  c  +  J  TF  (^in)  1+P  o.  TT  C  )  for  any  p  e  [0,1]  (2. 9. 3.7) 

a'l  i-ik-1  J-i+1  J 

where 


1 

C.  4  f  p'(t(k)|x)  [p'(*(k)|x)1+0]p  dz(k) 
*  't(k) 


1 

A  f  [p,(a(k)|x)1+p]1+<>  dr(k) 
*  ■'a(k) 


P  A  (H-l)p  l  IT  (*?*V^) 

k-1  * 


(2. 9. 3. 8) 


where  •  \  «  and  **  ate  •*  defined  in  Section  2.5. 

If  Che  function  g(k,  .,  .,  .)  In  the  observation  model  being  coreioered,  the  statistics  of  the 
observation  noise  v(k),  end  the  Interference  I(k)  (in  the  presence  of  lntcrjerence)  sre  all  time 
invariant,  then  Cg  and  0^  are  time  invariant,  in  this  case,  the  bound  In  (2. 9. 3. 7)  becomes 

l,  l  .  ft 

FTM  <  F  •  <<£  +  I  I  "IT  v<!lu>  ^  U>l  •  C.Ui)  for  an,  p  t  [0,1]  (2. 9.3. 9) 

*  11  i-1  k»l  *  *  * 

Since  the  bound  in  (2, 9. 3.7)  doe*  not  depend  on  the  net!,  (l.e.,  ■),  it  follows  from  (2. 8, 3. 3)  that 
this  bound  1«  also  so  upper  bound  for  the  ensemble  error  probability  Pg,  for  the  detection  of  tb** 
peth  most  probably  followed  by  the  tnrgst  being  considered.  Furthermore,  the  integrals  in  (2. 9. 3. 7) 
may  not  be  evaluated  for  any  p  e  (0,1).  Hence  the  eesleat  bound  to  calc-let*  Is  thw  one  for  p  »  1, 

This  feet  with  the  nice  feature  of  the  uniformly  weighted  ensemble  bound  (see  Theorem  2. 8, 3. *)  led* 
us  tc  using  the  bound  with  P  •  1  add  the  uniform  density  function  for  q(  >  .l.e.,  q(*)  «  1/N'  where 
H*  is  defined  in  Section  2.6.3))  in  (2.9.3  7)  as  the  performance  measure  of  the  stack  sequential 
decoding  baaed  something  algorithm. 

2.10  soaomuM  DECODING  used  smoothing  augomthi 

As  described  before,  In  order  to  decide  the  peth  most  likely  followed  by  a  target  from  lime 
kero  to  time  l,  the  optimum  decoding  based  smoothing  algorithm  COOSA)  and  the  stack  sequential 
decoding  baaed  smoothing  algorithm  (SSOSa)  first  obtain  «  trellis  diagram  (denoted  by  T)  for  the 
larger  motion  (model)  from  time  tero  to  time  L|  then  uee  the  Vlterbl  decoding  algorithm  (VDA)  sad 
stack  sequential  decoding  algorithm  respectively.  The  number  of  paths  in  the  trellis  diagram 
depends  on  L  ai  well  es  no,  mg,  mg,  ....  *L,  end  the  grte  slsea  used  to  reduce  tfc  target  motion 
model  to  e  finite  state  model  where  n^  and  mg  are  the  number  of  possible  vs  lues  of  the  d leers'* 
random  vectors  xj(P)  end  vg(k)  (see  Section  2.3) .  to  particular.  If  I-  is  very  large  (in  ther 
words,  the  target  needs  to  be  tracked  for  a  long  time),  the  trellis  diagram  may  cor tain  •  hugs 

amount  of  ,mhc.  In  such  r  eras,  4 SOSA  may  require  a  very  large  memory  for  the  storage  of  etacka 

of  searched  paths  and  comparisons  to  reorder  the  paths  in  stacks  according  to  their  metrics  while 
0DSA  requires  s  hugs  memory  end  computation  to  compare  the  ner~Us  of  all  paths  in  the  trailia 
diagram.  Hence,  tiu.ee  smoothing  mlgovlthma  uru  lapiactical.  Therefore,  a  smoothing  algorithm 
which  requires  a  constant  memory  for  tbs  path  most  probably  followed  by  the  target  from  time  tero 

to  any  time  l  la  needed.  In  this  section,  such  an  algorithm  Is  presented.  It  is  based  on  a  sub- 

optimum  decoding  algorithm.  Hence  it  dost  not  minimite  the  overall  error  probability  for  the 


detection  of  the  path  most  likely  followed  by  the  target.  It  (smoothing  algorithm)  is  referred  to 
as  Suboptimum  Decoding  based  Smoothing  Algorithm  (SDSA)  which  is  as  follows: 

Initial  Step  -  After  obtaining  the  first  L(l)  observations  (i.e.,  the  observation  sequence 
from  time  one  to  time  L(l)),  using  ODSA  find  the  path  most  probably  followed  by  the 
target  from  time  zero  to  time  L(l).  Let  this  path  be  (see  Figure  2.10.1). 


Time  Zero  Time  L(l)  Time  t(l)+L(2) 


* -  t(l)  +  1  - O- -  L(2)  - *  I 

- - -  |  , 

Initial  nodes 
(quantization  levels) 


Figure  2.10.1.  The  Trellis  Diagram  for  the  Suboptimun  Decoding  baaed  Smoothing  Algorithm 


First  Step  -  Obtain  the  next  1(2)  observations  (i.e.,  the  observation  sequence  from  time 
L(l)+1  to  time  1(1)  +  L(2))  and  asstae  that  the  target  in  fact  followed  the  path  fl* 
from  time  sero  to  time  1.(1)  (in  other  words,  assume  that  the  target  wss  at  the  end  point, 
denoted  by  fiq(L(l)),  of  H1  at  time  L(l)  with  probability  one).  Then  using  ODSA,  find 
the  path  most  probably  followed  by  the  target  from  time  L(l)+1  to  1.(1)  +  L(2).  Let 
this  path  be  ft*. 

Second  Step  -  Obtain  the  following  L(3)  observations  (i.e.,  the  observation  sequence  from 
time  1(1)  +  1(2)  +  1  to  time  L(l)  +  L(2)  +  1(3))  and  assume  that  the  path  iUfi*  actually 
followed  by  tha  tsrgst  from  tims  taro  to  tims  1(1)  +  L(2)  (in  other  words,  assume  that 
tha  tsrgst  was  at  the  and  nods,  dsnoted  by  8  ( 1(1)  +  L<2))  of  the  path  ft2  with  probability 
ona) .  Than  using  ODSA,  find  tha  path  most  probably  followed  by  the  target  from  time 
L(l)  +  1(2)  +  1  to  time  1(1)  +  L(2)  +  l(3).  Let  this  path  be  8*.  The  other  ateps  eimilarly 
continue  until 

q 

l  -  I  Uk) 

k-1 

At  the  end,  decide  the  path  composed  of  the  pathe  fl1,  fl2,  . ...  as  the  path  ((I)  most 
probably  followed  by  tha  target  from  time  saro  to  tlaa  L,  i.e.. 


a  -  a1  a2  ... 


where  q  is  the  number  of  observation  sequences  considered  from  time  tero  to  tlae  L.  The 
number.  L(i),  of  observations  in  the  1th  observation  sequence  is  chosen  such  that  at  the 
(l»l)th  Mtp  of  (SDSA),  ODSA  finds  tht  path  x*  without  requiring  a  huge  aamory  and  compute' 
tlon. 

Let  us  divide  tha  trallls  diagram  T  into  q  parts  such  that  tha  first  part  contains  L(l)  +  1 
coltmms  of  quantitation  levels,  starting  from  time  tarot  tha  tecond  part  contains  tha  nsxt  1(2) 
columns;  tha  third  part  contains  the  following  L(3)  columns  of  quantitation  levels;  and  so  on. 
how  wa  era  going  to  define  some  symbols  which  will  be  used  in  tha  analyses  (sea  Figure  2.10.2). 


lime  Zero 


Time  L(l) 


Time  L(l)+L(2) 


Time  L 


L(l)+1 


The  end  node  of 


Figure  2.10.2.  The  Trellis  Diagram  for  the  Performance  Analysis  of  the  Suboptimum  Decoding 

based  Smoothing  Algorithm 

H  is  the  atl1  path  through  the  trellis  diagram  T 

a 

i  th 

H  is  the  portion  of  H  in  the  i  part  of  T  where  i  •  1,  2,  ....  q 

m  ts 

Hg  is  the  path  (throughout  T)  which  SDSA  decided  that  was  mont  probably  followed 

by  the  target  when  the  target  actually  followed  the  path  H 

IB 

Hg  is  the  portion  of  Hg  in  the  1th  part  of  T  where  1  «  1,  2,  ...,  q 

T1  la  the  first  part  of  the  trellis  diagram  T 

n 

T*  is  the  trellis  diagram  composed  of  the  paths  in  the  i1*'  part  of  T  which  start 

off  at  the  end  (final)  node  H*”*  where  1  »  2,  3,  . ...  q 

T  is  the  trellis  diagram  which  is  composed  of  T*,  T^,  . and  T^ 

D  ■  ©  ■ 

N  Is  the  number  of  possible  paths  (hypotheses)  through  T 

K*  la  the  number  of  poaslble  paths  through 

la  the  multiplication  of  H^,  ....  and  h£ 

X*  la  the  aet  of  all  quantisation  levels  In  T ,  except  the  quantisation  levels 

at  time  aero 

X1*  la  the  sat  of  all  quantisation  levels  In  except  the  quantisation  levels  at 

time  aero 

X**  le  the  eet  of  all  quantisation  lavels  In  whare  1  •  2,  3,  ....  q 

la  tht  sat  of  all  L(l)-tuplas  of  x£* 


la  tha .cartesian  product  of  the  sett  H**.  hf*.  ....  and  that  la 

jjt  •  gl#  |  |^2t  |  t  t  jjji  “  ^ 

Is  tha  enaamble  (or  set)  of  all  N*-tuples  of  H** 

Is  the  snsamble  (or  set)  of  all  N^-tupls*  of 

la  the  observation  sequence  used  at  the  (1-1)  step  of  SDSA  (l.a.,  tha  obaarvatloo 
a eq nance  associated  with  the  llD  part  of  T),  that  Is 


BU(t)  .  s(ne2),  ....  «(nH(l))> 
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where 

1-1 

n  A  J  L(k);  by  definition,  n  ■  0  for  i  -  1 
k«l 

The  performance  of  the  subopt  laum  decoding  based  smoothing  algorithm  Is  discussed  In  the  next  two 
sections. 


2.10.1  An  Upper  Bound  for  the  Overall  Error  Probability 

Let  us  start  calculating  the  correct  detection  probability  of  the  path  H,,,  which  la  defined  by 
the  probability  of  choosing  Hq  as  the  path  most  likely  followed  by  the  target  when  the  target  in 
fact  followed  the  path  Hj,.  In  other  words,  the  correct  detection  probability  Pca(Hi,  H2,  ...,  %) 
is  the  probability  of  choosing  H,,(«  H*  . . .  Hj§) ,  given  the  observation  sequence  and  that  the 
path  Hg  was  followed  by  the  target.  Hence,  we  have 


Pc  . V  A  *«*<>£  H’1*L.Bb> 

B 

-  Prob{a|  -  H^|Hb,*L,  h£  -  H2,  ....  H^"1  -  H^'1} 

•  Prob^"1  -  Hfl|HB.aL,  hJ  -  H^,  ....  H^-2  -  H<'2} 

...  Prob{Hg  '  (2.10.1.1) 


On  the  other  hand,  for  the  subopt iaua  decoding  based  smoothing  algorithm,  the  probability  of  choosing 
tl£  as  the  decision  (l.e.,  Hi  -  H*) ,  given  the  observation  sequence  that  the  path  Hg,  was  followed 
by  the  target,  and  the  correct  detection  of  the  oaths  H*.  Hg,  ....  H*"1,  is  the  probability  of 
choosing  H*.  given  the  observation  sequence  st>ti}|  that  the  pathK*  was  followed  by  the  target, 
that  is, 

Prob(H^  »  HB|HB,xL,  a^  -  H2,  ....  h£“1  -  a2'2)  -  ProblRg  -  H*|tL(i),»£>  (2.10.1.2) 

Also  we  have 


ProbtHg  -  H2|sL<1>,i£)  •  1  -  P^(T2) 


(2.10.1.3) 


where 


pg  (T1)  A  Proh(8g  *  U^lsl‘<1),l^}  (2, 10.1. A) 

i  Ph 

which  is  the  error  probability  of  H,  when  at  the  (i-1)  step  of  SOSA,  00SA  is  used  for  only  the 
trellis  diagram  T*  with  the  observation  sequence  s1*'1!.  First  substituting  (2.10.1.3)  into  (2.10.1.2) 
and  then  into  (2.10.1.1),  we  obtain  the  correct  detection  probability  of  it 


•v 


ft 

i*i 


(i 


(2. 10.1.  S) 


Moreover,  from  the  definitions  of  the  correct  detection  end  error  probabilities  of  the  peth  H  (see 
Definition  2. 7. A),  we  have  * 


“2 . V  A 

• 1  -  v8!-  . v 


(2.10.1.0) 


Hence  suDetltv  (2.10.1.$)  into  (2.10.1.6)  we  get  the  error  probability  of  as 

PE  (H^,  Mj,  ....  -  1  -  ft  |1  -  Pj  (T*)J  (2.10.1.7) 

Furthermore,  using  e  bound,  denoted  by  l^(T^) ,  for  tbe  error  probability  of  H^,  the  error  probability 
of  Ha  can  be  upper  bounded  by 
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pe  (hx>  h2>  ....  y  <1  - 


ft 


[l  -  B^OrS] 

m  id 


(2.10.1.8) 


where,  for  example,  B_(T_)  Is  a  bound  of  Gallager's  type  for  the  trellis  diagram  T  ,  with  the 
observation  sequence  zL(i)  (see  2. 8. 2. 8).  Therefore  substituting  the  bound  in  (2.10.1.8)  for  the 
error  probability  of  the  path  In  (2.7.19)  yields  a  bound  for  the  overall  error  probability. 

2.10.2  An  Ensemble  Upper  Bound  for  the  Overall  Error  Probability 

An  ensemble  bound  for  the  overall  error  probability  for  the  detection  of  the  path  most  likely 
followed  by  the  target  being  considered  can  be  obtained  as  follows: 


First,  for  every  path, say  a„,  through  the  trellis  diagram,  T,  a  probability  density  function 
!*(•)  is  defined  on  the  ensemble  £  such  that 

ttt  m 


TO 


ft 

i-1 


CO 


for  all  Te  e  £ 
a  m 


(2.10.2*1) 


where  Qne(*)  is  a  probability  density  function  on  the  ensemble  6 n,  and  l'me  e  £*.  Then  averaging 
the  error  probability  of  the  path  over  the  ensemble  Sa  with  respect  to  the  probability  density 
fucntlon  in  (2.10.2.1)  yields  the  following  ensemble  error  probability,  denoted  by  PB(T),  of  the  path 


PE  (T) 


CC  PE  (T) 

m 


(2.10.2.2) 


Further,  substituting  (2.10.1.7)  and  (2.10.2,1)  into  (2.10.2.2),  changing  the  order  of  multiplications 
and  auesaatioos,  and  recognising  that  the  summations  are  performed  over  the  entire  enaemblea,  we 
obtain 


where 


(2.10.2.3) 


(2.10.2.4) 


which  la  the  ensemble  error  probability  of  (he  path  H,  when  only  the  trellis  diagram  T_  Is  considered. 
In  other  words.  It  Is  the  ensemble  average  of  the  error  probability  of  Hi  (when  only  the  trellis 
diagram  ti  la  considered)  over  the  ensemble  £1  with  reapset  to  a  probability  denaity  function 
<)£*(•)  (aee  Section  2.8.3),  Similarly,  using  (2.10.1.8),  (2.10.2,1),  and  (2.10.2.2),  we  obtain  the 
following  ensemble  bound  for  the  error  probability  of  the  path  tt^. 


where 


pTTrT  <  l  -  fr  U 

Em  1-1 


W>  A  l  t  CO 

m 


-  b‘(t^)) 


(2.10.2,3) 


(2.10.2.6) 


which  la  an  ensemble  average  of  a  bound  CC  for  error  probability  of  the  path  K-  when  only 
the  trellis  diagram  T*  Is  conaldered.  Furthe. ,  aubatltuting  (2.10.2.3)  for  the  ensemble  path  error 
probabilities  In  (2.8.3. 3)  yields  an  anacmble  average  of  the  overall  arror  probability  for  the 
detection  of  the  path  moat  Ukaly  followed  by  tbe  target  being  considered  (t.e.,  the  ensemble  error 
probability  fp)  as  follows. 

K  -  S  p(«  >  t»  -  fr  u  -  ti  (t‘>j> 

-  l  -  l  pi»J  ft  U  -  ri  «‘)i 

m-l  m  l-l  m  * 


(2.10.2.7) 


Similarly  substituting  (2.10.2.5)  for  the  ensemble  path  error  probabilities  in  (2. 8. 3. 3),  we  obtain 
the  following  ensemble  bound  for  the  overall  error  probability  for  the  detection  of  the  path  most 
likely  followed  by  the  target  being  considered. 


PEi1 


I  Ptt  )  ff  [1  -  B*(T*)J 

a-l  “  i-1  B  " 


(2.10.2.8) 


where  p(H  )  is  as  defined  in  Chapter  2.6. 

3.  APPLICATIONS  OF  THE  SMOOTHING  ALGORITHMS 

As  discussed  before,  the  new  smoothing  algorithms  developed  in  the  previous  chapters,  can  be 
used  for  (linear  or  nonlinear)  discrete  models  with  arbitrary  independent  (of  each  other  and  from 
sample  to  sample)  random  interference  and  noise.  The  following  chapters  consider  applications  of 
these  smoothing  algorithms  to  seme  discrete  models  with  Gaussian  noise  and  with  or  without  arbitrary 
random  interference  in  the  time  interval  [0,1]. 

3.1  An  Example  with  Gaussian  Disturbance  and  Observation  Noises 

This  chapter  deals  with  the  following  discrete  models 


x(k+l)  ■  f  (k,  x(k) ,  u(k) ,  w(k))  (Motion  Model) 

*(k)  -  g(k,  x(k))  +  v(k)  (Observation  Model)  (3.1.1) 

where 


x(0)  is  an  n  x  1  initial  state  Gaussian  random  vector  with  mean  m^  and  covariance 
R0  u 
v(k)  is  a  p  x  1  Gaussian  disturbance  noise  vector  with  aero  mean  and  covariance 

x(k),u(k) ,*(k)  and  f (x,x(k),u(k) ,w(k))  are  aa  described  in  Chapter  2.1 
g(k,x(k))  la  id  r  x  1  (linear  or  nonlinear)  vector 

v(k)  la  an  r  x  1  Gaussian  observation  nolee  vector  with  sero  mean  and  covariance 

«v(k). 

Moreover,  the  random  vector*  x(0),  w(j),  v(k),  v(t)  and  v(a)  ere  assumed  to  be  independent  for  ell 
j»  b*  li  •• 

3. I. I  The  Metric  of  a  trench 

The  observation  s(k)  In  (3.1.1)  is  a  linear  function  of  the  Gauealen  observation  noiae  v(k). 
Bence  the  conditional  probability  density  function  of  s(k>,  given  that  x(k)  *  x*(k),  la  s  multi- 
vsrlste  Csuaalsn  density  function.  Thus  we  have  q 


p*(»(k)|x*(k))  £  p(s(k) }x(k)  -  xj(k)) 


**P  -  ((e(k)  -  g(k,  *‘(k))|T  *'l(k)|s(k)  -  g(k,  x‘(k»)/2) 

- *7,7777-7— T7T“ - - - 


(3.1.1. 1) 


(2s)l,‘  [det  ^(k)]1 

Substituting  this  into  (2.7.15)  yields  the  metric  of  the  branch  between  the  node#  xl(k-l)  end  xl(k) 

as  <  d 


M(x‘(k-1)  *  x‘(k))  «  In  -  ln(tft)r/J{dst  h^k)]1'2)  -  (*(k)  -  g(k,x‘(k)))T  S^(h)ta(k)  -  g(k,x*(k)))/2 


(3.1.1. 2) 


3.1.2  The  Optima  Decoding  bessd  Seoo thing  Algorithm 

S. 1.2.1  A  Union  Upper  Sound.  Ae  mentioned  before,  the  bound  for  the  overall  error  probability, 
given  In  Chapter  2.1.2  t»  very  complex  to  evaluate;  hence  the  ensemble  bound  in  Chapter  2.8,3  was 
considered.  However  for  the  models  In  (3.1.1),  a  union  bound  (which  is  very  assy  to  evelueta)  for  the 
detection  of  the  path  (moat  probably  followed  by  the  target  from  time  taro  to  time  L)  by  using  OOSA 
can  be  derived  ae  follows. 

Let  us  now  consider  the  sat  i‘<  dafiaad  by  (2.8. 2.1).  Substituting  (3, 1.1.1)  into  tba  metric  of 
the  path  (l.a.,  (2.7.17))  end  then  eubetitutlng  title  metric  into  (2.8.2. 1),  w*  obtain  the  eat 
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y  ■  {iLi  M*  <Hj )  M'(y)  for  gone  j  i*  l) 

-  u  t*L:  M'{8  )  >  (3. 1.2.1) 

J-i  J 

where 


M’ (H.)  A  2  In  it*  +  l 
1  u  k-1 


{2  In  it*  -  [*(k>  -  g(k,  x*(k))lT  R~*(k)(t(k)  -  g(k. 


xj(k))]} 


(3.1. 2. 2) 


Recall  that  the  set  I",  contains  the  conplenent,  Dj,,  of  the  decision  region  Dg  for  the  hypothesis 
Hg.  Then  from  the  axioms  of  probability,  we  obtain  the  following  bound  for  the  error  probability 
of  the  path  y 


PE  (H v  y,  ....  y  <  Prob{*1 1  rjy} 

<  l  Prob{*S  M'(H.)  >  M*  (H,  )  |H, }  (3.1. 2. 3) 

“j?i  J-ii 


Substituting  (3.1. 2. 2)  for  M'(y),  we  get 

Prob{*L;  H’(Uj)  >  N‘(y)|y}  -  Prob(J^  (y)  (3.1. 2. 4) 


where 

J.,  4  I  2U<k.  x^(k))  -  g(k,  a*(k»}T  yX(k)  a(k) 

13  k-1  **  **  v 

Is 

A,.  A  2  ln(  TT  -tM  l  («T(h.  g(k.  x^(k»  -  *T(k,  x*(k))R~l(k)  g(k.  **<k)» 

k-0  a2  k-1  q  V  q  q  q 

It  follows  that  y .  Is  a  linear  (unction  of  «(k),  which  la  a  multlwatlate  (Sauttlan  density  function 
whan  Hj  Is  given.  nsnco,  the  conditional  density  function  of  Jjj,  given  Kg,  is  a  Nortel  density 
function  with 


■ean  *  lUy^ly) 

-  |  2(#(k,  »J(k))  -  »(k,  »*(k))lT  *“X(k)  g(k,  *XCk))  O.l.l. S> 

k-1  q  y  q 


and 


Var(J..|N.)  -  *  l  tg(k,  *;J(k)>  -  *<k,  *‘{k)))T  #rl(k)(«<k,  *£(k»  -  (k.  *‘(k))J  (3. 1.2. 6) 

1  S  S  v  ...  H  -  " 


Th*r«tor*  mi  tuv# 


r  ,  p.,  - 

*u 


4<u* 


\i  2  t<Jn*V 


■) 


where  <}(•)  Is  wsetiiu  ref  erred  to  ae  the  Gaussian  Integral  function  and  it  is  defined  by 

Q(y>  4 


(3. 1.1.7) 


0.1.2.#) 
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Combining  (3. 1.2. 7),  (3. 1.2. 4),  And  (3.1. 2.3)  we  obtain  the  following  upper  bound  for  the  error 
probability  of  the  path  H 

m 


<H, 


V  <  l  Q(-^ 

~  i^i  tfarU 


-E(jnlHi}, 


13 


(3. 1.2. 9) 


This  bound  la  some tines  referred  to  as  the  union  bound  for  the  error  probability  of  Hit  Further, 
substituting  this  bound  for  the  path  error  probabilities  In  (2.7.19)  ve  obtain  an  upper  bound  for 
the  overall  error  probability. 


3. 1.2. 2  The  Upper  Bound  for  the  Overall  Error  Probability.  Setting  p  equal  to  one  in  (2. 8. 2. 8), 
and  changing  the  order  of  multiplication  and  integration,  we  get 


PE  (B,.H, . JU  <  I  (TT  4)1/2  TT  [  (p*<x(k)|x*(k))  p*  (*(k)  |x£(k))J 1/2  dz(k)  (3.1.2.10) 

i  1  2  "  ~j?i  k-0  i£  k-1  U(k)  q  q 


From  (3. 1.1.1)  and  (A. 4)  in  Appendix  A,  ve  have 

B(x1(k),  jrj  (k) )  A  [  Ip*  (*(k)  |x*(k))  p*(:s(k)!x;l(k))]1/2  dt(k) 

q  q  *  Jr(k)  q  q 

(g(k,  x*(k))  -  g(k,  xi(k))]T  R“1(k)(g(k,  x*(k))  -  g(k,  x;*(k))) 

-  exp - 3 - 3 - *5 - 3 - 3 -  (3.1.2.11) 


Hence  substituting  (3.1.2.11)  into  (3.1.2.10)  ve  obtain  the  following  bound  for  the  error  probability 
of  the  path  »t. 

P.  (H,,U„...,V  <  I  (  TT  -V/2  TT  B(*^(k),  x^(k))  (3.1.2.12) 

6i  1  2  ^  "J?i  k-0  k-1  3  q 

Substituting  this  bound  for  the  path  error  probabilities  in  (2.7.19)  yislds  s  bound  for  ths  overall 
error  probability. 

3, 1.2. 3  Ths  Enaambla  Upper  Bound  for  the  Overall  Error  Probability.  The  ensemble  bound  In 
(2.8.3.39)  is  usad  aa  ths  performance  measure  for  ths  optima*  dacodlng  baaed  aaoothlng  algorithm. 

Using  (A. 4)  with  (3. 1,1.1)  in  this  bound,  ve  obtein  the  following  bound  for  the  ensemble  error 
probeblllty  for  the  detection  of  the  path  moat  likely  followed  by  the  target  being  considered 

L 

K  <  0  •  TT  (  l  l  #<*,.  *,)>  (3. 1.2.13) 

B  "  k-1  Xjtx«  XjCX*  1  * 

where  D  and  B(x.,  x j)  are  given  by  (2.8.3.40)  and  (3.1.2.11),  respectively  and  X*  is  defined  In 
Section  2.8.3.  tf  the  function  g(k,-)  In  tha  observation  modal  In  (3.1.1)  and  the  statistics  of  the 
obasrvetion  noiss  v(k)  (l.e,,  kyik))  ere  time  invariant,  then  tha  snaambls  bound  In  (3.1,2.13) 

becomes 


L  <  0  •  (  l  l  B(x,.  x,))L  (3.1.2.14) 

B  ”  XjCX*  Xjk*  1  2 

3.1,3  The  Stack  Sequential  Decoding  based  Smoothing  Algorithm 


3.1.3. 1  The  Ensemble  Upper  Bound  for  the  Overall  Error  Probability.  In  the  bound  in  (2.9,3. 7), 
substituting  nos  for  p  and  (1/K«)  for  q(x)  for  all  x«  we  obtain  tha  following  bound  for  the  ensemble 
error  probability 


L  i «  TT  c.  ♦  I  TT  («*lVl/J  o.  TT  c.)  0.1.3.1) 

e  k-1  *  t-1  k-i  *  J-i*l  J 

where  t  la  given  la  (2. 9. 3. 8),  and 

{p‘uUH*t)J  tp^Ufk) |k2)>1/2)  d*(k)  (3.1.3.2) 

I L  $  [  t<p<e(k)|»>)l/V  di(k) 

^  'e(k) 


(3.X. 3. 3) 
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Substituting  (2. 9. 3.6)  with  <3. 1.1.1)  into  Ck  and  then  using  (A. 5)  in  Appendix  A,  we  get 

.1.2  (2/9ir)r/4  ,  r  v  -  g(k.x2)lT  R^CkXgOt.Xj)  -  g(k,x2)] 

C.  »  (  )  T7T  i  l  l  «*P  “  r  - ) 

N  (det  Ry(k))/  XjGX®  x2eX®  6 

Similarly,  substituting  (2. 9. 3. 6)  with  (3. 1.1.1)  into  and  using  (A. 4)  we  obtain 


(3. 1.3. 4) 


D 


k 


<V<  I.  I 

N  x^X®  x2eX« 


BtXj,  x2)> 


(3. 1.3.5) 


where  B(xi,x2)  is  given  by  (3.1.2.11);  N®  and  X®  are  defined  in  Section  2.8.3. 

If  the  covariance  matrix  (Ry(k))  of  the  observation  sequence  v(k)  and  the  function  g(k,.)  are 
time  invariant,  then  the  bound  in  (3. 1.3.1)  becomes 


-  F{ck  +  2  1  TT  <®k1V1/2]D*  c?;'1) 

*  i-l  k«l  *  *  * 


(3. 1.3. 6) 


3.2  An  Example  Kith  Interference  and  Gaussian  Disturbance  and  Observation  Noises 
In  this  section,  we  consider  the  following  models: 

x(k+l)  -  f (k,  x(k) ,  u(k) ,  x(k))  (Motion  Model) 

*(k)  -  g(k,  x(k),  I(k))  ♦  h(k,  x(k),  l(k))  v(k)  (Observation  Model)  (3,2.1) 

where 

x(0),  x(k),  u(k),  w(k),  a(k)  and  f(k,  x(k),  u(k),  w(k))  are  as  described  in  Section  3.1, 

g(k,  x(k),  I(k))  and  h(k,  x(k),  l(k))  are  r  a  1  and  rat  dimensional  (linear  or  nonlinear) 
matrices,  respectively, 

v(k)  is  an  l  a  1  Gaussian  observation  noise  vector  with  aero  mean  and  covariance  Kv(k), 

!(k)  is  en  m  x  1  Interference  vector  with  known  statistics. 

Furthermore,  the  following  assumptions  art  made: 

(1)  The  random  vectors  a(0),  v(J),  v(k),  v(l),  v(m),  1(e)  *nd  l(p)  are  independent  for  all 
]■  k,  t,  a,  o,  p. 

(2)  th(k,  a(k),  t (It) >  ^(k)  hT(k,  a(k),  l(k»fl  exists  for  all  k. 

3.2.1  The  Metric  of  e  Branch 

Let  ua  consider  tha  observation  eodel  in  (3.3.1).  The  observation  *(k)  te  a  linear  function 
of  the  normal  obeervetlon  noise  vector  v(k).  Therefore,  the  conditional  probability  density  function 
of  a(k),  given  thet  x(k)  -  x‘(k)  and  l(k)  U  a  multivariate  normal  density  function,  namely, 

pta(k)jx‘(k),  l(k))  4p(a(k)|a(k)  -*‘(k).  l(k)) 

•  A  exp  -  |  (3. 2.1.1) 

where 

4  4  <2t)"r/l  (det(h(k,  «*(k),  t(k))  ky(k)  hV.  x‘(k),  l(k))D*l/J 

I  4  (*(k)  -  «(k,aJ(k),l(k)))T  Ch(k,xJ(k),l(k))»v(k)  hT(k,aJ(k).Uk»rl(a(k)  -  g(k,aj(k?.l(k))  J 

(3. 2. *..2) 


From  (2.7.10),  we  have 
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!  p(t(k)|x*(k),  I(k))  p(I(k))  dl(k) 

JI(k)  q 


p'(*(k)|xj(k))  -< 


using  (2.5.1) 


k 

l  p(*<k)|*q(M,  I^tt))  p(Id£(k))  using  (2.7.7) 


where 


p(s(k)|xj(k),  Idi(k))  A  PU(k)jxJ(k),  l(k)  -  IdJl) 


(3. 2. 1.3) 


which  is  given  by  (3. 2. 1.1).  Substituting  (3. 2* 1.3)  into  (2.7.15)  yields  the  metric  of  the  branch 
between  the  nodes  x^(k-l)  and  **00 » 

H(x*(k-1)  *  x*(k))  ■*  In  n*  +  In  p' (*{k) )x*(k))  . 


3.2.2  The  Opt  law  Decoding  based  Smoothing  Algorithm 

3,2.2. 1  An  Upper  Bound  for  the  Overell  Error  Probability.  Setting  p  equal  to  one  in  (2. 8. 2. 8) 
yields  the  bound  in  (3.1.2.10).  Hence  substituting  (3. 2. 1.3)  into  (3.1.2.10)  we  obtain  a  bound  for 
the  error  probability  of  the  path  H  . 

If  the  observation  eodsl  is  approximated  by  (2.7.7),  then  an  upper  bound,  which  ie  very  easy 
to  evaluate,  for  the  error  probability  of  H*  can  be  obtained  aa  followa!  aubatitute  the  aecond 
equation  (3. 2. 1.3)  Into  (3.1.2.10)  and  than  uae  tha  following  inequality 


<1  O*  <  l  for  say  a  .  >  0  and  X  e  (0,11 

11*  1  ~ 

to  obtain  tha  following  bound  for  tha  error  probability  of  the  path  Hj 
^  L  \  ^1 

ft  <“4 . Vi  l  <TT  4>l/J  TT <  l  l  tpadl(k))  pa  (k)))l/J 

i  *  "  J?i  k-0  k-1  i-l  J-l  dl 

*  [  lp(*(k)|x‘(k),  t .  (k»  p(t(k) ix|(k),  t.,{k))li/2  di(k)J 

i  ,(g)  s  q  uj 


(3. 2. 1.4) 


(3. 2.1. 5) 


where  lgi(k)  end  Igj(k)  ere  euMted  ov.r  the  eet  of  ell  discrete  value*  of  Ij(k)  (see  Section  2.7). 
The  Integral  can  be  evaluated  by  uelng  (3. 2. 1.1)  in  (A.l)  so  that  we  have 


where 


(p(s(h)l*J(k).  tdl(k)>  p(s(k)|*J(k),  !dJ(k»ll/J  ds(k)  -  a*  ♦  % 


(3. 2. 1.0) 


A*  4  (4et(2(»^  ♦  t^l/*  (tf4l  g  ,1/* 

»’  A  ♦  «jVl  btJ  -  gT(k..J(W.l4t(h))»;1  g(Mj(U,J41(k» 

-  gT(k,*J(k),l4J (W)*"1  g(k,«J(k),l4j(W)» 

Bj  4h(k,ajck),  I4l(k))»y(k)  hT(k,  *J(k),  t41(k)) 

*j  Ah(k.aJ(k),  Idi(k))*v(k)  hT(k.aJ(k).  l4j(k)) 

A  l41(k)>  ♦  *]1  •U.nJ(k).  X4J(k)>.  (3.2.1. 7) 


Substituting  this  hound  for  the  error  probability  of  *4  in  (2.7.19),  we  get  an  upper  bound  (nr  the 
ovnreU  error  pcobnbllUy. 
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3. 2. 2, 2  An  Ensemble  Upper  Bound  for  the  Overall  Error  Probability.  Substituting  (3. 2. 1.3) 
with  (3. 2. 1.1)  into  (2.8.3.39),  we  obtain  an  enseable  bound  for  detecting  the  path  (most  likely 
followed  by  the  target  being  considered)  by  the  optimum  decoding  based  smoothing  algorithm. 

If  the  observation  oodel  is  approximated  by  (2,7.7),  then  the  enseable  bound  mentioned  above 
can  be  further  upper  bounded  by  using  the  inequality  (3. 2, 1.4)  so  that  we  obtain  the  following 
ensemble  bound,  which  is  easy  to  evaluate,  for  the  overall  error  probability 

L  rk  rk  . 

f.«I>TT<  I  I  II  (pu..(k»  P(I  (k)))1/2  [p(s(k)|x  ,1  (k))p(z(k)|x,,l..))1/2  d*(k)} 

k-1  x1dC<‘ x2£X®  i-1  J-l  dJ  JZ(k)  1  di  2  dJ 


(3. 2. 2.1) 


where  D  is  defined  by  (2.8.3.40),  and  the  integral  is  given  by  (3. 2. 1.6). 


If  the  functions  g(k,*,*)  and  h(k,*,*)  in  the  observation  model  in  (3.2.1),  tha  covariance, 
RyCk),  of  the  observation  noise,  and  the  statistics  of  the  interference  I(k)  are  time  Invariant, 
then  the  bound  in  (3. 2. 2.1)  becomes 


I,  -  a 

x^eX*  SjEX 


r. 

I  a  I  I  tp(Idi(k))p(I  (k))l1/2  | 
eXa  i-1  j-1  di  dJ  't(k) 


(p(t(k) |x1,ldi(k))p(t(k) |x2, 1  ■ 


i  / 1  , 

'  t  '  dr(k>) 
(3. 2. 2. 2) 


3,2.3  The  Stack  Sequential  Decoding  based  Smoothing  Algorithm 

3.2.3, 1  An  Ensemble  Upper  Bound  for  the  Overall  Error  Probability.  In  the  bound  in  (2. 9. 3. 7), 
substituting  one  for  p  acd  (l/he)  for  q(x)  for  all  *,  and  then  using  (3. 2. 1.3),  we  obtain  an  ensemble 
bound  tor  the  overall  error  probability  for  detecting  the  path  (uost  likely  followed  by  the  target 
being  considered)  by  the  stack  sequential  decoding  based  smoothing  algorithm. 

if  the  observation  model  Is  approximated  by  (2.7.7),  then  the  ensemble  bound  mentioned  above 
can  be  further  upper  bounded  by  using  inequality  (3. 2. 1.4)  so  that  we  can  obtain  an  ensemble  bound 
which  la  easy  to  evaluate  as  follows.  Using  the  inequality  (3. 2. 3.1),  we  get 


(p‘(*(k)|x))l/2  «  I  tp(«(k)|x,  l..Ck))ll/2  (p(l  (k))l-/3 
~  l«l 

then  subetltuting  this  bound  Into  and  in  (2.9.3.S),  we  obtain 

‘‘U  1?,!  ‘l  l  .y  ,yL  *«“” 


(). 2,3.1) 


•  (p(«(k)i»r  tdj(k»)1/2  d*«*)J? 


(3.2. 1.2) 


*<£ 


and 


'fewm-1  ~  i -k  4''  i’wk,,'r 

♦  p(a(k) j«2,  Vsj(h»)l/2  d*(k)} 


(k)) 


(3,2. 3.3) 


,  _b 

iDk 

wfisrs  the  Integral  t«  (3.2.3.33  ’ .  given  by  (3. 2. 1.8)  and  the  Integral  in  C;,  can  be  evaluated  by 
eubatitwtUg  (3.2. 1.1)  lata  It  aad  than  using  (A. 3)  Is  Appendix  A  so  tfc*t  m  have 


|  plslkJlxj.  l4lCU)  (p(«(k>|a2,  IdJ(k))ll/2  d*(k)  -  *'  exp 


(3.2.3.*) 


where 


A*  4  Uatl(lh[l  ♦  -§->'l)}l72/(2x)r/*  (det  tj)1/2  (del  t.)17* 

4bj2i2tjx  ♦  t‘2rl  b’j  -  2*T(k,ml,i4|(h»tJl  *  gt(k,x2,i4J(k))«2l  g(h.x2,i4j(h» 
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b{2  A  2R^1g(k,x1,I(Ji(k))  +  R'1g(k,x2,Idj(k)) 
Rx  A  h(k,x1,Idi(k))  Ry(k)  hT(k,x1,Xdi(k)) 

R2  A  h(k,x2,Idj(k))  Ry(k)  hT(k,x2,Xdj(k)). 


Finally,  substituting  these  bounds  for  and  in  (2. 9. 3. 7),  we  obtain  the  following  ensemble 
bound  for  the  overall  error  probability 


p  <f{_TT  c  +  I 


TT  (\in)'1/2  7T  cb) 

k-l  j-i+1  J 


(3. 2. 3. 5) 


where  F  is  as  given  in  (2. 9. 3. 8).  Tf  the  functions  g(k,*,*)  and  h(k,‘,*),  and  the  covariance  matrix, 
Rytk),  of  the  observation  noise  v(k)  are  time  invariant,  then  and  Dg  become  time  invariant.  Hence, 

in  this  case,  the  bound  in  (3. 2. 3. 5)  can  be  rewritten  as  follows 


pf  <  F{(chL  +  \  [  fr  (it”inr1/2]'  (oh1  (ch1 

i-l  k-l 


(3. 2. 3. 6) 


4.  Nl&iHRICAL  EXPERIMENTS 

The  purpose  of  simulating  was  to  find  out  how  well  the  smoothing  algorithms,  developed  in 
Section  2,  perform  both  in  a  clear  environment  and  in  the  presence  of  interference. 

In  a  clear  environment,  the  aim  was  to  compare  '  ,ie  smoothing  algorithms  with  the  Kalman  filter 
algorithm  for  linear  discrete  models  and  the  extended  Kalman  filter  algorithm  for  nonlinear  discrete 
models.  However,  in  the  presence  of  interferer  ,  the  smoothing  algorithms  may  not  be  compared  with 
the  (extended)  Kalman  filter  algorithm  since  it  cannot  handle  the  case  of  interference.  Therefore, 
the  purpose  was  to  discover  how  good  the  estimates  produced  by  the  smoothing  algorithms  arc,  and 
also  to  observe  the  estimates  obtained  by  the  (extended)  Kalman  filter  algorithm  (which  considers 
only  observation  noise,  i.e.,  with  zero  i  terference).  All  of  this  was  done  for  both  linear  and 
nonlinear  discrete  models  with  interferei.ee. 

For  all  simulations,  the  IBM  Systems/370  Model  3033,  Fortran  IV,  and  IMSL  library  wore  used. 

For  each  simulation,  the  disturbance  uoise  w(k),  observation  noise  v(j),  initial  state  x(0),  and 
interference  I(£)  (i.e.,  ir.  the  presence  of  interference)  wore  taken  to  be  white  Gaussian  and  also 
independent  of  each  other.  For  a  discrete  random  variable  (with  a  given  number  of  possible  values) 
which  approximates  the  Gaussian  random  variable  with  mean  u  and  variance  o^,  the  one  in  (B.9)  in 
Appendix  d  was  used.  In  addition,  the  approximate  observation  model  (2.7.7)  was  used  in  all  the 
caues  of  interference. 

Simulation  results  are  presented  in  figures,  /t  the  top  left  corner  of  each  figure,  the  used 
models,  noise  statistics,  gate  size,  and.  the  number  of  possible  values  of  the  discrete  random 
variables  wg(-),  !,}(•),  and  x^(o)  (which  approximate  the  disturbance  noise  w(*)»  interference  1(0 
and  initial  state  x(0))  are  provided. 


The  following  abbreviations  and  terms  are  used  in  the  figures: 

AAEK  represents  the  average  absolute  error  for  the  (extended)  Kalman  filter  osrimatea. 
The  absolute  error  at  time  j  and  the  average  absolute  error  are  defined  as  follows: 


ABSOLUTE  ERROR  (at  time  j)  A  |x(j)  -  (j|j)| 

la  K 

aaek  ^  —j-  f o  |*(j)  -  syjlj)! 


(4.1) 

(4.2) 


where  L  is  the  time  which  the  target  was  tracked  up  to  and  including,  SrOIJ)  is 
the  (extended)  Kalman  estimate  of  the  state  x(J),  p<ven  the  observation  sequence 
from  time  one  to  time  j . 


AAEOP  represents  the  average  absolute  error  for  the  estimates  obtained  by  the  smoothing 
algorithm  uaed.  The  absolue  error  at  time  j  and  the  average  abeolute  error  ere 
defined  as  follows: 


ABSOLUTE  ERROR  (at  time  j)  A  |x(j)  -  «  (j)| 

l  L 

AaEOP^t -rr  j,  |x(j)  -  Mj)| 

1,1  j-0  * 


(4.3) 

(4.4) 


whern  l  is  as  defined  above,  and  (J)  la  the  estimate  of  the  atato  x(J),  obtained 
by  the  smoothing  algorltta  used. 
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ACTUA’.  stands  for  the  actual  values  of  the  states 

BOUND  represents  the  bound  in  (3.1.2.14)  for  the  optimum  decoding  based  smoothing  algorithm 
(ODSA)  using  an  example,  without  interference,  the  bound  in  (3. 2. 2. 2)  for  ODSA  using 
an  example  with  interference,  the  bound  in  (3. 1.3. 6)  for  the  stack  sequential  decoding 
cased  smoothing  algorithm  (SSDSA)  using  an  example  without  interference,  or  the 
bound  in  (3. 2. 3. 6)  for  SSDSA  using  an  oremple  with  interference. 

ER.eOV,  represents  the  estimation  error  covariance  matrix  for  the  (extended)  Kalman  filter 
algorithm.  The  estimation  error  covariance  matrix  at  time  j  is  defined  by 


E{ (x(j)  -  2  (jjj»  (xtj)  -  S  (j|j))T) 


where  E{  }  stards  for  expectation.  Obviously  in  a  scalar  case,  this  matrix  reduces 
to  the  mean  square  error 

EX.KAL.  represents  the  extended  Kalman  filter  algorithm  used 
E(A(-))  stands  for  the  expectation  of  the  random  variable  A(-) 

GATE  SIZE  represents  the  gate  size  used  for  the  quantization 
KAIMAN  stands  for  the  Kalman  filter  algorithm  used 

NUM.  OF  DISC.  FOR  A(")  represents  the  number  of  possible  values  of  the  discrete  random  variable 
(used  for  the  simulation)  which  approximates  the  random  variable  A(-) 

OPD  stauds  for  the  optimum  decoding  based  smoothing  algorithm  used 

SOD  stands  for  the  suboptimum  decoding  based  smoothing  algorithm  used 

3SP  stands  for  the  stack  sequential  decoding  based  smoothing  algorithm  used 

VAR(A(*))  stands  for  the  variance  of  the  random  variable  A(-). 


4.1  The  Optimum  Decoding  Based  Smoothing  Algorithm 

Many  examples  vere  simulated  with  the  optimum  decoding  baaed  smoothing  algorithm  and  the  (extended) 
Kalman  filter  algorithm.  The  simulation  results  of  tone  of  them  are  presented  in  Figure  4,l.l«-4c. 

The  simulations  were  stopped  after  seven  steps  because  of  the  exponentially  ^rowing  memory  require¬ 
ment  of  the  optimum  decoding  based  smoothing  algorithm.  For  each  example,  Che  simulation  reaults 
aro  presented  ;.n  three  figures. 

The  first  figure  presents  the  variations  of  the  actual  and  estimated  value*  of  the  states 
versus  time.  The  actual  values  are  marked  by  Symbol  0)  >  the  (extended)  Kalman  filter  estimates 
by  Symbol  A  •  and  the  OPD  estimates  (i.e.,  the  estimates  obtained  by  the  optimum  decoding  baaed 
smoothing  algorithm'  by  +,  The  second  figure  presents  the  variation  of  the  estimation  error  co- 
variance  matrix  (for  the  (extended)  Kalman  filter  algorithm)  vers.ua  time  as  well  as  the  bound  J-' 
(3.1.2.14)  (If  the  example  does  not  have  any  interference)  or  the  bound  In  (3. 2. 2. 2)  (if  the  example 
contains  interference).  This  bound  la  used  as  the  performance  measure  of  the  optimum  decoding 
based  smoothing  algorithm  while  the  error  covariance  matrix  is  used  a*  the  performance  measure 
of  the  (extended)  Kaizen  filter  algorithm.  .he  third  figure  presents  two  curves  a*  well  aa  the  average 
absolute  errors  for  the  (extended)  Kalman  estimates  and  the  OPD  estimates.  One  of  these  curve*  *howa 
the  variation  of  eha  absolute  errors  for  the  (extended)  Kalman  estimates  versus  time  and  is  marked 
by  A  •  The  other  curve  ehowe  the  variation  of  the  absolute  error  for  the  OPD  estimates,  and  is 
marked  by  Symbol  +. 

4.2  The  Stack  Sequential  Decodtn;  3a sod  Smoothing  Algorithm 

A  large  number  of  example*  were  simulated  with  the  stack  sequential  decoding  based  smoothing 
algorithm  and  rhe  (extended)  Kalman  filter  algoritla,  The  simulation  results  of  some  of  them  are 
presented  in  Figure  4.2.1*-4c.  For  each  example,  the  simulation  results  are  presented  in  three 
figures  (aa  in  Section  4.1). 

The  flrat  figure  presents  the  variations  of  the  actual  valuas,  the  (extended)  Kalman  estimate*, 
and  tha  SSD  estimates  (1  the  estimate*  obtained  by  the  stack  sequential  decoding  baaed  smoothing 
algorithm)  of  tha  states  versus  time.  The  second  figure  presents  the  estimation  error  covariance 
matrix  versus  time  aa  wall  as  the  bound  in  (3. 1.3. 6)  (if  the  example  does  not  have  any  interference) 
or  the  bound  in  (3. 2. 3. 6)  (if  the  example  contains  Interference) .  This  bound  la  used  as  tha  performance 
measure  of  the  stack  sequential  decoding  baaed  smoothing  algorithm.  Tha  third  figure  presents  tha 
variations  of  tha  absolute  errors,  ami  the  average  absolute)  errors  for  tha  (extended)  Kalman 
estimate*  and  tha  SSD  estimates. 

4.3  The  Suboptimum  Decoding  haled  Smoothing  Algorithm 

Many  exempli*  were  almualted  with  the  (extended)  Kalman  filter  algorithm  and  the  euboptlmum 
decoding  based  smoothing  algorithm  considering  three  steps  at  each  of  which  Six  observation*  warm 
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were  used.  The  simulation  results  of  some  of  them  are  presented  in  Figure  4.3.1a-4c.  For  each 
example,  the  simulation  results  are  presented  in  three  figures  (as  in  Section  4,1). 

The  first  figure  presents  the  variations  of  the  actual  values,  the  (extended)  Kalman  estimates, 
and  the  SOD  estimates  (i.e.,  the  estimates  obtained  by  the  suboptimum  decoding  based  smoothing 
algorithm)  of  the  states  versus  time.  The  second  one  presents  the  variation  of  the  estimation  error 
covariance  matrix  for  the  (extended)  Kalman  filter  algorithm.  The  third  figure  presents  the  average 
absolute  errors,  and  the  variations  of  the  absolute  errors  for  the  (extended)  Kalman  filter  estimates 
and  the  SOD  estimates. 


4.4  Comments 


Let  m^,  no  and  r^  be  the  numbers  of  possible  values  of  the  discrete  random  variables  w,j(k),  x^fO), 
and  I,j(k),  which  approximate  the  disturbance  noise  w(k) ,  initial  state  x(0),  and  interference  1(10, 
respectively.  These  numbers  were  taken  to  be  time  invariant  for  all  simulations.  Definitely,  the 
performance  of  the  smoothing  algorithms  depends  on  these  numbers  as  well  as  the  gate  size  used.  The 
smoothing  algorithm  produces  better  estimates  of  the  states  for  a  suitable  gate  size,  and  larger  mk, 
nQ,  r^..  This  follows  from  the  fact  that  the  disturbance  noise,  initial  state,  and  interference  are 
approximated  better  for  larger  m^,  no,  and  rg.  For  a  large  gate  size,  good  estimates  of  the  states 
are  not  expected  since  more  quantization  errors  are  made. 

The  ensemble  bounds  of  Gallager's  type  is  used  as  the  performance  measure  of  the  smoothing 
algorithms.  The  values  of  these  bounds  depend  on  the  models  as  well  as  the  quantization  (i.e.,  gate 
size,  m^,  no,  and  r^)  used.  Since  some  inequalities  are  used  to  derive  these  bounds,  they  can  be 
greater  than  or  equal  to  one  for  some  models  or  quantization  used.  It  should  also  be  noted  that  even 
in  the  cases  where  these  ensemble  bounds  are  less  than  one,  they  do  not  give  complete  information 
about  the  performance  of  the  smoothing  algorithms  (see  Theorem  2. 8. 3. 4). 

S.  CONCLUSIONS 

Three  completely  new  smoothing  algorithms  have  been  presented  far  the  following  type  of  discrete 
models  with  or  without  random  interference:  they  can  be  linear  or  nonlinear',  the  disturbance  noise, 
observation  noise,  and  interference  can  be  any  Independent  (of  each  other,  and  from  sample  (time) 
to  sample)  not  necessarily  Gaussian  noises;  the  disturbance  noise  and  interference  can  affect  che 
motion  (model)  and  observation  (model)  in  a  general  and  not  necessarily  linear  way;  functions  which 
define  the  models  do  not  even  have  to  be  continuous. 


The  new  smoothing  algorithms  are  based  on  the  quantization  of  states  to  a  finite  set  of  states 
sod  Decoding  Technique  of  Information  Theorv.  If  the  quantization  errors  are  neglected,  the  first 
smoothing  algorithm,  which  la  referred  to  aa  the  optimum  decoding  based  smoothing  algorithm,  is 
optimum  with  respect  to  the  minimum  error  probability  criterion  (Bayes*  decision  rule).  However, 
this  requires  an  evergrowing  emount  of  memory  with  time  for  its  implementation.  Hence  the  second 
algorithm,  vhlch  la  referred  to  ae  the  stack  sequential  decoding  baaed  smooching  algorithm,  has 
been  proposed.  Even  this  algorithm  requires  a  growing  amount  of  memory  with  time  (but  not  as  fast  aa 
the  flrat  algorithm)  for  ito  Implementation.  Therefore,  the  third  meoothlng  algorlth,  which  la 
referred  to  ee  the  suboptimum  decoding  base.l  smoothing  algorlth#,  has  been  proposed.  This  algorithm 
requlrsa  a  finite  emount  of  memory  for  eny  time  (i.e.,  no  matter  how  long  the  target  la  tracked). 

In  target  trecking,  it  le  very  difficult  to  determine  a  motion  model  which  la  analytically  tract¬ 
able  and,  at  the  seme  time,  eatlafectorily  close  to  reality.  Therefore,  by  making  many  approximations, 
an  analytically  tractabla  motion  modal  la  obtained.  However,  che  amooching  algor  it  time  presented  in 
thl*  chepter  usee  this  motion  model  only  to  determine  the  transition  probabilities  of  the  tracked 
target  from  gats  to  gate.  If  theee  transition  probabllitias  can  be  determined  in  another  way,  these 
smoothing  algorithms  can  be  used  with  only  an  observation  modal  so  that  the  motion  model  (which  le 
approximate)  is  not  necessary  any  more.  Hence  the  more  accurate  che  transition  probabilities  are 
determined,  the  more  accurate  che  estimates  will  be. 

The  ensemble  bounds  of  Gallager's  type  have  been  derived  and  ustd  as  the  performance  measure  of  the 
nsu  smoothing  algorithms,  these  bounds  are  sometlmee  totally  useleea  since  they  become  nuabere  which 
are  greater  than  or  equal  to  one. 

in  order  to  teat  the  smoothing  algorithms.  Digital  Computer  Simulations  have  been  perforated.  Some 
of  the  simulation  reaulta  are  presen  ted  in  detail.  These  results  show  that  for  both  linear  and  non¬ 
linear  discrete  modsle  with  interference,  these  smooching  algorithms  perform  very  well  even  though 
neither  the  Kalman  nor  the  extended  Kalman  .'’liter  algorithm  le  capable  of  handling  Interference.  Also, 
theee  smoothing  algorithm#  pt  form  better  than  the  extended  Kalman  filter  algorithm  for  somt  nonlinear 
models  with  Causalan  noises  fend  without  interference)  while  they  perform  almost  as  well  ae  the  Kalman 
filter  algorithm  for  llnoar  -udela  with  Gaussian  noises  (sod  without  interf mrence) . 

APPENDICES 


A.  Theorem 


Let  pUlx.,)  and  p(i|x.)  be  two  r-dimenslonal  (multivariate)  Gaussian  density 

J  -r/2  -1/2  <*  -  *<*,»  VU  *  *<*i)) 

pUlXj)  -  <2t)"t/2  (det  exp - 


functions  such  that 


A  NfeO^).  *t> 
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and 


p(z|Xj)  A  N(g(Xj) ,  Rj). 

Then  the  following  equalities  hold: 

I  —  [  tpUlx^  p(z|x  )]1//2  dz  -  A  exp{B/4} 

*  Z  * 


(A.l) 


where 


A  A  {detUft”1  +  R"1)~1l}1/2/(det  (det  R^) 

B  A  {bJ^R"1  +  r"1)-1  by  -  gU/  R’1  g (*t>  -  g(x^)T  Rj1  g(Xj)} 

A  R|X  sW*)  +  Rj1  g(*j) 


,1/4 


,1/4 


II  -  j  pCzlx^  (pfzlx^)172  dz  -  A*  exp{B'M) 


(A.  2) 


(A.  3) 


where 


,1/4 


A'  A  {detKR"1  +  — |-)~1]>1/2/<2TT)r/4  (det  R1>1/2  (det  R^1 
B'  A  bi>Il  +  *]lfX  bij  -  Rj1  i(\)  -  S(*j)T  Rj1  8(*j) 

bjj  A  ^Rj1  gCXji)  +  R"1  g(xj). 

If  the  covariance  eatricea  Rt  and  Rj  are  equal,  aay,  R*  -  Rj  A  Rv>  Wen  th«  equalitiee  above  becoee 

f  .  '  I/,  lS<*t>  "  R^lsWJ  -  8(*4)1 

III  —  j  lp(*|*t)p(«[«j)J1/2  dz  «  exp - - ^ - g-^ -  *- - -  i . 

(A. 4) 


and 


t  .  .  I/*  UWJ  “  iW.H*  R^ltW,)  -  8(0) 

IV  —  j  p(a|ai)(p(*|xj)]i/z  da  -  G  exp - *- - l—.g-X.-—  .,ft .  ...  J  ■ . 


(A.  5) 


where 


c  A  VX/*  * 

RTOof:  Proa  the  definition*  of  the  probability  density  function*,  it  i*  easily  obtained  that 

tpUtap  p(*laj)|1/2  -  C  eap  -  |  (A.6) 


where 


C  Ai/t(*»)f/i  Wet  t1)lM  (det  Rj)17*] 

DAW-  tWj))1  R^U  -  8Wt»  ♦  (*  -  fWj))T  Rjl<*  -  gWj)) 


Also,  D  tea  he  rewritten  a* 


D  •  (a  •  *>T  (R*1  ♦  t‘li  (•  -  a)  -  8 


(A.l) 


(A.#) 


where 


a  A  (R^1  +  R'1)'1  by 

by  A  R^1  g(Xl)  +  R~X  g(Xj)  (A. 9) 

B  is  as  defined  in  (A. 2). 


Thereforet  ve  have 


[p(z|xi)p(z|xj)I1/2 


(z  -  a)T  (R'1  +  R*1)  (z  -  a)  R 

«P[-  4  j  +J] 

(2ir)r/2  (det  R^1^4  (det  Rj)1/4 


Multiplying  the  denominator  and  numerator  by 


{det^fR’1  +  Rj1)"1])172 


(A.  10) 


gives 


,  »  exp  -  I C*  -  a)  - s -  v*  -  a;/zj 

[p(z|x  1)p(z|x  )]1/Z  -  (A  expf}  { - £72 - IT - !>2 - > 

1  3  4  (2Tr)r/2[det[2(Ri1  +  R^1)  ^J1'2 


x^i1 


(*  -  a)/2] 


(A.  11) 


It  should  be  noted  that  the  term  in  the  braces  is  s  multivariate  Gaussian  density  function  with  mean  a 
and  covariance  •  2(Rjl  +  Rrl)-1.  Hence,  integrating  this  tern  over  all  c  yields  one,  so  that  ve 
obtain  the  equality  In  (A.l). 


¥6t  the  proof  of  the  equality  in  (2,3),  let  us  rewrite  the  Integral  in  (2.3)  as 
|  p(*|x1)(p(t|Xj)]1/2  d*  »  |  {p(*|Xl)2  p(*|x^)]1/2  ds 

On  the  other  hand,  ftad  the  equality  that 

det  -  2r  det(R1/2) 


we  have 


where 


p2(*l*i)  -  C  •  pjtekj) 

P1<*l«i>  &  »(»Ut).  y) 
end 


C  4  l/2r/2  (2e)r/a  (det  IJ1!2 


(A. 12) 


(A. 13) 


(A. 14) 


(AflS) 


Substituting  (A. 14)  into  (A, 12)  yields  ' 

J  p(AU^)tp(«Jaj)}l/2  ds -Ol/2  |  t»l(t|«|)»Cs|ii)!1/t-.4a 

end  uelsg  the  quality  in  (A.l)  for  the  integral  on  the  right  head  side  is  (A. 16),  we  can  easily 
obtain  the  equality  la  (A. 3). 


(A.  if). 
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If  Rg  •  R,  ^  Ry,  substituting  R„  for  Rg  and  Rj  in  the  equalities  in  (A.l)  and  (.A. 3),  the 
equalities  in  XA.4)  and  (A, 5)  can  readily  be  verified.  This  proves  the  theorem. 


B.  Approximation  of  an  Absolutely  Continuous  Random  Vector  by  a  Discrete  Random  Vector 

Let  n  be  a  given  positive  Integer  and  let  Dm  be  the  set  of  all  distribution  functions  of  all 
a  x  1  discrete  random  vectors  with  n  possible  values  where  superscript  m  stands  for  the  dimensionality 
of  random  vectors.  Then  the  problem  of  approximating  an  absolutely  continuous  m  x  1  random  vector 
Xm  with  distribution  function  Fjrm(»)  by  an  m  x  1  discrete  random  vector  with  n  possible  values  is 
to  find  a  distribution  function F  (•)  e  Dm  which  minimizes  the  following  objective  function  over  the 
set  D°.  y“ 

J  (F  <•))  A  [  J*  „<•>  -  F  <la;  F  (•)  e  D®  (B.l) 

m  ym  -)Rm  x»  ym  m 

That  is 


J(Fm<-))-  min  J 0?  _('))  (B.2) 

v"  F  .<•>**  y 

y 

The  discrete  random  vector  defined  by  F  m<>)  is  referred  to  as  the  optimum  discrete  random  vector 
approximating  the  random  vector  XB.  yo 

Here,  the  approximation  of  an  absolutely  continuous  random  variable  X  with  distribution  Fy(*)  by  a 
discrete  random  variable  with  n  possible  values  is  considered.  The  necessary  conditions  that  the 
optimum  discrete  random  variable  approximating  X  must  satisfy  is  obtained.  Finally,  discrete  random 
variables  approximating  normal  random  variables  are  obtained. 

Let  us  now  state  two  theorems  and  define  some  symbols  which  will  be  used.  The  proofs  of  the 
theorems  are  given  in  Reference  [18]. 

Theorem  B.l1® 

0 

Let  f (y)  A  £(yg,  yj,  y, )  be  a  real  valued  function  on  an  open  set  T  of  R  and  let  f(y) 
have  finite  partial  derivatives  Sffyl/dy^,  It  •  1,  2,  ....  A  at  each  point  of  T.  If  f(y)  has  a  local 
minimum  at  the  point  y  A  <y  ,  y  Q,  ....  yt  0>  in  T,  then 


■53*  f(y)  »  0  for  each  k  ■  1,  2,  ...,  A 
yk  !y-y0 

Theorem  B.21* 

Let  f(y)  £  f(yj,  yj,  ...  yg)  be  a  real  valued  function  on  an  open  aet  t"  of  R*  and  let  f(y) 
have  continuous  second-order  partial  derivatives  on  f.  Let  y.,  A  <y,  n,  y,  n,  ....  y,  n)  be  a  point 
of  r  for  which  0  1,0  1,0  1,0 

33-  f(y>  •  0  for  each  k  «  1,  2,  i 
8yk  'yy0 


Assume  that  the  determinant 

0  Adet[[V2  f(y)]l  v  )  i  0 


where 

Let  Co.it  be  the  determinant  obtained  froei  G  by  deleting  the  last  k  rows  and  column.  If  the  o  numbers 
fig,  Cj,  ...,  Cg  are  all  positive,  then  f(y)  has  e  local  minimum  at  yQ. 

D  la  tha  aat  of  all  distribution  functions  of  all  discrete  random  variables  with  n  possible 
values  where  n  is  a  given  positive  Integer. 
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S  is  the  set  of  all  step  function  with  n+1  steps  where  a  step  function  with  n+1  steps  (where 
n  is  given)  is  defined  throughout  R  (real  line)  and  is  constant  on  each  one  of  n+1  non¬ 
intersecting  intervals  whose  union  is  R.  This  constant  is  zero  on  the  interval  containing 
and  is  one  on  the  Interval  containing  That  is 


S  A  (g(x):  g(x)  «  0  for  x  <  y^  g(x)  »  PA  for  y1  <  *  <  y1+1>  e  (0,1);  g(x)  «  1  for 
x  i  yn*  yi+l  >  yi*  yi  £  1  "  l»  2 . n’l} 


In  order  to  find  an  optimum  discrete  random  variable  with  n  possible  values  (where  n  is  a  given 
positive  Integer)  that  approximates  an  absolutely  continuous  random  variable  X  wi-:h  distribution 
function  Fx(*) ,  we  must  find  a  distribution  function  Fy.(*)  which  minimize  the  following  objective 
function  over  the  set  D 


Namely, 


j<y-»  a 


[Fx(a)  -  Fy(a)]2  da 


(B.3) 


J(F  (*)) 
y0 


min  J(F  (•)) 
Fy(OtD  y 

,W«S 


(B.4) 

(B.5) 


The  last  equality  follows  from  tha  following  arguments.  Let  a  step  function  gg(*)  e  S  minimize 
(B.3)  over  the  set  S;  since  the  distribution  function  is  nondecreaaing,  gg(’)  must  be  nondecreaalng , 
hence  it  is  a  nondecreaaing  step  function  (from  zero  to  one);  therefore  gg(-)  e  D.  Thus  the  aim  la 
to  find  a  step  function  gg(')  t  S  which  minimize  (B.3)  over  $  •  i.e,,  we  would  like  to  minimize 
tha  following  function  over  y^  e  (-“,*)  and  P^  c  (0,1)  (where  i  •  1,  2,  ...»  n;  J  »  1,  2,  ....  n-1). 


J(S(0>  - 


F*(a)  da  + 


(Fx(a)  -  PjT  da  + 


;y2 


(Fx(.)  -  P2)‘  da  +  ...  + 


<  V*>  ‘  Pn-lJ 


|*  (Fx(*>  -  l)2  da 


Vi 


(8.6) 


It  follows  from  Theorem  B.l  that  if  gg(x)  which  la  defined  by 


'  1  4  yl,0 

**<*>  m  <  Pi,0  U  yl,0  1 «  <  *i+l,0{  1  •  . . . 


z  >  y 


n,0 


(8.7) 


is  a  etep  function  (distribution  function)  which  minimize  (8.6).  This  must  satisfy  the  following 
amt  of  equations 


8 


d* 
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1.0 

‘  "m^l.O1 

+  p 

l.o  *2,0 

’  2Px(y2,0: 

+  P 

2,0  3,0 

■  2Vy3,0: 

Pn-2,0  +  Pn-1,0 


■  2Fx(yn_ijo) 


1  +  P 


n,0 


Pl,0^y2,0  ~  yl,(P 


2F  (y  ) 
x  /n 


2,0 


Fx(a)  da 


rl,Q 


P2,0^y3,0  ~  y2,0^ 


Pn-l,0<yn,0’Vl,0) 


'3,0 


Fx(a)  da 


'2.0 


tv, 


o,0 

;yn-l,0 


Fx<a)  da 


(B.8) 


Ualnt  that*  aquation*  and  Theorem  8.2,  tha  diacrcte  randoa  varltblai  which  approximate  the 
normal  random  varlabla  with  taro  mean  and  unit  variance  have  bean  mmarically  obtained,  and  thay  are 
tabulated  In  Table  8.1. 

Let  yo  be  tha  dlacrata  random  variable  with  n  poaalble  value*  yj  g,  y2tQ,  •  «•»  yn,0  4>ich 
approximate  tha  normal  random  variable  with  taro  mean  and  unit  variance  and 'let  Q  b*  auch  that 

P1.0  "  "  yl,0} 

Let  tg  b*  tha  dlacrata  random  variable  with  n  poeatbl*  value*  0,  t2|o.  »•*»  *n  o  which  approximate* 
the  normal  random  variable  with  naan  u  and  variance  o*  and  let  p’  Q  be'tuch  that 

Pl,0  4  ^rob(«  -  *t  0>  . 

Throujh  tha  aquation*  In  (8.S),  It  can  easily  ba  verified  that 


*1,0  *  0yl.C  +  " 


1,0  ri,Q 


i  *  l*  2*  •**» 


0 


<».*> 


TABLE  B.l,  Discrete  Random  Variables  Approximating  the  Gaussian  Random  Variable 
with  Zero  Mean  and  Unit  Variance 
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absolute  errors  Figure  4. 1 .4(a)  Actual  and  estimated  values  of  states 
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Figure  ♦.  1.4(b)  Error  covariance  and  bound  Figure  4. 1 .4(c)  Absolute  and  average  absolute  errors 
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Figure  4.2.1(c)  Absolute  and  average  absolute  errors  Figure  4.2.2(a)  Actual  and  estimated  values  of  states 
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Figure  4.2.X*}  Actual  and  estimated  values  of  states  Figure  4.2.3(b)  Error  covariance  and  bound 
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Figure  4.2.4<b)  Error  covariance  and  bound  Figure  4.2.4(c)  Absolute  and  average  absolute  errors 
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Figure  4.3.2(b)  Error  oonriinct  Figure  4.3.2(c)  Absolute  and  average  absolute  errors 
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Figure  4.3.3(a)  Actual  and  estimated  values  of  states  Figure  4.3.3(b)  Error  covariance 
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Figure  4.3-3<e>  Absolute  tad  average  absolute  errors  Figure  4.3.4(a)  Actual  and  estimated  values  of  states 
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